Pong-in-a-Dish
Ever hear of the Turk-the 19th-century mechanism topped by a turbaned head that played chess against all comers? In fact, hidden inside was a diminutive chessmaster, one you might imagine deadpanning, Eh, It's a living."
Then there's its namesake, the Mechanical Turk-a 21st-century service offered by Amazon to mark up images on the Web with the help of crowdsourced freelancers. They, too, might intone, glassy-eyed, It's a living."
Now we have a kind of Biological Turk. A mass of neurons act as a computer that mimics a human being playing the classic computer game Pong. The neurons, some taken from mouse embryos, others grown from human precursor cells, spread out into a one-layer, 800,000-cell mesh called a biological neural network, which lives in a giant petri dish called the DishBrain. There it interfaces with arrays of electrodes that form an interface to silicon hardware. Software mounted on that hardware provides stimulation and feedback, and the minibrain learns how to control a paddle on a simulated ping-pong table.
The work was described recently in the journal Neuron by Brett Kagan, the chief scientific officer of Cortical Labs, a startup in Melbourne, Australia, and nine colleagues at that company.
The authors talk hopefully about the emergence of sentience, a notion that other brain-in-a-dish researchers have also recently floated. But they seem to stand on solid ground when they say their method will help to advance brain science, on the one hand, and computer science, on the other. A bio-neuro-network might model the effects of drugs on the brain in ways that single-cell neurons can't. Also, neurons may show themselves to be more than just protoplasmic logic switches but more like entire computers.
The question before us, though, is how does the thing play Pong?
Pong-in-a-Dishyoutu.be
First, the electronic scaffolding hits the minibrain with electrical signals that represent the position and movement of the virtual ball. It's rather like the action potential that a firing neuron would use to convey, say, a sensory signal from the eye to the brain. Because the electrodes are placed at different points in the cell network, the system physically represents the different possible locations. Further information comes from the frequency of the signals, which varies with the distance of the ball to the virtual paddle.
The network responds to these stimuli like a motor neuron, sending out a signal that moves the virtual paddle. If the resulting movement causes the ball to bounce, the neural network gets a reward." Failure results in a signal that has the opposite effect.
Reward" is put in sneer quotes because these cells don't have feelings. They can't experience the joy of victory, the agony of defeat. There's no dopamine, no salted popcorn. Instead, the researchers say, the network is working to minimize unpredictability. In this view, the so-called reward is a predictable signal, the anti-reward is an unpredictable one.
Kagan tells IEEE Spectrum that the system as a whole then reorganizes to become better at playing the game. The most marked improvement came in the first five minutes of play.
It seems amazing that a mere 800,000 neurons can model the world, even a simplified world. But, Kagan says, such feats are seen in nature. "Flies have even fewer neurons but must be able to do some modeling-although perhaps not in a way a human may-to navigate a complex and changing 3D world," he says.
As he and his colleagues point out in their report the ability of neurons to adapt to external stimuli is well established in vivo; it forms the basis for all animal learning. But theirs, they say, is the first in vitro demonstration involving a goal-directed behavior.
The current version of Pong is forgiving. The paddle is broad, the volley slow, the ball unspinning. Even a neophyte would crush DishBrain. Then again, the same was true of all of AI's early assays in game playing.
The early chess machines would sometimes senselessly give up first a pawn, then a piece, then the queen-all because they were attempting to put off a disagreeable action to a point beyond the built-in planning horizon. Poker-playing programs got good pretty fast, but the early ones sometimes played too well-that is, too cautiously-against weak human opponents, which reduced their winnings. Car navigation programs would send you into a vacant lot.
You might think that just getting a machine to play a decent game is the hard part, and that further improving it to perfection ought to be a snap. Edgar Allan Poe made that judgement when he called the Turk a fraud because it occasionally erred. His conclusion was correct but his reasoning was faulty.
It's not easy turning a barely there machine into a world champion at chess or Go. And yet it has been done.