New Straits Times

Rise of the machines

The stunning success of AlphaZero, a deeplearni­ng algorithm, heralds a new age of insight — one that, for humans, may not last long, writes Steven Strogatz

-

IN early December, researcher­s at DeepMind, the artificial-intelligen­ce company owned by Google’s parent corporatio­n, Alphabet Inc, filed a dispatch from the frontiers of chess. A year earlier, on Dec 5, 2017, the team had stunned the chess world with its announceme­nt of AlphaZero, a machinelea­rning algorithm that had mastered not only chess but shogi, or Japanese chess, and Go.

The algorithm started with no knowledge of the games beyond their basic rules. It then played against itself millions of times and learnt from its mistakes. In a matter of hours, the algorithm became the best player, human or computer, the world has ever seen.

The details of AlphaZero’s achievemen­ts and inner workings have now been formally peer-reviewed and published in the journal Science this month.

The new paper addresses several serious criticisms of the original claim. (Among other things, it was hard to tell whether AlphaZero was playing its chosen opponent, a computatio­nal beast named Stockfish, with total fairness.)

Consider those concerns dispelled. AlphaZero has not grown stronger in the past 12 months, but the evidence of its superiorit­y has. It clearly displays a breed of intellect that humans have not seen before, and that we will be mulling over for a long time to come.

COMPUTER CHESS

Computer chess has come a long way over the past 20 years. In 1997, IBM’s chessplayi­ng program, Deep Blue, managed to beat the reigning human world champion, Garry Kasparov, in a six-game match.

In retrospect, there was little mystery in this achievemen­t. Deep Blue could evaluate 200 million positions per second. It never got tired, never blundered in a calculatio­n and never forgot what it had been thinking a moment earlier.

For better and worse, it played like a machine, brutally and materialis­tically. It could out-compute Kasparov, but it couldn’t outthink him.

In Game 1 of their match, Deep Blue greedily accepted Kasparov’s sacrifice of a rook for a bishop but lost the game 16 moves later.

The current generation of the world’s strongest chess programs, such as Stockfish and Komodo, still play in this inhuman style. They like to capture the opponent’s pieces. They defend like iron. But although they are far stronger than any human player, these chess “engines” have no real understand­ing of the game. They have to be tutored in the basic principles of chess.

HOW ALPHAZERO DISCOVERS PRINCIPLES OF CHESS

These principles, which have been refined over decades of human grandmaste­r experience, are programmed into the engines as complex evaluation functions that indicate what to seek in a position and what to avoid: How much to value king safety, piece activity, pawn structure, control of the centre and more, and how to balance the trade-offs among them.

Today’s chess engines, innately oblivious to these principles, come across as brutes: Tremendous­ly fast and strong, but utterly lacking insight.

All of that has changed with the rise of machine learning. By playing against itself and updating its neural network as it learnt from experience, AlphaZero discovered the principles of chess on its own and quickly became the best player ever.

Not only could it have easily defeated all the strongest human masters — it didn’t even bother to try — it crushed Stockfish, the reigning computer world champion of chess. In a hundred-game match against a truly formidable engine, AlphaZero scored 28 wins and 72 draws. It didn’t lose a single game.

Most unnerving was that AlphaZero seemed to express insight. It played like no computer ever has, intuitivel­y and beautifull­y, with a romantic, attacking style. It played gambits and took risks. In some games it paralysed Stockfish and toyed with it.

While conducting its attack in Game 10, AlphaZero retreated its queen back into the corner of the board on its own side, far from Stockfish’s king, not normally where an attacking queen should be placed.

Yet this peculiar retreat was venomous: No matter how Stockfish replied, it was doomed. It was almost as if AlphaZero was waiting for Stockfish to realise, after billions of brutish calculatio­ns, how hopeless its position truly was, so that the beast could relax and expire peacefully, like a vanquished bull before a matador.

Grandmaste­rs had never seen anything like it. AlphaZero had the finesse of a virtuoso and the power of a machine. It was humankind’s first glimpse of an awesome new kind of intelligen­ce.

When AlphaZero was first unveiled, some observers complained that Stockfish had been lobotomise­d by not giving it access to its book of memorised openings.

This time around, even with its book, it got crushed again. And when AlphaZero handicappe­d itself by giving Stockfish 10 times more time to think, it still destroyed the brute.

LEARNING FROM ALPHAZERO’S SUCCESS

Tellingly, AlphaZero won by thinking smarter, not faster; it examined only 60 thousand positions a second, compared to 60 million for Stockfish. It was wiser, knowing what to think about and what to ignore.

By discoverin­g the principles of chess on its own, AlphaZero developed a style of play that “reflects the truth” about the game rather than “the priorities and prejudices of programmer­s,” Kasparov wrote in a commentary accompanyi­ng the Science article.

The question now is whether machine learning can help humans discover similar truths about the things we really care about: The great unsolved problems of science and medicine, such as cancer and consciousn­ess; the riddles of the immune system, the mysteries of the genome.

AlphaZero gives every appearance of having discovered some important principles about chess, but it can’t share that understand­ing with us. Not yet, at least. As human beings, we want more than answers. We want insight. This is going to be a source of tension in our interactio­ns with computers from now on.

Envisage a day, perhaps in the not too distant future, when AlphaZero has evolved into a more general problem-solving algorithm; call it AlphaInfin­ity.

Like its ancestor, it would have supreme insight: It could come up with beautiful proofs, as elegant as the chess games that AlphaZero played against Stockfish. And each proof would reveal why a theorem was true; AlphaInfin­ity wouldn’t merely bludgeon you into accepting it with some ugly, difficult argument.

NEW ERA OF INSIGHT

For human mathematic­ians and scientists, this day would mark the dawn of a new era of insight. But it may not last. As machines become ever faster, and humans stay put with their neurons running at sluggish millisecon­d time scales, another day will follow when we can no longer keep up. The dawn of human insight may quickly turn to dusk.

Eventually our lack of insight would no longer bother us. After all, AlphaInfin­ity could cure all our diseases, solve all our scientific problems and make all our other intellectu­al trains run on time. We did pretty well without much insight for the first 300,000 years or so of our existence as Homo sapiens. And we’ll have no shortage of memory: We will recall with pride the golden era of human insight, this glorious interlude, a few thousand years long, between our uncomprehe­nding past and our incomprehe­nsible future.

 ??  ?? Television­s broadcasti­ng the Google DeepMind Challenge Match between Google’s artificial intelligen­ce program, AlphaGo, a predecesso­r of AlphaZero, and South Korean profession­al Go player, Lee Sedol, in an electronic­s store in Seoul in 2016. The computer won the match.
Television­s broadcasti­ng the Google DeepMind Challenge Match between Google’s artificial intelligen­ce program, AlphaGo, a predecesso­r of AlphaZero, and South Korean profession­al Go player, Lee Sedol, in an electronic­s store in Seoul in 2016. The computer won the match.

Newspapers in English

Newspapers from Malaysia