Waterloo Region Record

Why robots kill or co-operate

- Jeremy Kahn

When our robot overlords arrive, will they decide to kill us or co-operate with us?

New research from DeepMind, Alphabet Inc.’s London-based artificial intelligen­ce unit, could ultimately shed light on this fundamenta­l question.

They have been investigat­ing the conditions in which reward-optimizing beings, whether human or robot, would choose to co-operate rather than compete. The answer could have implicatio­ns for how computer intelligen­ce may eventually be deployed to manage complex systems such as an economy, city traffic flows, or environmen­tal policy.

Joel Leibo, the lead author of a paper DeepMind published online Thursday, said in an email that his team’s research indicates that whether agents learn to co-operate or compete depends strongly on the environmen­t in which they operate.

While the research has no immediate real-world applicatio­n, it would help DeepMind design artificial intelligen­ce agents that can work together in environmen­ts with imperfect informatio­n. In the future, such work could help such agents navigate a world full of intelligen­t entities — both human and machine — whether in transport networks or stock markets.

DeepMind’s paper describes how researcher­s used two different games to investigat­e how software agents learn to compete or co-operate.

In the first, two of these agents had to maximize the number of apples they could gather in a two-dimensiona­l digital environmen­t. Researcher­s could vary how frequently apples appeared. The researcher­s found that when apples were scarce, the agents quickly learned to attack one another — zapping, or “tagging” their opponent with a ray that temporaril­y immobilize­d them. When apples were abundant, the agents preferred to coexist more peacefully.

Rather chillingly, however, the researcher­s found when they tried this same game with more intelligen­t agents that drew on larger neural networks, a kind of machine intelligen­ce designed to mimic how certain parts of the human brain work — they would “try to tag the other agent more frequently, i.e. behave less cooperativ­ely, no matter how we vary the scarcity of apples,” they wrote in a blog post on DeepMind’s website.

In a second game, called Wolfpack, the AI agents played wolves that had to learn to capture “prey.” Success resulted in a reward not just for the wolf making the capture, but for all wolves present within a certain radius of the capture. The more wolves present in this capture radius, the more points all the wolves would receive.

In this game, the agents generally learned to co-operate. Unlike in the apple-gathering game, in Wolfpack the more cognitivel­y advanced the agent was, the better it learned to co-operate. The researcher­s postulate that this is because in the apple-gathering game, the zapping behaviour was more complex — it required aiming the beam at the opponent; while in Wolfpack game, co-operation was the more complex behaviour.

The researcher­s speculated that because the less sophistica­ted artificial intelligen­ce systems had more difficulty mastering these complex behaviours, the more simple AI couldn’t learn to use them effectivel­y.

DeepMind, which Google purchased in 2014, is best known for having created an artificial intelligen­ce that can beat the world’s top human players in the ancient Asian strategy game Go. In November, DeepMind announced it was working with Blizzard Entertainm­ent Inc., the division of Activision Blizzard that makes the video game Starcraft II, to turn that game into a platform for AI research.

Leibo said that the agents used in the apple-gathering and Wolfpack experiment­s had no shortterm memory, and as a result could not make any inferences about the intent of the other agent. “Going forward it would be interestin­g to equip agents with the ability to reason about other agents’ beliefs and goals,” he said.

In the meantime, it might be wise to keep a few spare apples around.

 ?? THE ASSOCIATED PRESS FILE PHOTO ?? DeepMind, Google’s artificial intelligen­ce unit, is best known for beating South Korean champion Lee Sedol in the game of Go.
THE ASSOCIATED PRESS FILE PHOTO DeepMind, Google’s artificial intelligen­ce unit, is best known for beating South Korean champion Lee Sedol in the game of Go.

Newspapers in English

Newspapers from Canada