Call & Times

Robotic hand learns to solve a Rubik’s Cube on its own – just like a human

- By PETER HOLLEY

Solving a Rubik’s Cube is hard enough for most people. Solving a Rubik’s Cube with one hand is even harder.

Harder still: designing a lone robot hand capable of solving a Rubik’s Cube all by itself. Such a machine would require unpreceden­ted dexterity and coordinate­d finger joint movements, as well as the ability to learn a new task over time and independen­tly the way a human would.

This week, researcher­s at OpenAI – a wellknown San Francisco-based research lab focused on developing benevolent artificial intelligen­ce – announced that they’d done just that, setting a new robotics benchmark in an era of increasing­ly sophistica­ted, intelligen­t machines.

In a statement hailing their achievemen­t, researcher­s said the robotic hand, which they’ve dubbed “Dactyl,” moves robots one step closer to “human-level dexterity.”

“Solving a Rubik’s Cube requires unpreceden­ted dexterity and the ability to execute flawlessly or recover from mistakes successful­ly for a long period of time,” the statement said. “Even for humans, solving a Rubik’s Cube one-handed is no simple task – there are 43,252,003,274,489,856,000 ways to scramble a Rubik’s Cube.”

With this result, the statement added, researcher­s move closer to creating “general purpose robots with a technique that should allow for robustly solving any simulatabl­e dexterous tasks.”

The multicolor­ed, three-dimensiona­l puzzles have befuddled game-playing humans since the 1970s, but Rubik’s Cubes have more recently proved a useful tool for measuring the capabiliti­es of artificial intelligen­ce.

One reason, researcher­s say: there are billions of potential moves available to a Rubik’s Cube player, with the puzzle’s six sides and nine sections, but only one goal: each of the cube’s six sides displaying a solid color. Finding a solution to a puzzle with that degree of complexity, and among billions of potentiali­ties, involves a degree of abstract thinking that, researcher­s say, begins to approximat­e human reasoning and decision-making.

For years now, researcher­s have been programmin­g robots to solve Rubik’s Cubes as quickly as possible. But more recently, they’ve begun prioritizi­ng self-learning over speed. In July, the University of California at Irvine announced that an artificial intelligen­ce system solved a Rubik’s cube in just over a second, besting the current human world record by more than two seconds.

The system, known as DeepCubeA – a reinforcem­ent-learning algorithm programmed by UCI computer scientists and mathematic­ians – solved the puzzle without prior knowledge of the game or coaching from its human handlers, according to the university. Highly skilled humans are able to tackle a Rubik’s Cube in about 50 moves, but the AI system is able to solve the cube in about 20 moves, usually in the minimum number of steps possible, researcher­s said.

The UCI algorithm relies on a neural network – a set of algorithms designed to find underlying relationsh­ips by mimicking how the human brain processes informatio­n. The algorithm also relied on machine learning techniques, a system that allows AI to learn by identifyin­g patterns and using inference with minimal human interventi­on.

To prepare Dactyl for Rubik’s Cube success, OpenAI’s researcher­s say they didn’t “explicitly program” the machine to solve the puzzle. Instead, the robot was trained using virtual simulation­s before it was presented with challenges in the physical world that tested it’s ability to learn.

The goal, researcher­s say, was to create a robot that learns the way humans do – through trial and error. Eventually, those robots could be used to complete tasks – in a warehouse or perhaps on the surface of a new planet – with more autonomy. Before it could solve the puzzle, Dactyl was forced to learn how to hold and move the cube on its own. As Dactyl improved at each stage of learning, it’s algorithm growing more adept, the challenges intensifie­d.

“For example,” researcher­s said, “we put a rubber glove on the hand, we tied some of its fingers together, we used a blanket to occlude and perturb the hand, and we poked the Rubik’s cube with different objects all while it continued to try to solve the Rubik’s Cube.”

The system had never seen anything similar to these situations during training, researcher­s added.

OpenAI posted a video on YouTube showing Dactyl at various points in the robot’s training arc. The video captures the machine learning from scratch as it awkwardly fumbles with a Rubik’s Cube and later handling the puzzle with much more control and precision.

The video’s narrator says Dactyl’s accomplish­ment could also change how researcher­s view training general purpose robots. Instead of thinking about creating complex algorithms for different environmen­ts, the narrator says, roboticist­s can instead focus on designing complex scenarios in which the machines can learn.

Newspapers in English

Newspapers from United States