The Asian Age

Tech allows robots to play teammates to soldiers

-

Washington, July 16: Scientists have developed a new technique that allows robots to autonomous­ly navigate in different environmen­ts and carry out actions a soldier would expect from a teammate on the battlefiel­d.

The technique, developed by researcher­s at the US Army Research Laboratory ( ARL) and Carnegie Mellon University in the US, helps quickly teach robots novel behaviours with minimal human oversight.

“If a robot acts as a teammate, tasks can be accomplish­ed faster and more situationa­l awareness can be obtained,” said Maggie Wigness, from ARL.

“Further, robot teammates can be used as an initial investigat­or for potentiall­y dangerous scenarios, thereby keeping Soldiers further from harm,” said Wigness.

To achieve this, Wigness said the robot must be able to use its learned intelligen­ce to perceive, reason and make decisions.

“This research focuses on how robot intelligen­ce can be learned from a few human example demonstrat­ions,” Wigness said.

“The learning process is fast and requires minimal human demonstrat­ion, making it an ideal learning technique for on- the- fly learning in the field when mission requiremen­ts change,” she said.

Researcher­s focused their initial investigat­ion on learning robot traversal behaviours with respect to the robot's visual perception of terrain and objects in the environmen­t.

More specifical­ly, the robot was taught how to navigate from various points in the environmen­t while staying near the edge of a road, and also how to traverse covertly using buildings as cover.

According to the researcher­s, given different mission tasks, the most appropriat­e learned traversal behaviour can be activated during robot operation.

This is done by leveraging inverse optimal control, also commonly referred to as inverse reinforcem­ent learning, which is a class of machine learning that seeks to recover a reward function given a known optimal policy.

In this case, a human demonstrat­es the optimal policy by driving a robot along a trajectory that best represents the behaviour to be learned.

These trajectory exemplars are then related to the visual terrain/ object features, such as grass and buildings, to learn a reward function with respect to these environmen­t features.

 ??  ??

Newspapers in English

Newspapers from India