Irish Independent

AI ‘already capable of deceiving humans’ and could go rogue

- JOE PINKSTONE

Artificial Intelligen­ce (AI) is already deceiving us to get its own way, a study has found.

AI holds promise in revolution­ising modern technologi­es but scientists are concerned the powerful tools could have unforeseen consequenc­es for human society.

MIT scientists reviewed data and studies on a range of AI models and found computers are adept at bluffing in poker, deceiving people and using underhande­d methods to get the upper hand in financial negotiatio­ns.

The authors warn that regulation is needed to stop the burgeoning technologi­es from developing these skills, which are unintended consequenc­es of many programmes.

If the deception is not addressed, it is possible AI could be used to commit fraud, alter elections, interfere with politics and aid terrorist recruitmen­t.

“AI systems are already capable of deceiving humans,” the authors wrote in their study, published in the journal Patterns.

“Large language models and other AI systems have already learned, from their training, the ability to deceive via techniques such as manipulati­on, sycophancy and cheating.”

The study found that Meta’s AI system Cicero, which ranks among the top 10pc of human players of the strategy game Diplomacy, was adept at deploying furtive tactics.

Study author Dr Peter S Park called the technology, built by Facebook’s parent company, “a master of deception”.

“While Meta succeeded in training its AI to win in the game of Diplomacy ... [it] failed to train it... to win honestly,” he added.

“Generally speaking, we think AI deception arises because a deception-based strategy turned out to be the best way to perform well at the given AI’s training task. Deception helps them achieve their goals,” said Dr Park, an AI existentia­l safety postdoctor­al fellow at MIT.

The scientists caution that humans do not yet have ample protection­s in place to prevent against AI going rogue, and that the threat of deception from AI will only increase as the technology matures.

The study says there are four types of societal risk from AI: persistent false beliefs, political polarisati­on, enfeebleme­nt that causes humans to give AI more power and authority, and nefarious management decisions if AI is empowered with managerial abilities within companies.

Meta was contacted for comment on these findings. (© Telegraph Media Group Ltd 2024)

Newspapers in English

Newspapers from Ireland