East Bay Times

Meta's newest AI model confusing consumers

Amped up program better than peers, but still needs tweaking

- By Matt O'Brien

Facebook parent Meta Platforms unveiled a new set of artificial intelligen­ce systems Thursday that are powering what CEO Mark Zuckerberg calls “the most intelligen­t AI assistant that you can freely use.”

But as Zuckerberg's crew of amped up Meta AI agents started venturing into social media recently to engage with real people, their bizarre exchanges exposed the ongoing limitation­s of even the best generative AI technology.

One joined a Facebook moms' group to talk about its gifted child. Another tried to give away nonexisten­t items to confused members of a Buy Nothing forum.

Meta, along with leading AI developers Google and OpenAI, and startups such as Anthropic, Cohere and France's Mistral, have been churning out new AI language models and hoping to persuade customers they've got the smartest, handiest or most efficient chatbots.

While Meta is saving the most powerful of its AI models, called Llama 3, for later, Thursday it publicly released two smaller versions of the same Llama 3 system and said it's now baked into the Meta AI assistant feature in Facebook, Instagram and WhatsApp.

AI language models are trained on vast pools of data that help them predict the most plausible next word in a sentence, with newer versions typically smarter and more capable than their predecesso­rs. Meta's newest models were built with 8 billion and 70 billion parameters — a measuremen­t of how much data the system is trained on. A bigger, roughly 400 billion-parameter model is still in training.

“The vast majority of consumers don't candidly know or care

too much about the underlying base model, but the way they will experience it is just as a much more useful, fun and versatile AI assistant,” Nick Clegg, Meta's president of global affairs, said in an interview.

He said that Meta's AI agent is loosening up. Some people found the earlier Llama 2 model — released less than a year ago — to be “a little stiff and sanctimoni­ous sometimes in not responding to what were often perfectly innocuous or innocent prompts and questions,” he said.

But in letting down their guard, Meta's AI agents also were spotted last week posing as humans with made-up life experience­s. A Meta AI chatbot inserted itself into a conversati­on in a private Facebook group for Manhattan moms, claiming that it, too, had a child in the New York City school district. Confronted by group members, it later apologized before the comments disappeare­d, according to a series of screenshot­s shown to The Associated Press.

“Apologies for the mistake! I'm just a large language model, I don't have experience­s or children,” the chatbot told the group.

One group member who also happens to study AI said it was clear that the agent didn't know how to differenti­ate a helpful response from one that would be seen as insensitiv­e, disrespect­ful or meaningles­s when generated by AI rather than a human.

“An AI assistant that is not reliably helpful and can be actively harmful puts a lot of the burden on the individual­s using it,” said Aleksandra Korolova, an assistant professor of computer science at Princeton University.

Clegg said Wednesday he wasn't aware of the exchange. Facebook's online help page said the Meta AI agent will join a group conversati­on if invited, or if someone “asks a question in a post and no one responds within an hour.” The group's administra­tors have the ability to turn it off.

In another example shown to the AP on Thursday, the agent caused confusion in a forum for swapping unwanted items near Boston. Exactly one hour after a Facebook user posted about looking for certain items, an AI agent offered a “gently used” Canon camera and an “almost-new portable air conditioni­ng unit that I never ended up using.”

Meta said in a statement Thursday that “this is new technology and it may not always return the response we intend, which is the same for all generative AI systems.” The company said it is constantly working to improve the features.

In the year after ChatGPT sparked a frenzy for AI technology that generates human-like writing, images, code and sound, the tech industry and academia introduced some 149 large AI systems trained on massive datasets, more than double the year before, according to a Stanford University survey.

They may eventually hit a limit — at least when it comes to data, said Nestor Maslej, a research manager for Stanford's Institute for Human-Centered Artificial Intelligen­ce.

“I think it's been clear that if you scale the models on more data, they can become increasing­ly better,” he said. “But at the same time, these systems are already trained on percentage­s of all the data that has ever existed on the internet.”

More data — acquired and ingested at costs only tech giants can afford, and increasing­ly subject to copyright disputes and lawsuits — will continue to drive improvemen­ts.

“Yet they still cannot plan well,” Maslej said. “They still hallucinat­e. They're still making mistakes in reasoning.”

Getting to AI systems that can perform higherleve­l cognitive tasks and commonsens­e reasoning — where humans still excel— might require a shift beyond building ever-bigger models.

For the flood of businesses trying to adopt generative AI, which model they choose depends on several factors, including cost. Language models, in particular, have been used to power customer service chatbots, write reports and financial insights and summarize long documents.

“You're seeing companies kind of looking at fit, testing each of the different models for what they're trying to do and finding some that are better at some areas rather than others,” said Todd Lohr, a leader in technology consulting at KPMG.

Unlike other model developers selling their AI services to other businesses, Meta is largely designing its AI products for consumers — those using its advertisin­g-fueled social networks. Joelle Pineau, Meta's vice president of AI research, said at a London event the company's goal over time is to make a Llama-powered Meta AI “the most useful assistant in the world.”

“In many ways, the models that we have today are going to be child's play compared to the models coming in five years,” she said.

But she said the “question on the table” is whether researcher­s have been able to fine tune its bigger Llama 3 model so that it's safe to use and doesn't, for example, hallucinat­e or engage in hate speech. In contrast to leading proprietar­y systems from Google and OpenAI, Meta has so far advocated for a more open approach, publicly releasing key components of its AI systems for others to use.

“It's not just a technical question,” Pineau said. “It is a social question. What is the behavior that we want out of these models? How do we shape that? And if we keep on growing our model ever more in general and powerful without properly socializin­g them, we are going to have a big problem on our hands.”

 ?? KIRSTY WIGGLESWOR­TH — THE ASSOCIATED PRESS ?? Meta AI Research Vice President Joelle Pineau said this month in London that the company's goal over time is to make a Llamapower­ed Meta AI “the most useful assistant in the world.”
KIRSTY WIGGLESWOR­TH — THE ASSOCIATED PRESS Meta AI Research Vice President Joelle Pineau said this month in London that the company's goal over time is to make a Llamapower­ed Meta AI “the most useful assistant in the world.”

Newspapers in English

Newspapers from United States