Google updates Bard chatbot
SAN FRANCISCO — For more than a year, Google has raced to build technology that could match ChatGPT, the eyeopening chatbot offered by the San Francisco artificial intelligence startup OpenAI.
On Wednesday, the tech giant took another step in the ongoing race, releasing a new version of its own chatbot, Google Bard. Available to English speakers in more than 170 territories and countries, including the United States, beginning immediately, the updated bot is underpinned by new AI technology called Gemini, which the company has been developing since the start of the year.
“This is the beginning of the Gemini era,” Sundar Pichai, Google’s CEO, said in an interview. “It’s the realization of the vision we had when we set up Google DeepMind,” the company’s AI lab.
Pichai and Demis Hassabis, who oversees Google DeepMind, said Gemini was more powerful than Google’s previous chatbot technologies and that it could generate more accurate responses and come closer to mimicking human reasoning in some situations.
When OpenAI wowed the world with the AI chatbot ChatGPT late last year, Google was caught flat-footed. The tech giant had spent years developing similar technology, but like other tech giants — most notably, Meta — it was reluctant to release a technology that could generate biased, false, or otherwise toxic information.
In March, Google released its own chatbot, Bard, to middling reviews. A month later, the company announced that it had combined its two AI labs — Google Brain and DeepMind — bringing together more than 2,000 researchers and engineers. And in May, at its flagship Google I/O conference, it announced that the new Google DeepMind lab had started developing Gemini.
After founding the Brain lab in 2011, Google acquired DeepMind in 2014, paying $650 million for the London AI startup. DeepMind operated largely independently of the Brain lab and the rest of Google for a decade. But as Google struggled to catch up with OpenAI, Pichai combined the two labs under Hassabis, a neuroscientist who cofounded DeepMind.
Google released benchmark test results claiming that Gemini’s most powerful version outperformed OpenAI’s latest technology, GPT-4, in several key areas. It is better at generating computer code than previous Google technologies, Pichai said, and it can more accurately summarize news articles and other text documents.
Gemini was also designed to analyze images and sounds, but those skills will not be rolled into the Bard chatbot until a later date.
Google has built three versions of Gemini with three different sets of skills. The largest, Ultra, is designed to tackle complex tasks and will debut next year. Pro, the midtier offering, will be rolled out to numerous Google services, starting Wednesday, with the Bard chatbot. Nano, the smallest version, will power some features on the Pixel 8 Pro smartphone, such as summarizing audio recordings and offering suggested text responses in WhatsApp starting Wednesday.
Gemini is what scientists call a large language model, or LLM, a complex mathematical system that can learn skills by analyzing vast amounts of data, including digital books, Wikipedia articles, and online bulletin boards. By identifying patterns in all of that text, an LLM learns to generate text on its own. That means it can write term papers, generate computer code, and even carry on a conversation.
With Gemini, Google has also trained the technology on digital images and sounds. It is what researchers call a “multimodal” system, meaning it can analyze and respond to both images and sounds. If you give it a math problem that includes lines, shapes and other images, for example, it can answer in much the way a high school student would.
That portion of the technology, however, will not be available to consumers until sometime next year. Google also acknowledged that like similar systems, Gemini is prone to mistakes. It can get facts wrong or even “hallucinate” — make stuff up.