Google’s translation service adds non-European tongues
LIMA, Peru — About 10 million people speak Quechua, but trying to automatically translate emails and text messages into the most widely spoken Indigenous language family in the Americas was long all but impossible.
That changed this month, when Google added Quechua and a variety of other languages to its digital translation service.
The internet giant says new artificial intelligence technology is enabling it to vastly expand Google Translate’s repertoire of the world’s languages. It added 24 of them last week, including Quechua and other Indigenous South American languages such as Guarani and Aymara. It is also adding a number of widely spoken African and South Asian languages that have been missing from popular tech products.
“We looked at languages with very large, underserved populations,” Google research scientist Isaac Caswell told reporters.
Quechua was the lingua franca of the Inca Empire, which stretched from what is now southern Colombia to central Chile. Its status began to decline following the Spanish conquest of Peru more than 400 years ago.
Adding it to the languages recognized by Google is a big victory for Quechua language activists like Luis Illaccanqui, a Peruvian who created the website Qichwa 2.0, which includes dictionaries and resources for learning the language.
Illaccanqui said the translator will also help keep the language alive with a new generation of young people and teenagers, “who speak Quechua and Spanish at the same time and are fascinated by social networks.”
Caswell called the news a “very big technological step forward” because until recently, it was not possible to add languages if researchers couldn’t find a big enough trove of online text — such as digital books, newspapers or social media posts — for their AI systems to learn from.
U.S. tech giants don’t have a great track record of making their language technology work well outside the wealthiest markets, a problem that’s also made it harder for them to detect dangerous misinformation on their platforms. Until this week, Google Translate was offered in European languages such as Frisian, Maltese, Icelandic and Corsican — each with fewer than 1 million speakers — but not East African languages like Oromo and Tigrinya, which have millions of speakers.
The new languages won’t yet be understood by Google’s voice assistant, which limits them to textto-text translations for now. Google said it is working on adding speech recognition and other capabilities, such as translating a sign by pointing a camera at it.
That will be important for largely spoken languages like Quechua, especially in the health field, because many Peruvian doctors and nurses who speak only Spanish “are unable to understand patients who speak mostly Quechua,” Illaccanqui said.
“The next frontier, or challenge, is to work on speech,” said Arturo Oncevay, a Peruvian machine translation researcher at the University of Edinburgh who co-founded a research coalition to improve Indigenous language technology in the Americas. “The native languages of the Americas are traditionally oral.”
Harvard University language scholar Americo Mendoza-Mori, who speaks Quechua, said getting Google’s attention brings some needed visibility to the language in places like Peru, where Quechua speakers are still lacking in many public services. The survival of many of these languages “will depend on their use in digital contexts,” he said.