EuroNews (English)

OpenAI unveils AI voice cloning tech that only needs a 15-second sample to work

- Pascale Davies

OpenAI has made its artificial intelligence (AI) even more humanly eerie with a text-to-voice tool that generates natural speech from a 15-second clip of someone’s voice to sound like the original speaker.

But even OpenAI is wary of the potential misuse of the technology and says it will not release Voice Engine publicly; it is currently available only to early testers.

“We recognise that generating speech that resembles people’s voices has serious risks, which are especially top of mind in an election year,” the San Francisco-based company said in a statement.

Voice cloning AI technology is not new and has already been used under concerning circumstances.

Ahead of the primary vote in the United States in January, AI-generated robocalls mimicking President Joe Biden were sent to thousands of voters telling them to stay at home and abstain from voting.


The US Federal Communications Commission (FCC), as a result, banned AI-generated robocalls last month.

But it is not just elections that can be affected by voice cloning technology or deepfakes. Extortion scams that use AI to impersonate people are also a growing concern.

But it can also be used for good. OpenAI has shown how the technology is helping patients who suffer from sudden or degenerative speech conditions by restoring their voice with videos or audio materials from before they lost the ability to speak.

OpenAI said another use case is giving people who cannot speak, or who have difficulty speaking, a voice that does not sound like a robot.

“These small scale deployments are helping to inform our approach, safeguards, and thinking about how Voice Engine could be used for good across various industries,” OpenAI said in its blog post.

Voice Engine is so far available only to several of OpenAI’s partners, which the company said have agreed to its usage policies that prohibit the impersonation of another individual or organisation without consent.

Companies with access to Voice Engine include the education technology company Age of Learning, the visual storytelling platform HeyGen, and the health system Lifespan.

OpenAI said another safety measure is watermarking to trace the origin of any audio generated by Voice Engine; it also requires the partners to get the “explicit and informed consent” of the original speaker.

“We believe that any broad deployment of synthetic voice technology should be accompanied by voice authentication experiences that verify that the original speaker is knowingly adding their voice to the service and a no-go voice list that detects and prevents the creation of voices that are too similar to prominent figures,” OpenAI said.

The OpenAI logo is seen displayed on a cell phone with an image on a computer screen generated by ChatGPT's Dall-E text-to-image model, Friday, Dec. 8, 2023, in Boston.
