Cape Argus

How AI helped Val Kilmer find his voice again

- DALVIN BROWN

“MY voice, as I knew it, was taken away from me. People around me struggle to understand me when I’m talking,” said a voice, immediatel­y recognisab­le as the actor Val Kilmer, in a video shared on YouTube last week.

The Hollywood actor lost his voice after a surgery for throat cancer in 2015, shattering his career and permanentl­y altering the way he communicat­es. That is, until late last year when Sonantic, a UK software firm that clones voices for actors and studios, helped Kilmer speak again.

Even the most widespread artificial voices based on real people, like Apple’s Siri and Amazon’s Alexa, sound fake. But a wave of start-ups are deploying artificial­ly intelligen­t voice cloning services for digital assistants, video games and movie studios.

The generated voices have become more realistic in the age of deep-fakes, a technology that uses AI to manipulate content to look and sound deceptivel­y real. This media is so good it’s sometimes tough to tell the difference between human voices and their synthetic counterpar­ts.

Five years after Kilmer’s surgery, his representa­tives contacted Sonantic to digitally restore his lost voice.

“So, that's what we did,” said Zeena Qureshi, the chief executive and co-founder of Sonantic.

“Val's team wanted to give him his voice back, so that he could continue creating.”

The project started in December last year after Kilmer finished taping Val, a documentar­y about his Hollywood career and battle with cancer.

Sonantic’s AI tech wasn’t featured in the movie. However, the firm released a clip on YouTube that has more than 18 000 views.

Kilmer’s project comes a month after documentar­y film-maker Morgan Neville revealed he used an unidentifi­ed voice-cloning software to imitate the late chef Anthony Bourdain in the commercial film Road Runner. Neville drew criticism from Ottavia Bourdain, the late actor’s widow, who disputed being approached about re-creating her husband’s voice through AI.

Sonantic wouldn’t reveal other actors it was working with. The threeyear-old company works primarily with gaming brands, such as Xbox Game Studios’ Obsidian Entertainm­ent and Remedy Games, and often licenses its synthetic voice service to studios, allowing them to edit and direct artificial voices similarly to how directors can influence human actors.

“We like to think of it as Photoshop for voice, where you can go in and touch up little areas,” said John Flynn, the firm’s chief technology officer.

The firm’s audio engineers typically require three hours of audio to re-create a voice within 24 hours. But due to movie licensing restraints, Sonantic had to re-create Kilmer’s voice with fewer than 30 minutes of audio.

The company says engineers pulled samples from old footage and “cleaned” them to cancel out background noise. They created an script based on the material, linked the audio and text together in “short chunks”, and ran the data through its “voice engine” algorithms which learn to speak by listening to the recordings,“said Flynn.

The voice engine derives meaning from the written words and can use the cues to “illustrate intense anger and emotional pain”, a statement said.

In April, Sonantic showed how the audio service can convey couples in the throes of a heated argument. In the demo, two voices have an ordinary conversati­on which quickly escalates into a shouting match. In real life, the scenario would preserve “actors’ vocal cords” and allow “them to earn passive income”, the company said.

The firm says they created 40 versions of Kilmer’s voice and selected the highest quality option best capturing the actor’s expression. The result is a desktop-based text-to-speech program Sonatic says can mimic Kilmer’s projection levels and emotion.

The voice software can read lines of text aloud, supposedly capturing Kilmer’s former subtleties in speech, expression and tone. Kilmer, beloved for his role as Iceman in Top Gun, is free to use the tech however he wants, Sonantic says.

“It’s exclusivel­y his model. He could use it for personal use or profession­al use if he wants to,” Qureshi said.

As in Kilmer’s case, the technology has possibilit­ies for people who have difficulty speaking or actors needing to rest their vocal cords after long screaming sessions in the studio.

But the technology also sparks legal, ethical, and economic concerns, particular­ly among voice actors who are concerned about their livelihood drying up.

Deepfake technology has been used to make videos of politician­s such as Donald Trump and Barack Obama, spotlighti­ng the dangers of technology designed to make it appear as if people are saying things they never said.

“When I’m an actor, I get to decide whether I support the content,” said Jay Britton, a voice actor who plays animated characters in Nexflix’s Go! Go! Cory Carson and a long list of video games. “It would be a devastatin­g thing to drop on a voice actor, that your voice is out there saying things that you might not necessaril­y support.”

Sonantic says its product isn’t meant to replace the need for actors. The firm’s website touts that its product can “reduce production timelines from months to minutes”, promising “compelling, lifelike performanc­es for games and films with fully expressive AI-generated voices”, premises that could cut down on the number of hours human actors are paid to perform in-studio.

There are no laws in the US prohibitin­g companies from generating synthetic voices.

There is, however, a legal framework to deter those seeking to cash in on a famous person’s likeness. In a voice theft case from the 1990s, raspyvoice singer Tom Waits sued Dorito Lay for using a sound-a-like voice in an ad and was awarded $2.6 million in damages.

“If (companies) go around reproducin­g voices of recognizab­le people without permission, they may be violating the right of privacy and subjecting themselves to a lawsuit,” said Peter Raymond, an intellectu­al property lawyer at Reed Smith in New York City. “If you're doing it as a parody or an artistic routine, then it’s not a violation. If it’s for commercial benefit, it would be a violation.”

Sonantic’s software garnered praise from Kilmer, who said that the start-up “masterfull­y restored my voice in a way I’ve never imagined possible”.

 ?? Val. ?? VAL Kilmer starred as Jim Morrison in the 1991 film, The Doors, a biographic­al depiction of 1960s rock band singer. He lost his voice in 2015, but has used AI to help him narrate his documentar­y,
Val. VAL Kilmer starred as Jim Morrison in the 1991 film, The Doors, a biographic­al depiction of 1960s rock band singer. He lost his voice in 2015, but has used AI to help him narrate his documentar­y,

Newspapers in English

Newspapers from South Africa