The Arizona Republic

Voice recognitio­n can’t replace typing yet

-

Question: Is there a voice-recognitio­n system that can replace typing that actually works well?

Answer: As much as technology has advanced since Bell Labs developed the Audrey in the 1950s, which could only recognize digits spoken by a single user, we’re still far from what you’ve seen in science-fiction movies.

Commands vs. dictation

Voice-command platforms like many automated phone systems use are reasonably effective because they severely limit the number of verbal commands you can use.

Natural-speech recognitio­n is what most people want, and that’s a challenge that has yet to be met in a way for it to be widely adopted.

We’re surrounded by options that offer some form of voice command/recognitio­n from Apple, Google and Amazon, but they are far from perfect, as we all well know.

Accurate dictation has been the challenge that many very sophistica­ted companies, including IBM, have been trying to solve for more than 60 years.

To put the problem into perspectiv­e, a system with a 90 percent accuracy means that every 10th word is wrong. Ninety-five percent accuracy gives us a one-in-20 ratio and even at 98 percent, we’re still looking at roughly one in 50 words being incorrect.

With an average paragraph in the 100150 word range, you can start to see how the time we may save in generating the text can get eaten up in editing what was captured.

Throw in how our voices change when we’re sick, various accents, the speed at which we speak and a host of other variables, and you start to understand how much more sophistica­ted of a processor the human brain is.

The context problem

Another huge challenge is context, both in command and dictation technology.

Google recently started to bridge the context gap with its latest Google Assistant technology, which allows you to have more of a conversati­on.

For example, you can ask, “Do I need an umbrella today?” And after it responds, you can follow up with, “What about tomorrow?”

Another advance is possible because of what many consider the “creepy” factor of today’s technology.

Because our smartphone­s can remember virtually everything we’ve done in the past and consider our current location or what we’ve previous searched online or in a mapping program, they can use this additional info to better understand your verbal commands.

Tips for being successful

If dictation is your key need, the company that’s been at it the longest is Dragon NaturallyS­peaking.

As good as the program is, expecting to install the software and have it magically become your new way of “typing” will guarantee failure. You essentiall­y are going to be learning a new language.

If you aren’t willing to take the necessary time to train yourself to learn how to speak to your computer, you shouldn’t bother spending the money.

To be successful, you’ll also need the proper hardware, such as enough processing power and RAM and a good microphone. Be sure to review the system requiremen­ts before taking the plunge.

Newspapers in English

Newspapers from United States