Microsoft creates human speech recognition bot
In a major breakthrough in the field of speech recognition, Microsoft researchers have created a technology that accurately recognises the words in a conversation like people do — a feat that may soon help people suf- fering from speech-related issues.
The team from Microsoft Artificial Intelligence and Research reported a speech recognition system that makes the same or fewer errors than professional transcriptionists.
The researchers reported a word error rate (WER) of 5.9 %, down from the 6.3% WER the team reported just last month.
The 5.9 % error rate is about equal to that of people who were asked to transcribe the same conversation, and it's the lowest ever recorded against the industry standard “Switchboard” speech recognition task.
“We've reached human parity. This is an historic achievement,” said Xuedong Huang, the company's chief speech scientist in a Microsoft blog post. The milestone means that, for the first time, a computer can recognize the words in a conversation as well as a person would.
In doing so, the team has beat a goal they set less than a year ago— and greatly ex- ceeded everyone else's expectations as well.
“Even five years ago, I wouldn't have thought we could have achieved this. I just wouldn't have thought it would be possible,” said Harry Shum, executive vice president who heads the Microsoft Artificial Intelligence and Research group.
The research milestone comes after decades of research in speech recognition, beginning in the early 1970s with DARPA, the US agency tasked with making technology breakthroughs in the interest of national security.
“This accomplishment is the culmination of over 20 years of effort,” said Geoffrey Zweig, who manages the Speech & Dialog research group.
The milestone will have broad implications for consumer and business products that can be significantly augmented by speech recognition. That includes consumer entertainment devices like the Xbox, accessibility tools such as instant speechto- text transcription and personal digital assistants such as Cortana. “This will make Cortana more powerful, making a truly intelligent assistant possible,” Shum said.To reach the human parity milestone, the team used Microsoft's Computational Network Toolkit (CNTK), a home-grown system for deep learning. IANS