Linux Format

Google open sources ‘Parsey McParseface’

‘The World’s Most Accurate Parser’ has been made available by the search giant for anyone to download.

Google (which makes no secret of its research into machine learning) recently open-sourced SyntaxNet, a neural network framework implemented in TensorFlow (Google’s open source software library for machine learning). Google says it can be used as a foundation for Natural Language Understanding (NLU) systems.

The release includes all the code needed to train new SyntaxNet models as well as ‘Parsey McParseface’, an English parser trained by Google that can be used to analyse text. The name is a nod towards the recent controversy over the naming competition for the UK research ship RRS Sir David Attenborough, in which ‘Boaty McBoatface’ won the popular vote. Parsey McParseface is touted as the most accurate model of its kind in the world. It is built on machine learning algorithms that learn to analyse the linguistic structure of language and can explain the functional role of each word in a given sentence.

Slav Petrov, a Senior Research Scientist at Google, gave examples of how SyntaxNet takes sentences and determines the syntactic relationships between the words in them. He also described how longer sentences might have thousands of different possible structures, a consequence of ambiguities such as the somewhat brain-bendingly named ‘prepositional phrase attachment ambiguity’. Humans resolve these easily, but SyntaxNet needs neural nets to deal with them (http://bit.ly/SyntaxNetOpenSourced).

Google claims that Parsey correctly identifies the dependencies between words over 94% of the time on standard testing data (newswire sentences), which approaches human levels of performance. On more free-form data this drops to 90%. The team want to carry on developing this approach in order to incorporate real-world knowledge and to make Parsey able to understand natural language across all languages and contexts. The paper detailing how all this works is at arXiv.org (http://bit.ly/GNTNN), while the code is available on GitHub (http://bit.ly/SyntaxNetCode).
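To give a feel for what a figure like 94% measures, here is a minimal sketch in Python. It assumes the figure is an attachment score, meaning the fraction of words whose predicted head (the word they depend on) matches a human-annotated parse; the article does not spell out the exact metric. The sentence, the head lists and the function name `attachment_score` are made up for illustration and are not taken from SyntaxNet.

```python
def attachment_score(gold_heads, predicted_heads):
    """Return the fraction of words whose predicted head matches the gold head."""
    if len(gold_heads) != len(predicted_heads):
        raise ValueError("both parses must cover the same words")
    if not gold_heads:
        raise ValueError("cannot score an empty sentence")
    correct = sum(g == p for g, p in zip(gold_heads, predicted_heads))
    return correct / len(gold_heads)


# Hypothetical example sentence with a prepositional phrase ("with a telescope").
words = ["Alice", "saw", "Bob", "with", "a", "telescope"]

# Each entry is the 1-based position of that word's head; 0 marks the root.
gold = [2, 0, 2, 2, 6, 4]       # "with" attaches to "saw" (she used a telescope)
predicted = [2, 0, 2, 3, 6, 4]  # "with" attaches to "Bob" (he has a telescope)

score = attachment_score(gold, predicted)
print(f"{score:.2f}")
```

The two parses differ only in where "with" attaches, which is exactly the sort of prepositional phrase ambiguity described above, so one wrong decision out of six words is what pulls the score below 1.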
