Mozilla’s JavaScript framework can understand Web pages just like people do

OpenSource For You - Foss Bytes

Mozilla is taking the Firefox browser to the next level with Fathom, a JavaScript framework that helps the browser extract meaning from Web pages.

Fathom is described as a mini-language for writing semantic extractors. As its name indicates, the framework helps Firefox understand the content and structure of a Web page, picking out page descriptions, images and other items. It is being used in the Activity Stream feature of Firefox.

Fathom can be used in a browser, in extensions and even in server-side software. It is a dataflow language, like Prolog.

Mozilla’s developers have designed Fathom to score and classify DOM nodes based on user-specified conditions, annotations and a system of types. This process helps the framework extract meaning from parts of a page such as address forms, previous/next buttons and textual content.
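To make the idea of scoring nodes against user-specified conditions concrete, here is a minimal TypeScript sketch. It does not use Fathom's actual API: `PageNode`, `ScoringRule`, `scoreNodes` and `bestOfType` are hypothetical names, the nodes are plain objects standing in for DOM elements, and multiplying scores is only one plausible way to combine rules.

```typescript
// Hypothetical, simplified stand-in for a DOM node; not Fathom's own data model.
interface PageNode {
  tag: string;
  text: string;
  attrs: { [name: string]: string };
}

// A rule pairs a user-specified condition with a type label and a score multiplier.
interface ScoringRule {
  type: string;
  matches: (node: PageNode) => boolean;
  multiplier: number;
}

interface ScoredNode {
  node: PageNode;
  type: string;
  score: number;
}

// Apply every rule to every node. Each matching rule multiplies the node's
// score for that rule's type, starting from a neutral score of 1.
function scoreNodes(nodes: PageNode[], rules: ScoringRule[]): ScoredNode[] {
  const results: ScoredNode[] = [];
  for (const node of nodes) {
    const scores: { [type: string]: number } = {};
    for (const r of rules) {
      if (r.matches(node)) {
        const current = scores[r.type] === undefined ? 1 : scores[r.type];
        scores[r.type] = current * r.multiplier;
      }
    }
    for (const type of Object.keys(scores)) {
      results.push({ node, type, score: scores[type] });
    }
  }
  return results;
}

// Pick the highest-scoring node annotated with the given type, if any.
function bestOfType(scored: ScoredNode[], type: string): ScoredNode | undefined {
  let best: ScoredNode | undefined = undefined;
  for (const s of scored) {
    if (s.type === type && (best === undefined || s.score > best.score)) {
      best = s;
    }
  }
  return best;
}

// Illustrative page content and rules (all invented for this sketch).
const nodes: PageNode[] = [
  { tag: "a", text: "Home", attrs: {} },
  { tag: "a", text: "What to read next", attrs: {} },
  { tag: "a", text: "Next", attrs: { rel: "next" } },
  { tag: "input", text: "", attrs: { name: "street-address" } },
];

const rules: ScoringRule[] = [
  { type: "nextButton", multiplier: 2, matches: (n) => n.text.toLowerCase().indexOf("next") !== -1 },
  { type: "nextButton", multiplier: 3, matches: (n) => n.attrs["rel"] === "next" },
  { type: "addressField", multiplier: 2, matches: (n) => n.tag === "input" && (n.attrs["name"] || "").indexOf("address") !== -1 },
];

const scored = scoreNodes(nodes, rules);
const bestNext = bestOfType(scored, "nextButton");       // the "Next" link, score 2 * 3 = 6
const bestAddress = bestOfType(scored, "addressField");  // the input node, score 2
```

The link that satisfies both "next button" conditions outranks the one that only mentions the word "next", which is the kind of evidence-weighing the article describes.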

According to Erik Rose, senior staff software architect at Mozilla, the framework is still at an early stage of development. It nevertheless enables the browser to identify the meaningful parts of a page and show a summary.

Rule sets in Fathom are written as JavaScript function calls that make annotations on a syntax tree. This design also allows the score constants used by the rules to be tuned automatically. Future releases may learn and generate rules automatically.
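As a rough illustration of why rule sets built from function calls lend themselves to automatic tuning, the TypeScript sketch below builds a tiny rule set from a numeric constant and then searches a few candidate values for the one that classifies labelled samples best. Everything here is an assumption made for illustration: `rule`, `ruleset`, `buildRuleset` and `tune` are hypothetical helpers rather than Fathom's API, and the exhaustive search over candidates is a stand-in for whatever optimiser the framework actually uses.

```typescript
// Labelled training sample: is this piece of text a page title?
interface Sample {
  text: string;
  isTitle: boolean;
}

type Condition = (text: string) => boolean;

// Hypothetical rule shape: a condition plus a score constant added when it matches.
interface WeightedRule {
  condition: Condition;
  weight: number;
}

function rule(condition: Condition, weight: number): WeightedRule {
  return { condition, weight };
}

// A rule set is just data combined into a scoring function.
function ruleset(...rules: WeightedRule[]): (text: string) => number {
  return (text: string) => {
    let score = 0;
    for (const r of rules) {
      if (r.condition(text)) score += r.weight;
    }
    return score;
  };
}

// The penalty for a trailing period is the score constant we want to tune.
function buildRuleset(periodPenalty: number): (text: string) => number {
  return ruleset(
    rule((t) => t.length < 60, 1),
    rule((t) => t.charAt(t.length - 1) === ".", periodPenalty)
  );
}

// Fraction of samples where "score > 0" agrees with the label.
function accuracy(score: (text: string) => number, samples: Sample[]): number {
  let correct = 0;
  for (const s of samples) {
    if ((score(s.text) > 0) === s.isTitle) correct += 1;
  }
  return correct / samples.length;
}

// Try each candidate constant and keep the first one with the highest accuracy.
function tune(candidates: number[], samples: Sample[]): { penalty: number; accuracy: number } {
  let best = { penalty: NaN, accuracy: -1 };
  for (const c of candidates) {
    const acc = accuracy(buildRuleset(c), samples);
    if (acc > best.accuracy) best = { penalty: c, accuracy: acc };
  }
  return best;
}

// Invented samples for the sketch.
const samples: Sample[] = [
  { text: "Mozilla unveils Fathom", isTitle: true },
  { text: "Read more.", isTitle: false },
  { text: "Fathom scores DOM nodes with rules and picks the highest-scoring one for each type.", isTitle: false },
];

const tuned = tune([0, -0.5, -2], samples);  // selects penalty -2, which classifies all three samples correctly
```

Because the rules are ordinary values rather than hand-written branching code, a program can rebuild and re-evaluate them with different constants, which is the property that makes automated tuning feasible.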
