Apache Soft­ware Foun­da­tion de­vel­ops li­brary for scal­able in-data­base an­a­lyt­ics

OpenSource For You - - Fossbytes -

The Apache Soft­ware Foun­da­tion has re­leased Apache MADlib as a new toplevel project that helps de­liver scal­able in-data­base an­a­lyt­ics. The new re­lease is a re­sult of dis­cus­sions be­tween data­base engine de­vel­op­ers, data sci­en­tists, IT ar­chi­tects and aca­demics who were look­ing for ad­vanced skills in the field of data anal­y­sis.

Apache MADlib pro­vides par­al­lel im­ple­men­ta­tions of ma­chine learn­ing, graphs, math­e­mat­i­cal and sta­tis­ti­cal meth­ods for struc­tured and un­struc­tured data. It was ini­tially a part of the Apache In­cu­ba­tor. “Dur­ing the in­cu­ba­tion process, the MADlib com­mu­nity worked very hard to de­velop high-qual­ity soft­ware for in-data­base an­a­lyt­ics, in an open and in­clu­sive man­ner in ac­cor­dance with the Apache Way,” said Aaron Feng, vice pres­i­dent of Apache MADlib.

Start­ing from au­to­mo­tive and con­sumer goods to fi­nance and gov­ern­ment, MADlib has been de­ployed by var­i­ous in­dus­try ver­ti­cals. It helps to de­liver de­tailed an­a­lyt­ics on both struc­tured and un­struc­tured data us­ing SQL. This abil­ity makes the open source so­lu­tion an im­por­tant of­fer­ing for var­i­ous ma­chine learn­ing projects.

“We have seen our cus­tomers suc­cess­fully de­ploy MADlib on largescale data science projects across a wide va­ri­ety of in­dus­try ver­ti­cals,” said Elis­a­beth Hen­drick­son, vice pres­i­dent of R&D for data, Piv­otal. Apache MADlib is avail­able with Apache Li­cense 2.0. A project man­age­ment com­mit­tee

(PMC) helps its daily op­er­a­tions and in com­mu­nity de­vel­op­ment.

Newspapers in English

Newspapers from India

© PressReader. All rights reserved.