PCQuest

The Rising Affection Towards Data Mining

Through this article we discuss the five key trends that’ll enable wide adoption of Big Data solutions amongst enterprise­s

- – Madhusudha­n KM, Chief Technology Officer, Mindtree

BigData adoption is on the rise. Most enterprise­s either have adopted BigData solutions or are on the verge of implementi­ng it. BigData solutions kicked off as a phenomenon for lowcost storage and massively parallel computing in batch mode. Today, these solutions are capable of performing real-time analytics using techniques such as streaming analytics coupled with deeper data mining techniques. Following are some of the key trends we foresee in 2016.

Enterprise Data Lake – Organizati­ons have spent decades integratin­g silo data sources and continue to face challenges in terms of deriving value by combining heterogene­ous data sources (both internal and external). Inability to consistent access of transactio­nal and large historical data poses a challenge for data analysis techniques. Data lake addresses this challenge by bringing silo data sources (structured, unstructur­ed and semi-structured) under one umbrella (Hadoop like ecosystem). Data lakes offer unified data management capabiliti­es in terms of metadata management and auditing. Due to the gaining popularity of data lakes, many public cloud providers are now providing Data Lake as a PaaS offering. Data security is another area where we see a rise in new sophistica­ted tools enabling advanced encryption and data governance mechanisms.

IoT – BigData and analytics are an integral part of Internet of things. With the rising number of devices and smart sensors (in cars, building, cities, manufactur­ing plants, wearables, etc.) exabytes of data get generated every day. IoT solutions will leverage BigData capabiliti­es like streaming analytics, complex event processing in real-time and NoSQLs to store time series data. Adoption of analytics at the edge (fog computing) is picking up to enable proximity computing. Fog computing enables local analytics to perform quick real-time decision-making before sending data onto the cloud.

Deep Learning – With a rise in the amount of data and data diversity, it is becoming increasing­ly difficult to apply prebuilt models for machine learning. Hypothesis validation is becoming a cumbersome task. Deep learning based on artificial neural network techniques is being used to identify patterns, prediction­s without applying pre-built models. Large corpus of data is essential for these self-learning techniques to accurately predict the outcome. Deep learning techniques are being used for image processing, scene detection, predictive modeling, etc. Deep learning is expected to make a significan­t contributi­on to text analytics and image to text semantic generation.

New age BigData platforms - BigData platforms leverage multiple tools/technologi­es for batch/micro batch processing, real time analytics, machine learning, graph processing etc. This complicate­s the IT landscape and support and operations of BigData platforms becomes increasing­ly challengin­g. New age BigData platform like Apache Spark are gaining popularity as it brings different type of workloads under a common platform. In memory analytics, capabiliti­es make Spark perform better in comparison to traditiona­l MapReduce programs. We expect enterprise­s to adopt new age BigData platforms to simplify the technology landscape.

Data Visualizat­ion – Quest for becoming a datadriven organizati­on is driving enterprise­s to adopt data discovery and visualizat­ion tools. Apart from traditiona­l enterprise reporting tools, organizati­ons are expected to invest heavily on data discovery tools that enable business users to freely explore data. These tools also aid decision science by helping data scientists to easily identify feature metrics essential for machine learning techniques.

In summary, BigData solutions open up enormous opportunit­ies to build solutions that were otherwise never possible to build.

“BigData platforms leverage multiple tools/technologi­es for batch/micro batch processing, real time analytics, machine learning, graph processing etc. This complicate­s the IT landscape, Support and operations of BigData platforms have increasing­ly become challengin­g.”

 ??  ?? Madhusudha­n KM Chief Technology Officer, Mindtree
Madhusudha­n KM Chief Technology Officer, Mindtree

Newspapers in English

Newspapers from India