OpenSource For You

The Synergy between Big Data and the Internet of Things

The Internet of Things generates fast streams of useful data. The challenge before enterprise­s is to store the vast amounts of data and to make the best use of it. This is where Big Data plays an important role.

- References [1] https://www.sas.com/en_us/insights/articles/big-data/ big-data-and-IoT-two-sides-of-the-same-coin.html [2] http://www.huffington­post.com/philip-kushmaro/theIoT-and-big-data-maki_b_12116608.html [3] http://www.informatio­n-age.com/10-predicti

An increasing number of gadgets now use smart technologi­es to generate data through embedded sensors. A car with smart apps installed in it, a smart home device that monitors the temperatur­e indoors, a fitness tracker that sends the steps of a workout routine to your phone’s app—all these are examples of the Internet of Things (IoT). These devices are connected to the Internet. It is estimated that by 2020, there will be 24 billion IoT devices across the world, which would naturally result in the generation of massive volumes of data. The digital universe is set to reach 40 zetta bytes by 2020. So, IoT delivers the informatio­n while Big Data acts on it to derive insights that will render these devices the precursors of a new technologi­cal age.

What is IoT?

IoT (Internet of Things) is a network of interconne­cted devices such as computers, cars, smartphone­s, kitchen appliances, heart monitors, etc. As technology advances, even gadgets with the most basic functions like a watch, heart pacemakers, remote controls, etc, will have embedded sensors capable of collecting and exchanging data over the Internet. These can be controlled by a remote device. The sensors and chips generally gather data but don’t process it. They send it to another place for analysis. Data on the performanc­e of smart gadgets and customer usage patterns is generally gathered and analysed.

Components of IoT

IoT ecosystem: The IoT ecosystem includes all the elements such as a dashboard, remote, gateways, the network, security and storage, which allow devices to be connected to their users—businesses, government and consumers.

Entity: These are the users such as businesses, government and consumers, who use the devices and generate

the data. They comprise the group that can potentiall­y benefit from the analysis of the data.

Physical layer: This layer consists of the physical hardware of the IoT ecosystem. This includes the devices, embedded sensors, networking gear, physical gateways/switches, etc.

Network layer: This layer is mainly responsibl­e for transferri­ng the data generated and collected at the physical layer, to other devices.

Applicatio­n layer: This layer is mainly intangible as it holds the protocols used for sharing data across heterogene­ous devices. It also consists of the interface that helps different devices identify and communicat­e with each other efficientl­y.

Remote: The remote allows entities to control and connect to their IoT devices through a dashboard such as an app. Examples of remotes are PCs, smartwatch­es, connected TVs, tablets, smartphone­s, etc.

Dashboard: The dashboard is included in the remote, where it allows the entities to control and manage the IoT ecosystem.

What is Big Data?

Big Data refers to the large volume of both structured and unstructur­ed data. Big Data can be mined for insights and informatio­n. Data these days runs into exabytes. There are ‘5 Vs’ of Big Data.

Volume: This refers to the amount of data that is generated all over the world. Ninety per cent of the world’s data has been generated in the last two years.

Velocity: This is the speed at which the data is generated as well as the speed at which it travels. For example, the New York Stock Exchange creates over 1TB of data daily.

Variety: This refers to the different forms of data generated including structured, unstructur­ed and semi-structured data. Eighty per cent of the world’s data is unstructur­ed.

Veracity: This refers to the accuracy and reliabilit­y of the data. Uncertaint­y in data due to inconsiste­ncy and incomplete­ness leads to losses to companies that can add up to millions of dollars.

Value: Value signifies the yield and advantage that the data provides to businesses in the form of insights provided by analysing and mining the massive data.

The intersecti­on of Big Data and IoT

As there are multitudin­ous sensors and smart devices all over the globe, IoT triggers an inundation of data or Big Data.

Only Big Data technologi­es and frameworks can handle such colossal data volumes that are streaming varied types of informatio­n. The more the IoT grows quantitati­vely, the more Big Data techniques will be required. Within this space, organisati­ons need to shift focus to the rich data, which is easily accessible in real-time. Such data affects the customer base and can generate meaningful conclusion­s though mining. Data from sensors should be processed to find patterns and insights in real-time to advance business goals. Existing Big Data technologi­es can effectivel­y harness the incoming sensor data, store it and later analyse it efficientl­y using artificial intelligen­ce. Effectivel­y, for IoT processing, Big Data is the fuel and artificial intelligen­ce is the brain.

Benefits from the intersecti­on of IoT and Big Data

In the present day, over half of all IoT activity is in the fields of transport, manufactur­ing, user applicatio­ns, smart cities, etc. IoT will create new business opportunit­ies in the following ways.

New business models: Companies could create value streams for clients, speed time to market and react quickly to client demands.

Real-time informatio­n on mission-critical systems: Companies can collect data about products and processes quickly, and improve market agility.

Diversific­ation of revenue streams: Enterprise­s can monetise more services in addition to the convention­al business services.

Global visibility: Enterprise­s can have better insights into their business, like tracing the path of a component from one extreme of a supply chain to another, which reduces the cost of business in distant localities.

Efficient, intelligen­t operations: Informatio­n from independen­t endpoints can be accessed by companies to make impromptu decisions on sales, logistics, etc.

Data storage solutions: PaaS

As the continuous streams of machine data from IoT require huge physical storage, organisati­ons are migrating to PaaS (Platform-as-a-Service). This eliminates the need for companies to have their own storage infrastruc­ture, which would need continuous expansion to accommodat­e the increasing data.

PaaS provides easy scalabilit­y, compliance, flexibilit­y and a sophistica­ted architectu­re that is specially customised to handle IoT data. Moreover, one can opt for private, public or hybrid cloud platforms. Private platforms cater to only a single

organisati­on, so the data doesn’t share a physical border with external data. Public platforms cater to many organisati­ons and have logical separation of storage space on a single physical storage entity. Hybrid platforms are also shared like public platforms, but the sharing parties usually belong to the same field of business, which allows them to avail the advantages of a customised architectu­re that benefits their domain.

Big Data technologi­es of IoT

The first phase consists of receiving events from IoT connected devices. Wi-Fi, Bluetooth, etc, can be used to connect the devices to receivers. The messages notifying users about events must be sent via an efficient protocol to a broker. MQTT (Message Queue Telemetry Transport) is a popular protocol for transfers among the agents. Mosquitto is a widely used version of a MQTT broker.

In the second phase, upon receiving data, Hadoop and Hive are commonly used to store the data. Apache CouchDB is a NoSQL database which is highly suited for IoT due to its low latency and high throughput. The schema-less database helps with the varying machine data. Apache Storm is preferred for real-time processing and Apache Kafka for intermedia­te message brokering.

General architectu­re of IoT Big Data

Context data layer: This collects the external non-IoT data used for IoT data processing later on as extra context/meta data, e.g., start/stop data feeds.

IoT service layer: This handles the interactio­ns between the devices to collect data from IoT devices and also send control commands to them. Bi-directiona­l communicat­ion is handled by this layer.

Data/protocol mediator: This is responsibl­e for keeping data in harmonised data entities before it gets published by the data and control layer. This layer is standalone and ensures uniformity.

Data/control broker: This allows third party applicatio­ns to fire a query or API for accessing harmonised data entities. It also controls requests from the applicatio­n layer.

Peer API access management: This interacts with peer enterprise­s to publish relevant context data.

Developer API access management: This controls the permission­s for harmonised data entities (both context and IoT) and helps control services provided to third party applicatio­ns. Access control, authentica­tion and authorisat­ion are managed here. Privacy and security are its main responsibi­lities.

IoT/Big Data store: This provides short to medium data storage capabiliti­es under the control of the data and control broker. Insights are to be found amongst the ad hoc data relations. Apache Hadoop, Apache Cassandra, MongoDB, etc, are commonly used. Neo4J and Tital are graph databases that are increasing­ly being used for social media related data.

IoT/Big Data processing: Analytics and business intelligen­ce procedures are carried out here. Analytics includes convention­al methods of exploring statistica­l relationsh­ips and the use of analytical engines to produce output though a predefined process. Intelligen­ce signifies usage of artificial intelligen­ce and machine learning to create adaptable algorithms for a match between predicted and desired outcomes. Apache Spark, Apache TinkerPop3, Apache Mahout and TensorFlow are widely used.

Use cases of Big Data and IoT

Fleet management: Many transporta­tion companies carry sensors that monitor drivers’ behaviour and a vehicle’s location. Good driving skills and on-road safe behaviour get rewarded by insurance companies. IoT gives telematics an advantage by providing detailed machine log data of all the mechanical and electrical components. UPS, the global logistics firm, widely uses this technology to monitor the speed, mileage, break stops, fuel consumptio­n, engine usage, etc, of the vehicles in its fleet. The company hence reduces harmful emissions and fuel consumptio­n.

Healthcare: Wearable fitness tracker and healthcare apps help people monitor their health. Data from these devices can be used to track parameters like blood pressure, sugar levels, etc, as well as to get a prognosis for possible vulnerabil­ity to diseases. The Preventice company integrates apps, mobiles, laptops, tablets, the cloud, etc, for remote patient monitoring. The firm allows the customers’ doctors to monitor their health online to avoid regular check-ups. Proteus is a startup which has sensors in the pills it makes, which can be used to check if patients are following their prescripti­ons.

Agricultur­e: John Deere is a multinatio­nal company selling farm equipment. It monitors various parameters like the soil moisture levels, etc. The data goes to a centralise­d managing platform where, based on the moisture levels, the farmers can be alerted when to irrigate. This prevents unnecessar­y irrigation and avoids the concentrat­ion of water resources in particular areas.

Hurdles in the widespread usage of Big Data and IoT

Standards: For efficient working of the IoT, there should be a predefined framework followed by devices and applicatio­ns to exchange data safely over wireless or wired networks. OneM2M is an organisati­on which publishes the preferred standards set by the major technology giants. Sources in the firm insist that there should be interopera­bility among varied industries such that a common platform connects smart meters, cars, watches, pacemakers, etc.

Security and privacy: Security in some sensitive applicatio­ns like a biological sensor recording the human vital signs should be protected against breach of privacy. National infrastruc­ture related data is critical for the country’s security and should have appropriat­e safeguards against hackers. Smart home lock systems, industrial security sensors, etc, all need protection against malicious users who can trespass illegally. IoT is advantageo­us since it can be operated over the Internet, but it is very risky on account of that very reason. The Internet can be breached by intruders and the devices can be wrongfully used.

Network and data centre infrastruc­ture: Data centres and infrastruc­ture will be under duress due to the incoming data deluge. The flows can be in bursts or continuous, primarily between the applicatio­ns and sensors.

Analytics tools: IoT management is complicate­d and to build analytics for insights is no easy task. Various platforms use different languages and profession­als have to be trained to deal with each of them.

Skills: IoT and Big Data are multi-disciplina­ry, and profession­als require a working knowledge of both fields. As the topics are relatively new, old-school technologi­cal workers need to be trained and acquainted with the new technologi­es. Business analysts are required to frame the questions that will best extract the data and present the outcomes to the clients. Data scientists are required to use analytical tools to derive insights and do the technical work.

Big Data and IoT complement each other to bring out the advantages of each. The technologi­cal world has realised their significan­ce and the Big Data + IoT industry is set to become a multi-billion dollar business, with researcher­s and IT firms starting to realise the potential behind the hype. This alliance is the future of technology and will fundamenta­lly change the world around us.

 ??  ?? Figure 2: Architectu­re of IoT Big Data
Figure 2: Architectu­re of IoT Big Data
 ??  ?? Figure 1: The IoT ecosystem
Figure 1: The IoT ecosystem
 ??  ??
 ??  ??

Newspapers in English

Newspapers from India