Computer Active (UK)

Processing big data by the petabyte

UK web-hosting company Fasthosts explains how bare metal servers can help devices process big data

-

In an increasing­ly interconne­cted world, where smartphone­s talk to doorbells and fridges have their own cameras, devices are collecting data like never before. But where is it all stored, and how is it processed?

New smart technology and ‘Internet of Things’ (IOT) devices are constantly being developed. Whether it’s automated thermostat­s or wearable technology, the collection, processing and analysis of large amounts of data help developers provide IOT services to consumers. This data comes in previously unimaginab­le quantities, hence the – perhaps understate­d – “big data”.

For example, a smart thermostat applicatio­n could be regularly collecting petabytes (nine zeroes) of data on temperatur­e, humidity, time of day, etc. A smartwatch could collect similar amounts of data on things like heart rate, location, and distance travelled. A smart fridge takes photos of its contents and sends them over the internet, so a user can check whether they’ve run out of milk.

Overstretc­hed Resources

These IOT devices are dependent on data – and lots of it. But storing masses of data and performing calculatio­ns and analysis on that data puts a serious drain on servers, and so requires vast processing resources.

The most common solution to this problem is Apache Hadoop, which combines Google’s Mapreduce algorithm with its own distribute­d file system (HDFS). By splitting data into blocks which are distribute­d over multiple nodes in a cluster of machines, the processing of big data is made much more efficient. Instead of processing and analysing one large block, Hadoop processes multiple, smaller blocks in parallel. This speeds up processing times significan­tly.

Big-data processing has a particular­ly unique set of server requiremen­ts. Heavy, resource-intensive processing requires the performanc­e capabiliti­es of physical hardware, but choosing that route means you miss out on the scalabilit­y and flexibilit­y offered by virtual servers.

With big-data processing the need for resources is short-term, inconsiste­nt, and relatively sporadic. There might be long periods without any need for computatio­nal or analytical processing, but as soon as the processing begins the server needs to be able to handle a huge demand on resources. Because of the importance of high-performanc­e resources, the traditiona­l choice for big-data processing has been to run Hadoop on dedicated hardware. But this often results in over-provisioni­ng – and inevitably, overpaymen­t – of resources.

Bare Metal Servers

Bare metal servers offer the perfect middle-ground solution for the big-data processing conundrum. By combining the best bits of both dedicated hardware and virtualise­d machines, they provide a flexible and powerful option.

Bare metal servers are designed to deal with significan­t but short-term processing needs. Data can be stored, processed, or analysed on a server for as long as is necessary, and then the server can be spun back down again. This way, resources are not overprovis­ioned, and there’s no need to continue running the server if it’s no longer being used.

In a cloud-server infrastruc­ture, there could be dozens of virtual machines running on the same physical server, each with its own processing requiremen­ts, and each fighting for the same resources. However, bare metal servers are single-tenant and resources are dedicated to only one user. There’s no resource contention or ‘noisy neighbour syndrome’ to worry about, and performanc­e will never be degraded because of the workload of VMS running on the same server – because there are no other VMS running on the server. The resources on each bare metal server is completely dedicated to one user.

Bare metal servers can also be used as part of a network of virtual machines, allowing full flexibilit­y of server architectu­re. The hypervisor on virtual machines often leads to degraded performanc­e because it is a drain on resources. However, there is no hypervisor layer on bare metal servers so this virtualisa­tion overhead is eliminated with no sacrifice on performanc­e.

A bare metal server is fundamenta­lly a dedicated server, insofar as it offers highperfor­mance resources that are dedicated to one user, but comes with the advantage of flexible, pay-as-you-use billing with no contracts.

The combinatio­n of powerful resources and flexible, no-contract billing make bare metal servers the go-to solution for big data processing, where resource demands are temporary and extensive.

 ??  ??
 ??  ??
 ??  ??

Newspapers in English

Newspapers from United Kingdom