OpenSource For You

Advantages of Apache Spark


The following are the key advantages of Apache Spark.

Fast data processing: Spark was initially deployed mainly to make existing MapReduce applications more efficient. MapReduce is a general programming model and is not tied to the implementation in core Hadoop. Spark also supports MapReduce-style processing and uses memory efficiently, including when recovering from failures. Some operations run faster in Spark than in Hadoop's MapReduce even when they gain nothing from caching across iterations.

Iterative algorithms: With the cache() function, Spark lets users and applications explicitly mark datasets to be cached. Applications can then read the data from RAM rather than from disk. This can dramatically improve the performance of iterative algorithms, which access the same dataset again and again.
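The effect of caching on an iterative algorithm can be sketched without a Spark cluster. The toy model below is plain Python, not Spark's API: ToyDataset, load_from_disk and iterate_mean are hypothetical names invented for illustration, and the "disk read" is simulated by a counter. It shows only the access pattern described above, in which an uncached dataset is reloaded on every pass while a cached one is loaded once and then served from memory.

```python
class ToyDataset:
    """Toy stand-in for a dataset (an assumption for illustration, not Spark's API)."""

    def __init__(self, loader):
        self._loader = loader     # function simulating an expensive disk read
        self._use_cache = False   # set by cache()
        self._cached = None       # in-memory copy, filled on first read after cache()
        self.loads = 0            # number of simulated disk reads

    def cache(self):
        # Only marks the dataset for in-memory reuse; nothing is loaded yet.
        self._use_cache = True
        return self

    def collect(self):
        # Serve from memory if a cached copy exists, otherwise hit the "disk".
        if self._use_cache and self._cached is not None:
            return self._cached
        self.loads += 1
        data = self._loader()
        if self._use_cache:
            self._cached = data
        return data


def load_from_disk():
    # Hypothetical loader; a real job would read a file or table here.
    return [1.0, 2.0, 3.0, 4.0]


def iterate_mean(dataset, steps):
    # Simple iterative algorithm: each pass reads the whole dataset
    # and moves the estimate halfway toward the mean.
    estimate = 0.0
    for _ in range(steps):
        data = dataset.collect()
        gradient = sum(estimate - x for x in data) / len(data)
        estimate -= 0.5 * gradient
    return estimate


uncached = ToyDataset(load_from_disk)
cached = ToyDataset(load_from_disk).cache()

uncached_result = iterate_mean(uncached, 10)
cached_result = iterate_mean(cached, 10)

print("uncached loads:", uncached.loads)
print("cached loads:", cached.loads)
```

Both runs compute the same estimate; only the number of simulated loads differs. In PySpark the corresponding call on a real dataset would be rdd.cache(), which likewise only marks the dataset and keeps it in memory once an action first computes it.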
