Leveraging resource management for efficient performance of Apache Spark

Abstract: Apache Spark is one of the most widely used open source processing frameworks for big data. It allows large datasets to be processed in parallel across a large number of nodes. Applications of this framework often use resource management systems such as YARN, which provide jobs with a specific amount of...

Bibliographic Details
Main Authors: Khadija Aziz, Dounia Zaidouni, Mostafa Bellafkih
Format: Article
Language: English
Published: SpringerOpen 2019-08-01
Series: Journal of Big Data
Online Access: http://link.springer.com/article/10.1186/s40537-019-0240-1