Improving I/O Efficiency in Hadoop-Based Massive Data Analysis Programs

Apache Hadoop has been a popular parallel processing tool in the era of big data. While practitioners have rewritten many conventional analysis algorithms to make them customized to Hadoop, the issue of inefficient I/O in Hadoop-based programs has been repeatedly reported in the literature. In this...

Full description

Bibliographic Details
Main Authors: Kyong-Ha Lee, Woo Lam Kang, Young-Kyoon Suh
Format: Article
Language:English
Published: Hindawi Limited 2018-01-01
Series:Scientific Programming
Online Access:http://dx.doi.org/10.1155/2018/2682085