Improving I/O Efficiency in Hadoop-Based Massive Data Analysis Programs
Apache Hadoop has been a popular parallel processing tool in the era of big data. While practitioners have rewritten many conventional analysis algorithms to make them customized to Hadoop, the issue of inefficient I/O in Hadoop-based programs has been repeatedly reported in the literature. In this...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Hindawi Limited
2018-01-01
|
Series: | Scientific Programming |
Online Access: | http://dx.doi.org/10.1155/2018/2682085 |