Fusion-based Hadoop MapReduce job for fault tolerance in distributed systems

Standard recovery solution on a failed task in Hadoop systems is to execute the task again. After retrying for a configured number of times, it is marked as failure. With significant amount of data, complicated Map and Reduce functions, recovering corrupted or unfinished data from a failed job can b...

Full description

Bibliographic Details
Main Author: Ho, Iat-Kei
Format: Others
Language:en_US
Published: 2013
Subjects:
Map
Online Access:http://hdl.handle.net/2152/22605