Design and Implementation of File Deduplication Framework on HDFS

File systems are designed to control how files are stored and retrieved. Without knowing the context and semantics of file contents, file systems often contain duplicate copies and result in redundant consumptions of storage space and network bandwidth. It has been a complex and challenging issue fo...

Full description

Bibliographic Details
Main Authors: Ruey-Kai Sheu, Shyan-Ming Yuan, Win-Tsung Lo, Chan-I Ku
Format: Article
Language:English
Published: SAGE Publishing 2014-04-01
Series:International Journal of Distributed Sensor Networks
Online Access:https://doi.org/10.1155/2014/561340