Performance Enhancement for Data Deduplication System

碩士 === 國立臺灣科技大學 === 資訊工程系 === 102 === In recent years, network technology has been developed dramatically. With the improvement of network bandwidth, accesses to the remote data storage system become more and more frequent. However, huge amount of data stored in data storage system might degrade the...

Full description

Bibliographic Details
Main Authors: Ting-Jui Lin, 林廷叡
Other Authors: none
Format: Others
Language:zh-TW
Published: 2014
Online Access:http://ndltd.ncl.edu.tw/handle/13309990067455641619
id ndltd-TW-102NTUS5392074
record_format oai_dc
spelling ndltd-TW-102NTUS53920742016-03-09T04:31:00Z http://ndltd.ncl.edu.tw/handle/13309990067455641619 Performance Enhancement for Data Deduplication System Performance Enhancement for Data Deduplication System Ting-Jui Lin 林廷叡 碩士 國立臺灣科技大學 資訊工程系 102 In recent years, network technology has been developed dramatically. With the improvement of network bandwidth, accesses to the remote data storage system become more and more frequent. However, huge amount of data stored in data storage system might degrade the system performance. Data deduplication system could solve the performance degradation problem efficiently. Data deduplication is a technique that could eliminate the redundant copies of duplicated data. We can calculate fingerprint of each data segment through some hash function. Since fingerprint is unique in data deduplication system, we could identify whether two data segments are identical by comparing their fingerprints. Once we have identified the same fingerprints in the system, we could ignore the write request since its content has already existed in the system. In this thesis, we propose to enhance the performance of data deduplication system by storing the fingerprint store in SSD. In the traditional data deduplication system, DRAM is the major storage medium of fingerprint store. Comparing SSD with DRAM, the former one has a lower price. Using SSD as the major storage medium of fingerprint store, we could extend the capacity of fingerprint store easily. We also solve the data read fragmentation problem in data deduplication system to improve the system response time. The experiment results showed that the capacity of storage device could be saved by 36.86%, while the system response time could be improved by up to 34%. none 謝仁偉 2014 學位論文 ; thesis 45 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 國立臺灣科技大學 === 資訊工程系 === 102 === In recent years, network technology has been developed dramatically. With the improvement of network bandwidth, accesses to the remote data storage system become more and more frequent. However, huge amount of data stored in data storage system might degrade the system performance. Data deduplication system could solve the performance degradation problem efficiently. Data deduplication is a technique that could eliminate the redundant copies of duplicated data. We can calculate fingerprint of each data segment through some hash function. Since fingerprint is unique in data deduplication system, we could identify whether two data segments are identical by comparing their fingerprints. Once we have identified the same fingerprints in the system, we could ignore the write request since its content has already existed in the system. In this thesis, we propose to enhance the performance of data deduplication system by storing the fingerprint store in SSD. In the traditional data deduplication system, DRAM is the major storage medium of fingerprint store. Comparing SSD with DRAM, the former one has a lower price. Using SSD as the major storage medium of fingerprint store, we could extend the capacity of fingerprint store easily. We also solve the data read fragmentation problem in data deduplication system to improve the system response time. The experiment results showed that the capacity of storage device could be saved by 36.86%, while the system response time could be improved by up to 34%.
author2 none
author_facet none
Ting-Jui Lin
林廷叡
author Ting-Jui Lin
林廷叡
spellingShingle Ting-Jui Lin
林廷叡
Performance Enhancement for Data Deduplication System
author_sort Ting-Jui Lin
title Performance Enhancement for Data Deduplication System
title_short Performance Enhancement for Data Deduplication System
title_full Performance Enhancement for Data Deduplication System
title_fullStr Performance Enhancement for Data Deduplication System
title_full_unstemmed Performance Enhancement for Data Deduplication System
title_sort performance enhancement for data deduplication system
publishDate 2014
url http://ndltd.ncl.edu.tw/handle/13309990067455641619
work_keys_str_mv AT tingjuilin performanceenhancementfordatadeduplicationsystem
AT líntíngruì performanceenhancementfordatadeduplicationsystem
_version_ 1718202363073789952