A Framework of Distributed Snapshots for Hadoop HBase
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 101 === Apache Hadoop HBase™ is an emerging distributed key-value persistent data store, which can accommodate a large volume of data rapidly introduced from a variety of sources. While data objects stored in HBase are precious, HBase is unable to perform parallel re...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | en_US |
Published: |
2013
|
Online Access: | http://ndltd.ncl.edu.tw/handle/5bwj3f |
id |
ndltd-TW-101NCKU5392036 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-101NCKU53920362019-05-15T21:03:13Z http://ndltd.ncl.edu.tw/handle/5bwj3f A Framework of Distributed Snapshots for Hadoop HBase Hadoop HBase的分散式快照架構 Chi-TsunLiao 廖啟村 碩士 國立成功大學 資訊工程學系碩博士班 101 Apache Hadoop HBase™ is an emerging distributed key-value persistent data store, which can accommodate a large volume of data rapidly introduced from a variety of sources. While data objects stored in HBase are precious, HBase is unable to perform parallel recovery for recovering historical data objects concurrently stored in multiple storage servers in a consistent manner. The study presents a framework for implementing a data recovery scheme in HBase. The framework consists of four components, including (1) distributed snapshots represented by event logs gathered from internal (system) and external (clients) operations, (2) a global time labeling scheme for correlated events, (3) a bulk load process for bootstrapping a HBase cluster with a given snapshot, and (4) a forward replaying mechanism for precisely running the system into any specified time instance in the past. We enhance HBase such that it is capable of performing parallel recovery, and demonstrate our prototype implementation with performance results. In addition, based on our prototype, an application tracking multiple clients’ locations is demonstrated. Hung-Chang Hsiao 蕭宏章 2013 學位論文 ; thesis 32 en_US |
collection |
NDLTD |
language |
en_US |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 101 === Apache Hadoop HBase™ is an emerging distributed key-value persistent data store, which can accommodate a large volume of data rapidly introduced from a variety of sources. While data objects stored in HBase are precious, HBase is unable to perform parallel recovery for recovering historical data objects concurrently stored in multiple storage servers in a consistent manner. The study presents a framework for implementing a data recovery scheme in HBase. The framework consists of four components, including (1) distributed snapshots represented by event logs gathered from internal (system) and external (clients) operations, (2) a global time labeling scheme for correlated events, (3) a bulk load process for bootstrapping a HBase cluster with a given snapshot, and (4) a forward replaying mechanism for precisely running the system into any specified time instance in the past. We enhance HBase such that it is capable of performing parallel recovery, and demonstrate our prototype implementation with performance results. In addition, based on our prototype, an application tracking multiple clients’ locations is demonstrated.
|
author2 |
Hung-Chang Hsiao |
author_facet |
Hung-Chang Hsiao Chi-TsunLiao 廖啟村 |
author |
Chi-TsunLiao 廖啟村 |
spellingShingle |
Chi-TsunLiao 廖啟村 A Framework of Distributed Snapshots for Hadoop HBase |
author_sort |
Chi-TsunLiao |
title |
A Framework of Distributed Snapshots for Hadoop HBase |
title_short |
A Framework of Distributed Snapshots for Hadoop HBase |
title_full |
A Framework of Distributed Snapshots for Hadoop HBase |
title_fullStr |
A Framework of Distributed Snapshots for Hadoop HBase |
title_full_unstemmed |
A Framework of Distributed Snapshots for Hadoop HBase |
title_sort |
framework of distributed snapshots for hadoop hbase |
publishDate |
2013 |
url |
http://ndltd.ncl.edu.tw/handle/5bwj3f |
work_keys_str_mv |
AT chitsunliao aframeworkofdistributedsnapshotsforhadoophbase AT liàoqǐcūn aframeworkofdistributedsnapshotsforhadoophbase AT chitsunliao hadoophbasedefēnsànshìkuàizhàojiàgòu AT liàoqǐcūn hadoophbasedefēnsànshìkuàizhàojiàgòu AT chitsunliao frameworkofdistributedsnapshotsforhadoophbase AT liàoqǐcūn frameworkofdistributedsnapshotsforhadoophbase |
_version_ |
1719109100824952832 |