A Framework of Distributed Snapshots for Hadoop HBase

碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 101 === Apache Hadoop HBase™ is an emerging distributed key-value persistent data store, which can accommodate a large volume of data rapidly introduced from a variety of sources. While data objects stored in HBase are precious, HBase is unable to perform parallel re...

Full description

Bibliographic Details
Main Authors: Chi-TsunLiao, 廖啟村
Other Authors: Hung-Chang Hsiao
Format: Others
Language:en_US
Published: 2013
Online Access:http://ndltd.ncl.edu.tw/handle/5bwj3f
id ndltd-TW-101NCKU5392036
record_format oai_dc
spelling ndltd-TW-101NCKU53920362019-05-15T21:03:13Z http://ndltd.ncl.edu.tw/handle/5bwj3f A Framework of Distributed Snapshots for Hadoop HBase Hadoop HBase的分散式快照架構 Chi-TsunLiao 廖啟村 碩士 國立成功大學 資訊工程學系碩博士班 101 Apache Hadoop HBase™ is an emerging distributed key-value persistent data store, which can accommodate a large volume of data rapidly introduced from a variety of sources. While data objects stored in HBase are precious, HBase is unable to perform parallel recovery for recovering historical data objects concurrently stored in multiple storage servers in a consistent manner. The study presents a framework for implementing a data recovery scheme in HBase. The framework consists of four components, including (1) distributed snapshots represented by event logs gathered from internal (system) and external (clients) operations, (2) a global time labeling scheme for correlated events, (3) a bulk load process for bootstrapping a HBase cluster with a given snapshot, and (4) a forward replaying mechanism for precisely running the system into any specified time instance in the past. We enhance HBase such that it is capable of performing parallel recovery, and demonstrate our prototype implementation with performance results. In addition, based on our prototype, an application tracking multiple clients’ locations is demonstrated. Hung-Chang Hsiao 蕭宏章 2013 學位論文 ; thesis 32 en_US
collection NDLTD
language en_US
format Others
sources NDLTD
description 碩士 === 國立成功大學 === 資訊工程學系碩博士班 === 101 === Apache Hadoop HBase™ is an emerging distributed key-value persistent data store, which can accommodate a large volume of data rapidly introduced from a variety of sources. While data objects stored in HBase are precious, HBase is unable to perform parallel recovery for recovering historical data objects concurrently stored in multiple storage servers in a consistent manner. The study presents a framework for implementing a data recovery scheme in HBase. The framework consists of four components, including (1) distributed snapshots represented by event logs gathered from internal (system) and external (clients) operations, (2) a global time labeling scheme for correlated events, (3) a bulk load process for bootstrapping a HBase cluster with a given snapshot, and (4) a forward replaying mechanism for precisely running the system into any specified time instance in the past. We enhance HBase such that it is capable of performing parallel recovery, and demonstrate our prototype implementation with performance results. In addition, based on our prototype, an application tracking multiple clients’ locations is demonstrated.
author2 Hung-Chang Hsiao
author_facet Hung-Chang Hsiao
Chi-TsunLiao
廖啟村
author Chi-TsunLiao
廖啟村
spellingShingle Chi-TsunLiao
廖啟村
A Framework of Distributed Snapshots for Hadoop HBase
author_sort Chi-TsunLiao
title A Framework of Distributed Snapshots for Hadoop HBase
title_short A Framework of Distributed Snapshots for Hadoop HBase
title_full A Framework of Distributed Snapshots for Hadoop HBase
title_fullStr A Framework of Distributed Snapshots for Hadoop HBase
title_full_unstemmed A Framework of Distributed Snapshots for Hadoop HBase
title_sort framework of distributed snapshots for hadoop hbase
publishDate 2013
url http://ndltd.ncl.edu.tw/handle/5bwj3f
work_keys_str_mv AT chitsunliao aframeworkofdistributedsnapshotsforhadoophbase
AT liàoqǐcūn aframeworkofdistributedsnapshotsforhadoophbase
AT chitsunliao hadoophbasedefēnsànshìkuàizhàojiàgòu
AT liàoqǐcūn hadoophbasedefēnsànshìkuàizhàojiàgòu
AT chitsunliao frameworkofdistributedsnapshotsforhadoophbase
AT liàoqǐcūn frameworkofdistributedsnapshotsforhadoophbase
_version_ 1719109100824952832