Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
碩士 === 輔仁大學 === 資訊工程學系碩士班 === 107 === Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue. Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distrib...
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Others |
Language: | zh-TW |
Published: |
2019
|
Online Access: | http://ndltd.ncl.edu.tw/handle/a9nvqc |
id |
ndltd-TW-107FJU00396033 |
---|---|
record_format |
oai_dc |
spelling |
ndltd-TW-107FJU003960332019-08-08T03:44:57Z http://ndltd.ncl.edu.tw/handle/a9nvqc Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter 藉由布隆過濾器來改善Hadoop記錄資料查詢 TSAI, PEI-FENG 蔡沛峰 碩士 輔仁大學 資訊工程學系碩士班 107 Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue. Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distributed File System) and the MapReduce computing framework, providing a viable solution for Big Data storage and computing. One of the common applications of Big Data is to store the plain-text log files into HDFS and use the MapReduce framework to query the data by the feature field. The application characteristic is WORM (Write Once Read Many). Based on the characteristics of this application method, we use the Bloom Filter to process the feature field of the log data, so the HDFS system can reduce the number of file read to retrieve the log files we need. YEH, TSO-ZEN 葉佐任 2019 學位論文 ; thesis 59 zh-TW |
collection |
NDLTD |
language |
zh-TW |
format |
Others
|
sources |
NDLTD |
description |
碩士 === 輔仁大學 === 資訊工程學系碩士班 === 107 === Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue.
Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distributed File System) and the MapReduce computing framework, providing a viable solution for Big Data storage and computing.
One of the common applications of Big Data is to store the plain-text log files into HDFS and use the MapReduce framework to query the data by the feature field. The application characteristic is WORM (Write Once Read Many).
Based on the characteristics of this application method, we use the Bloom Filter to process the feature field of the log data, so the HDFS system can reduce the number of file read to retrieve the log files we need.
|
author2 |
YEH, TSO-ZEN |
author_facet |
YEH, TSO-ZEN TSAI, PEI-FENG 蔡沛峰 |
author |
TSAI, PEI-FENG 蔡沛峰 |
spellingShingle |
TSAI, PEI-FENG 蔡沛峰 Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter |
author_sort |
TSAI, PEI-FENG |
title |
Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter |
title_short |
Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter |
title_full |
Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter |
title_fullStr |
Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter |
title_full_unstemmed |
Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter |
title_sort |
improving the query efficiency of log data on hadoop through the bloom filter |
publishDate |
2019 |
url |
http://ndltd.ncl.edu.tw/handle/a9nvqc |
work_keys_str_mv |
AT tsaipeifeng improvingthequeryefficiencyoflogdataonhadoopthroughthebloomfilter AT càipèifēng improvingthequeryefficiencyoflogdataonhadoopthroughthebloomfilter AT tsaipeifeng jíyóubùlóngguòlǜqìláigǎishànhadoopjìlùzīliàocháxún AT càipèifēng jíyóubùlóngguòlǜqìláigǎishànhadoopjìlùzīliàocháxún |
_version_ |
1719233227122540544 |