Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter

碩士 === 輔仁大學 === 資訊工程學系碩士班 === 107 === Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue. Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distrib...

Full description

Bibliographic Details
Main Authors: TSAI, PEI-FENG, 蔡沛峰
Other Authors: YEH, TSO-ZEN
Format: Others
Language:zh-TW
Published: 2019
Online Access:http://ndltd.ncl.edu.tw/handle/a9nvqc
id ndltd-TW-107FJU00396033
record_format oai_dc
spelling ndltd-TW-107FJU003960332019-08-08T03:44:57Z http://ndltd.ncl.edu.tw/handle/a9nvqc Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter 藉由布隆過濾器來改善Hadoop記錄資料查詢 TSAI, PEI-FENG 蔡沛峰 碩士 輔仁大學 資訊工程學系碩士班 107 Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue. Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distributed File System) and the MapReduce computing framework, providing a viable solution for Big Data storage and computing. One of the common applications of Big Data is to store the plain-text log files into HDFS and use the MapReduce framework to query the data by the feature field. The application characteristic is WORM (Write Once Read Many). Based on the characteristics of this application method, we use the Bloom Filter to process the feature field of the log data, so the HDFS system can reduce the number of file read to retrieve the log files we need. YEH, TSO-ZEN 葉佐任 2019 學位論文 ; thesis 59 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description 碩士 === 輔仁大學 === 資訊工程學系碩士班 === 107 === Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue. Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distributed File System) and the MapReduce computing framework, providing a viable solution for Big Data storage and computing. One of the common applications of Big Data is to store the plain-text log files into HDFS and use the MapReduce framework to query the data by the feature field. The application characteristic is WORM (Write Once Read Many). Based on the characteristics of this application method, we use the Bloom Filter to process the feature field of the log data, so the HDFS system can reduce the number of file read to retrieve the log files we need.
author2 YEH, TSO-ZEN
author_facet YEH, TSO-ZEN
TSAI, PEI-FENG
蔡沛峰
author TSAI, PEI-FENG
蔡沛峰
spellingShingle TSAI, PEI-FENG
蔡沛峰
Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
author_sort TSAI, PEI-FENG
title Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_short Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_full Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_fullStr Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_full_unstemmed Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_sort improving the query efficiency of log data on hadoop through the bloom filter
publishDate 2019
url http://ndltd.ncl.edu.tw/handle/a9nvqc
work_keys_str_mv AT tsaipeifeng improvingthequeryefficiencyoflogdataonhadoopthroughthebloomfilter
AT càipèifēng improvingthequeryefficiencyoflogdataonhadoopthroughthebloomfilter
AT tsaipeifeng jíyóubùlóngguòlǜqìláigǎishànhadoopjìlùzīliàocháxún
AT càipèifēng jíyóubùlóngguòlǜqìláigǎishànhadoopjìlùzīliàocháxún
_version_ 1719233227122540544