Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter

碩士 === 輔仁大學 === 資訊工程學系碩士班 === 107 === Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue. Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distrib...

Full description

Bibliographic Details
Main Authors:	TSAI, PEI-FENG, 蔡沛峰
Other Authors:	YEH, TSO-ZEN
Format:	Others
Language:	zh-TW
Published:	2019
Online Access:	http://ndltd.ncl.edu.tw/handle/a9nvqc

id	ndltd-TW-107FJU00396033
record_format	oai_dc
spelling	ndltd-TW-107FJU003960332019-08-08T03:44:57Z http://ndltd.ncl.edu.tw/handle/a9nvqc Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter 藉由布隆過濾器來改善Hadoop記錄資料查詢 TSAI, PEI-FENG 蔡沛峰碩士輔仁大學資訊工程學系碩士班 107 Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue. Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distributed File System) and the MapReduce computing framework, providing a viable solution for Big Data storage and computing. One of the common applications of Big Data is to store the plain-text log files into HDFS and use the MapReduce framework to query the data by the feature field. The application characteristic is WORM (Write Once Read Many). Based on the characteristics of this application method, we use the Bloom Filter to process the feature field of the log data, so the HDFS system can reduce the number of file read to retrieve the log files we need. YEH, TSO-ZEN 葉佐任 2019 學位論文 ; thesis 59 zh-TW
collection	NDLTD
language	zh-TW
format	Others
sources	NDLTD
description	碩士 === 輔仁大學 === 資訊工程學系碩士班 === 107 === Due to the rapid development of the Internet, with the rapid growth of various electronic forms of data, the storage and calculation of Big Data has become an important issue. Hadoop is an open source cloud system platform that includes the HDFS (Hadoop Distributed File System) and the MapReduce computing framework, providing a viable solution for Big Data storage and computing. One of the common applications of Big Data is to store the plain-text log files into HDFS and use the MapReduce framework to query the data by the feature field. The application characteristic is WORM (Write Once Read Many). Based on the characteristics of this application method, we use the Bloom Filter to process the feature field of the log data, so the HDFS system can reduce the number of file read to retrieve the log files we need.
author2	YEH, TSO-ZEN
author_facet	YEH, TSO-ZEN TSAI, PEI-FENG 蔡沛峰
author	TSAI, PEI-FENG 蔡沛峰
spellingShingle	TSAI, PEI-FENG 蔡沛峰 Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
author_sort	TSAI, PEI-FENG
title	Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_short	Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_full	Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_fullStr	Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_full_unstemmed	Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter
title_sort	improving the query efficiency of log data on hadoop through the bloom filter
publishDate	2019
url	http://ndltd.ncl.edu.tw/handle/a9nvqc
work_keys_str_mv	AT tsaipeifeng improvingthequeryefficiencyoflogdataonhadoopthroughthebloomfilter AT càipèifēng improvingthequeryefficiencyoflogdataonhadoopthroughthebloomfilter AT tsaipeifeng jíyóubùlóngguòlǜqìláigǎishànhadoopjìlùzīliàocháxún AT càipèifēng jíyóubùlóngguòlǜqìláigǎishànhadoopjìlùzīliàocháxún
_version_	1719233227122540544

Improving the Query Efficiency of Log Data on Hadoop through the Bloom Filter

Similar Items