Study and Implementation of RHadoop Technology

Bibliographic Details
Main Authors: Chih-Hung Liao, 廖知航
Other Authors: Jieh-Shan Yeh
Format: Others
Language: zh-TW
Published: 2015
Online Access: http://ndltd.ncl.edu.tw/handle/17266966378211785338
id ndltd-TW-104PU000396001
record_format oai_dc
spelling ndltd-TW-104PU000396001 2016-07-31T04:21:52Z http://ndltd.ncl.edu.tw/handle/17266966378211785338 Study and Implementation of RHadoop Technology RHadoop技術探討與實作 Chih-Hung Liao 廖知航 Master's, Providence University, Department of Information Management, 103 Jieh-Shan Yeh 葉介山 2015 thesis 87 zh-TW
collection NDLTD
language zh-TW
format Others
sources NDLTD
description Master's === Providence University === Department of Information Management === 103 === Over the past decade or more, the rapid development of the Internet and cloud technology has generated a great variety of big data. Big data has become a trend and has radically changed every aspect of data storage, management, and handling. Enterprises now need to store far more data than before, from wider sources and in more diverse forms, and must understand how to transform big data into more valuable information. Powerful software therefore plays a crucial role in the information, digital, and big-data era. For example, Apache Hadoop not only offers low cost and high benefit, it also offers high agility, fast processing speed, strong debugging ability, and better scalability, all of which help in accessing and handling big data. RHadoop, coupled with the R language, can combine many data sources, which further helps in utilizing big data. This study combines Hadoop and the R language to carry out data mining, aiming to use Hadoop's parallel distributed processing and the statistical algorithms of R to explore the storage, handling, and retrieval of big data. The research uses real product sales data provided by the TH enterprise as its data set, which consists of a total of 2,8,12,555 records.
author2 Jieh-Shan Yeh
author Chih-Hung Liao
廖知航
title Study and Implementation of RHadoop Technology
publishDate 2015
url http://ndltd.ncl.edu.tw/handle/17266966378211785338