Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing

Digital libraries, journals and conference proceedings repositories are a great source of information. These sources are very useful for the purpose of research and development. This paper presents an overview of text mining and its application towards information extraction from literature. In this...

Full description

Bibliographic Details
Main Authors: Muhammad Inaam Ul Haq, Qianmu Li, Shoaib Hassan
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8886390/
id doaj-5f7b7515cea8413ea3ffbd4d24caa58a
record_format Article
spelling doaj-5f7b7515cea8413ea3ffbd4d24caa58a2021-03-30T00:38:31ZengIEEEIEEE Access2169-35362019-01-01716225416226710.1109/ACCESS.2019.29500458886390Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data ProcessingMuhammad Inaam Ul Haq0https://orcid.org/0000-0002-5759-073XQianmu Li1https://orcid.org/0000-0002-0998-1517Shoaib Hassan2School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaSchool of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaSchool of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaDigital libraries, journals and conference proceedings repositories are a great source of information. These sources are very useful for the purpose of research and development. This paper presents an overview of text mining and its application towards information extraction from literature. In this study, we used word cloud, term frequency analysis, similarity analysis, cluster analysis, and topic modeling to extract information from multi-domain research articles. Cloud computing and big data are new emerging trends. So it is important to extract useful patterns and knowledge from published articles in these domains and discover the relationship between them. Therefore, a total of two hundred research articles published from 2010 to 2018 in these domains, were selected. The source of these articles is high impact factor journals from reputed publishers namely IEEE, Springer, Wiley, Elsevier, and ACM. It is a cross-domain analysis in cloud computing and big data domains to find the latest trends, related topics, tools, terms, and author affiliation from extracted data. This study identifies the ten major areas of big data using cloud computing, fourteen factors towards cloud adoption, and hurdles in adoption. Moreover finding shows that IEEE has more sources for subject cloud computing application towards big data, then comes Springer, Wiley, and Elsevier. Furthermore, it has been observed in the analysis that the number of articles in these domains increased from 2013 onward.https://ieeexplore.ieee.org/document/8886390/Text miningtopic modelingcloud computingbig datainformation extractionliterature analysis
collection DOAJ
language English
format Article
sources DOAJ
author Muhammad Inaam Ul Haq
Qianmu Li
Shoaib Hassan
spellingShingle Muhammad Inaam Ul Haq
Qianmu Li
Shoaib Hassan
Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing
IEEE Access
Text mining
topic modeling
cloud computing
big data
information extraction
literature analysis
author_facet Muhammad Inaam Ul Haq
Qianmu Li
Shoaib Hassan
author_sort Muhammad Inaam Ul Haq
title Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing
title_short Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing
title_full Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing
title_fullStr Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing
title_full_unstemmed Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing
title_sort text mining techniques to capture facts for cloud computing adoption and big data processing
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description Digital libraries, journals and conference proceedings repositories are a great source of information. These sources are very useful for the purpose of research and development. This paper presents an overview of text mining and its application towards information extraction from literature. In this study, we used word cloud, term frequency analysis, similarity analysis, cluster analysis, and topic modeling to extract information from multi-domain research articles. Cloud computing and big data are new emerging trends. So it is important to extract useful patterns and knowledge from published articles in these domains and discover the relationship between them. Therefore, a total of two hundred research articles published from 2010 to 2018 in these domains, were selected. The source of these articles is high impact factor journals from reputed publishers namely IEEE, Springer, Wiley, Elsevier, and ACM. It is a cross-domain analysis in cloud computing and big data domains to find the latest trends, related topics, tools, terms, and author affiliation from extracted data. This study identifies the ten major areas of big data using cloud computing, fourteen factors towards cloud adoption, and hurdles in adoption. Moreover finding shows that IEEE has more sources for subject cloud computing application towards big data, then comes Springer, Wiley, and Elsevier. Furthermore, it has been observed in the analysis that the number of articles in these domains increased from 2013 onward.
topic Text mining
topic modeling
cloud computing
big data
information extraction
literature analysis
url https://ieeexplore.ieee.org/document/8886390/
work_keys_str_mv AT muhammadinaamulhaq textminingtechniquestocapturefactsforcloudcomputingadoptionandbigdataprocessing
AT qianmuli textminingtechniquestocapturefactsforcloudcomputingadoptionandbigdataprocessing
AT shoaibhassan textminingtechniquestocapturefactsforcloudcomputingadoptionandbigdataprocessing
_version_ 1724188022492102656