Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing
Digital libraries, journals and conference proceedings repositories are a great source of information. These sources are very useful for the purpose of research and development. This paper presents an overview of text mining and its application towards information extraction from literature. In this...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
IEEE
2019-01-01
|
Series: | IEEE Access |
Subjects: | |
Online Access: | https://ieeexplore.ieee.org/document/8886390/ |
id |
doaj-5f7b7515cea8413ea3ffbd4d24caa58a |
---|---|
record_format |
Article |
spelling |
doaj-5f7b7515cea8413ea3ffbd4d24caa58a2021-03-30T00:38:31ZengIEEEIEEE Access2169-35362019-01-01716225416226710.1109/ACCESS.2019.29500458886390Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data ProcessingMuhammad Inaam Ul Haq0https://orcid.org/0000-0002-5759-073XQianmu Li1https://orcid.org/0000-0002-0998-1517Shoaib Hassan2School of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaSchool of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaSchool of Computer Science and Engineering, Nanjing University of Science and Technology, Nanjing, ChinaDigital libraries, journals and conference proceedings repositories are a great source of information. These sources are very useful for the purpose of research and development. This paper presents an overview of text mining and its application towards information extraction from literature. In this study, we used word cloud, term frequency analysis, similarity analysis, cluster analysis, and topic modeling to extract information from multi-domain research articles. Cloud computing and big data are new emerging trends. So it is important to extract useful patterns and knowledge from published articles in these domains and discover the relationship between them. Therefore, a total of two hundred research articles published from 2010 to 2018 in these domains, were selected. The source of these articles is high impact factor journals from reputed publishers namely IEEE, Springer, Wiley, Elsevier, and ACM. It is a cross-domain analysis in cloud computing and big data domains to find the latest trends, related topics, tools, terms, and author affiliation from extracted data. This study identifies the ten major areas of big data using cloud computing, fourteen factors towards cloud adoption, and hurdles in adoption. Moreover finding shows that IEEE has more sources for subject cloud computing application towards big data, then comes Springer, Wiley, and Elsevier. Furthermore, it has been observed in the analysis that the number of articles in these domains increased from 2013 onward.https://ieeexplore.ieee.org/document/8886390/Text miningtopic modelingcloud computingbig datainformation extractionliterature analysis |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Muhammad Inaam Ul Haq Qianmu Li Shoaib Hassan |
spellingShingle |
Muhammad Inaam Ul Haq Qianmu Li Shoaib Hassan Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing IEEE Access Text mining topic modeling cloud computing big data information extraction literature analysis |
author_facet |
Muhammad Inaam Ul Haq Qianmu Li Shoaib Hassan |
author_sort |
Muhammad Inaam Ul Haq |
title |
Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing |
title_short |
Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing |
title_full |
Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing |
title_fullStr |
Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing |
title_full_unstemmed |
Text Mining Techniques to Capture Facts for Cloud Computing Adoption and Big Data Processing |
title_sort |
text mining techniques to capture facts for cloud computing adoption and big data processing |
publisher |
IEEE |
series |
IEEE Access |
issn |
2169-3536 |
publishDate |
2019-01-01 |
description |
Digital libraries, journals and conference proceedings repositories are a great source of information. These sources are very useful for the purpose of research and development. This paper presents an overview of text mining and its application towards information extraction from literature. In this study, we used word cloud, term frequency analysis, similarity analysis, cluster analysis, and topic modeling to extract information from multi-domain research articles. Cloud computing and big data are new emerging trends. So it is important to extract useful patterns and knowledge from published articles in these domains and discover the relationship between them. Therefore, a total of two hundred research articles published from 2010 to 2018 in these domains, were selected. The source of these articles is high impact factor journals from reputed publishers namely IEEE, Springer, Wiley, Elsevier, and ACM. It is a cross-domain analysis in cloud computing and big data domains to find the latest trends, related topics, tools, terms, and author affiliation from extracted data. This study identifies the ten major areas of big data using cloud computing, fourteen factors towards cloud adoption, and hurdles in adoption. Moreover finding shows that IEEE has more sources for subject cloud computing application towards big data, then comes Springer, Wiley, and Elsevier. Furthermore, it has been observed in the analysis that the number of articles in these domains increased from 2013 onward. |
topic |
Text mining topic modeling cloud computing big data information extraction literature analysis |
url |
https://ieeexplore.ieee.org/document/8886390/ |
work_keys_str_mv |
AT muhammadinaamulhaq textminingtechniquestocapturefactsforcloudcomputingadoptionandbigdataprocessing AT qianmuli textminingtechniquestocapturefactsforcloudcomputingadoptionandbigdataprocessing AT shoaibhassan textminingtechniquestocapturefactsforcloudcomputingadoptionandbigdataprocessing |
_version_ |
1724188022492102656 |