SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic Corridor

With the growing availability of internet and opinion rich resources such as social networks and personal blogs, the task of mining public opinion and exploring facts has become more popular than ever before during the last decade. The latest trend has deeply transformed the way the governments inte...

Full description

Bibliographic Details
Main Authors: Bibi Amina, Tayyaba Azim
Format: Article
Language:English
Published: IEEE 2019-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/8831392/
id doaj-5a0577b03264496c98f3dd1c58a4c13e
record_format Article
spelling doaj-5a0577b03264496c98f3dd1c58a4c13e2021-04-05T17:14:58ZengIEEEIEEE Access2169-35362019-01-01713387613388710.1109/ACCESS.2019.29405288831392SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic CorridorBibi Amina0https://orcid.org/0000-0002-3509-1788Tayyaba Azim1https://orcid.org/0000-0002-5122-2397Center of Excellence in IT, Institute of Management Sciences, Peshawar, PakistanCenter of Excellence in IT, Institute of Management Sciences, Peshawar, PakistanWith the growing availability of internet and opinion rich resources such as social networks and personal blogs, the task of mining public opinion and exploring facts has become more popular than ever before during the last decade. The latest trend has deeply transformed the way the governments interact with their citizens and offer them various services through continuous public engagement. The proposed framework SCANCPECLENS is an initiative to support performance assessment framework for e-government in Pakistan. The research takes into account the opinion of masses on one of the most crucial and widely discussed development projects, China Pakistan Economic Corridor (CPEC), considered as a game changer due to its promise of bringing economic prosperity to the region. The proposed framework suggests to use machine learning algorithms to automatically discover the public sentiment from microblogs on the matter nationally as well as internationally. We also present an automated way to create sentiment lexicon of positive, negative and neutral words on the subject. To the best of our knowledge, this theme has not been explored for opinion mining before and helps one in effectively assessing public satisfaction over government's policies in the CPEC region. The research is an initiative to discover new avenues of future research and direction for the government, policy making institutions and other stake holders and demonstrates the power of text mining as an effective tool to extract business value from vast amount of social media data.https://ieeexplore.ieee.org/document/8831392/China Pakistan economic corridor (CPEC)<italic xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">k</italic>- nearest neighborlogistic regressionlexicon generationmachine learningnatural language processing
collection DOAJ
language English
format Article
sources DOAJ
author Bibi Amina
Tayyaba Azim
spellingShingle Bibi Amina
Tayyaba Azim
SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic Corridor
IEEE Access
China Pakistan economic corridor (CPEC)
<italic xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">k</italic>- nearest neighbor
logistic regression
lexicon generation
machine learning
natural language processing
author_facet Bibi Amina
Tayyaba Azim
author_sort Bibi Amina
title SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic Corridor
title_short SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic Corridor
title_full SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic Corridor
title_fullStr SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic Corridor
title_full_unstemmed SCANCPECLENS: A Framework for Automatic Lexicon Generation and Sentiment Analysis of Micro Blogging Data on China Pakistan Economic Corridor
title_sort scancpeclens: a framework for automatic lexicon generation and sentiment analysis of micro blogging data on china pakistan economic corridor
publisher IEEE
series IEEE Access
issn 2169-3536
publishDate 2019-01-01
description With the growing availability of internet and opinion rich resources such as social networks and personal blogs, the task of mining public opinion and exploring facts has become more popular than ever before during the last decade. The latest trend has deeply transformed the way the governments interact with their citizens and offer them various services through continuous public engagement. The proposed framework SCANCPECLENS is an initiative to support performance assessment framework for e-government in Pakistan. The research takes into account the opinion of masses on one of the most crucial and widely discussed development projects, China Pakistan Economic Corridor (CPEC), considered as a game changer due to its promise of bringing economic prosperity to the region. The proposed framework suggests to use machine learning algorithms to automatically discover the public sentiment from microblogs on the matter nationally as well as internationally. We also present an automated way to create sentiment lexicon of positive, negative and neutral words on the subject. To the best of our knowledge, this theme has not been explored for opinion mining before and helps one in effectively assessing public satisfaction over government's policies in the CPEC region. The research is an initiative to discover new avenues of future research and direction for the government, policy making institutions and other stake holders and demonstrates the power of text mining as an effective tool to extract business value from vast amount of social media data.
topic China Pakistan economic corridor (CPEC)
<italic xmlns:ali="http://www.niso.org/schemas/ali/1.0/" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">k</italic>- nearest neighbor
logistic regression
lexicon generation
machine learning
natural language processing
url https://ieeexplore.ieee.org/document/8831392/
work_keys_str_mv AT bibiamina scancpeclensaframeworkforautomaticlexicongenerationandsentimentanalysisofmicrobloggingdataonchinapakistaneconomiccorridor
AT tayyabaazim scancpeclensaframeworkforautomaticlexicongenerationandsentimentanalysisofmicrobloggingdataonchinapakistaneconomiccorridor
_version_ 1721540018058559488