Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics

Information is exploding on the web at exponential pace, and online movie review over the web is a substantial source of information for online users. However, users write millions of movie reviews on regular basis, and it is not possible for users to condense the reviews. Classification and summari...

Full description

Bibliographic Details
Main Authors: Atif Khan, Muhammad Adnan Gul, M. Irfan Uddin, Syed Atif Ali Shah, Shafiq Ahmad, Muhammad Dzulqarnain Al Firdausi, Mazen Zaindin
Format: Article
Language:English
Published: Hindawi Limited 2020-01-01
Series:Scientific Programming
Online Access:http://dx.doi.org/10.1155/2020/5812715
id doaj-d2fa045365104782a8641be92bed1387
record_format Article
spelling doaj-d2fa045365104782a8641be92bed13872021-07-19T01:04:13ZengHindawi LimitedScientific Programming1875-919X2020-01-01202010.1155/2020/58127155812715Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data AnalyticsAtif Khan0Muhammad Adnan Gul1M. Irfan Uddin2Syed Atif Ali Shah3Shafiq Ahmad4Muhammad Dzulqarnain Al Firdausi5Mazen Zaindin6Department of Computer ScienceDepartment of Computer ScienceInstitute of ComputingFaculty of Engineering and Information TechnologyIndustrial Engineering DepartmentIndustrial Engineering DepartmentDepartment of Statistics and Operations ResearchInformation is exploding on the web at exponential pace, and online movie review over the web is a substantial source of information for online users. However, users write millions of movie reviews on regular basis, and it is not possible for users to condense the reviews. Classification and summarization of reviews is a difficult task in computational linguistics. Hence, an automatic method is demanded to summarize the vast amount of movie reviews, and this method will permit the users to speedily distinguish between positive and negative features of a movie. This work has proposed a classification and summarization method for movie reviews. For movie review classification, bag-of-words feature extraction technique is used to extract unigrams, bigrams, and trigrams as a feature set from given review documents and represent the review documents as a vector. Next, the Na¨ıve Bayes algorithm is employed to categorize the movie reviews (signified as a feature vector) into negative and positive reviews. For the task of movie review summarization, word2vec model is used to extract features from classified movie review sentences, and then semantic clustering technique is used to cluster semantically related review sentences. Different text features are employed to compute the salience score of all review sentences in clusters. Finally, the best-ranked review sentences are picked based on top salience scores to form a summary of movie reviews. Empirical results indicate that the suggested machine learning approach performed better than benchmark summarization approaches.http://dx.doi.org/10.1155/2020/5812715
collection DOAJ
language English
format Article
sources DOAJ
author Atif Khan
Muhammad Adnan Gul
M. Irfan Uddin
Syed Atif Ali Shah
Shafiq Ahmad
Muhammad Dzulqarnain Al Firdausi
Mazen Zaindin
spellingShingle Atif Khan
Muhammad Adnan Gul
M. Irfan Uddin
Syed Atif Ali Shah
Shafiq Ahmad
Muhammad Dzulqarnain Al Firdausi
Mazen Zaindin
Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics
Scientific Programming
author_facet Atif Khan
Muhammad Adnan Gul
M. Irfan Uddin
Syed Atif Ali Shah
Shafiq Ahmad
Muhammad Dzulqarnain Al Firdausi
Mazen Zaindin
author_sort Atif Khan
title Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics
title_short Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics
title_full Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics
title_fullStr Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics
title_full_unstemmed Summarizing Online Movie Reviews: A Machine Learning Approach to Big Data Analytics
title_sort summarizing online movie reviews: a machine learning approach to big data analytics
publisher Hindawi Limited
series Scientific Programming
issn 1875-919X
publishDate 2020-01-01
description Information is exploding on the web at exponential pace, and online movie review over the web is a substantial source of information for online users. However, users write millions of movie reviews on regular basis, and it is not possible for users to condense the reviews. Classification and summarization of reviews is a difficult task in computational linguistics. Hence, an automatic method is demanded to summarize the vast amount of movie reviews, and this method will permit the users to speedily distinguish between positive and negative features of a movie. This work has proposed a classification and summarization method for movie reviews. For movie review classification, bag-of-words feature extraction technique is used to extract unigrams, bigrams, and trigrams as a feature set from given review documents and represent the review documents as a vector. Next, the Na¨ıve Bayes algorithm is employed to categorize the movie reviews (signified as a feature vector) into negative and positive reviews. For the task of movie review summarization, word2vec model is used to extract features from classified movie review sentences, and then semantic clustering technique is used to cluster semantically related review sentences. Different text features are employed to compute the salience score of all review sentences in clusters. Finally, the best-ranked review sentences are picked based on top salience scores to form a summary of movie reviews. Empirical results indicate that the suggested machine learning approach performed better than benchmark summarization approaches.
url http://dx.doi.org/10.1155/2020/5812715
work_keys_str_mv AT atifkhan summarizingonlinemoviereviewsamachinelearningapproachtobigdataanalytics
AT muhammadadnangul summarizingonlinemoviereviewsamachinelearningapproachtobigdataanalytics
AT mirfanuddin summarizingonlinemoviereviewsamachinelearningapproachtobigdataanalytics
AT syedatifalishah summarizingonlinemoviereviewsamachinelearningapproachtobigdataanalytics
AT shafiqahmad summarizingonlinemoviereviewsamachinelearningapproachtobigdataanalytics
AT muhammaddzulqarnainalfirdausi summarizingonlinemoviereviewsamachinelearningapproachtobigdataanalytics
AT mazenzaindin summarizingonlinemoviereviewsamachinelearningapproachtobigdataanalytics
_version_ 1721295601841209344