A Recommendation Mechanism for Under-Emphasized Tourist Spots Using Topic Modeling and Sentiment Analysis
With rapid advancements in internet applications, the growth rate of recommendation systems for tourists has skyrocketed. This has generated an enormous amount of travel-based data in the form of reviews, blogs, and ratings. However, most recommendation systems only recommend the top-rated places. A...
Main Authors: | , |
---|---|
Format: | Article |
Language: | English |
Published: |
MDPI AG
2019-12-01
|
Series: | Sustainability |
Subjects: | |
Online Access: | https://www.mdpi.com/2071-1050/12/1/320 |
id |
doaj-e6f46a1a3aaa4001af0c827e19155691 |
---|---|
record_format |
Article |
spelling |
doaj-e6f46a1a3aaa4001af0c827e191556912020-11-25T01:40:32ZengMDPI AGSustainability2071-10502019-12-0112132010.3390/su12010320su12010320A Recommendation Mechanism for Under-Emphasized Tourist Spots Using Topic Modeling and Sentiment AnalysisWafa Shafqat0Yung-Cheol Byun1Department of Computer Engineering, Jeju National University, Jeju 63243, KoreaDepartment of Computer Engineering, Jeju National University, Jeju 63243, KoreaWith rapid advancements in internet applications, the growth rate of recommendation systems for tourists has skyrocketed. This has generated an enormous amount of travel-based data in the form of reviews, blogs, and ratings. However, most recommendation systems only recommend the top-rated places. Along with the top-ranked places, we aim to discover places that are often ignored by tourists owing to lack of promotion or effective advertising, referred to as under-emphasized locations. In this study, we use all relevant data, such as travel blogs, ratings, and reviews, in order to obtain optimal recommendations. We also aim to discover the latent factors that need to be addressed, such as food, cleanliness, and opening hours, and recommend a tourist place based on user history data. In this study, we propose a cross mapping table approach based on the location’s popularity, ratings, latent topics, and sentiments. An objective function for recommendation optimization is formulated based on these mappings. The baseline algorithms are latent Dirichlet allocation (LDA) and support vector machine (SVM). Our results show that the combined features of LDA, SVM, ratings, and cross mappings are conducive to enhanced performance. The main motivation of this study was to help tourist industries to direct more attention towards designing effective promotional activities for under-emphasized locations.https://www.mdpi.com/2071-1050/12/1/320latent dirichlet allocation (lda)support vector machine (svm)cross mapping tableslocation’s popularity indexunder-emphasized locations |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Wafa Shafqat Yung-Cheol Byun |
spellingShingle |
Wafa Shafqat Yung-Cheol Byun A Recommendation Mechanism for Under-Emphasized Tourist Spots Using Topic Modeling and Sentiment Analysis Sustainability latent dirichlet allocation (lda) support vector machine (svm) cross mapping tables location’s popularity index under-emphasized locations |
author_facet |
Wafa Shafqat Yung-Cheol Byun |
author_sort |
Wafa Shafqat |
title |
A Recommendation Mechanism for Under-Emphasized Tourist Spots Using Topic Modeling and Sentiment Analysis |
title_short |
A Recommendation Mechanism for Under-Emphasized Tourist Spots Using Topic Modeling and Sentiment Analysis |
title_full |
A Recommendation Mechanism for Under-Emphasized Tourist Spots Using Topic Modeling and Sentiment Analysis |
title_fullStr |
A Recommendation Mechanism for Under-Emphasized Tourist Spots Using Topic Modeling and Sentiment Analysis |
title_full_unstemmed |
A Recommendation Mechanism for Under-Emphasized Tourist Spots Using Topic Modeling and Sentiment Analysis |
title_sort |
recommendation mechanism for under-emphasized tourist spots using topic modeling and sentiment analysis |
publisher |
MDPI AG |
series |
Sustainability |
issn |
2071-1050 |
publishDate |
2019-12-01 |
description |
With rapid advancements in internet applications, the growth rate of recommendation systems for tourists has skyrocketed. This has generated an enormous amount of travel-based data in the form of reviews, blogs, and ratings. However, most recommendation systems only recommend the top-rated places. Along with the top-ranked places, we aim to discover places that are often ignored by tourists owing to lack of promotion or effective advertising, referred to as under-emphasized locations. In this study, we use all relevant data, such as travel blogs, ratings, and reviews, in order to obtain optimal recommendations. We also aim to discover the latent factors that need to be addressed, such as food, cleanliness, and opening hours, and recommend a tourist place based on user history data. In this study, we propose a cross mapping table approach based on the location’s popularity, ratings, latent topics, and sentiments. An objective function for recommendation optimization is formulated based on these mappings. The baseline algorithms are latent Dirichlet allocation (LDA) and support vector machine (SVM). Our results show that the combined features of LDA, SVM, ratings, and cross mappings are conducive to enhanced performance. The main motivation of this study was to help tourist industries to direct more attention towards designing effective promotional activities for under-emphasized locations. |
topic |
latent dirichlet allocation (lda) support vector machine (svm) cross mapping tables location’s popularity index under-emphasized locations |
url |
https://www.mdpi.com/2071-1050/12/1/320 |
work_keys_str_mv |
AT wafashafqat arecommendationmechanismforunderemphasizedtouristspotsusingtopicmodelingandsentimentanalysis AT yungcheolbyun arecommendationmechanismforunderemphasizedtouristspotsusingtopicmodelingandsentimentanalysis AT wafashafqat recommendationmechanismforunderemphasizedtouristspotsusingtopicmodelingandsentimentanalysis AT yungcheolbyun recommendationmechanismforunderemphasizedtouristspotsusingtopicmodelingandsentimentanalysis |
_version_ |
1725045098124673024 |