Toward Improving the Prediction Accuracy of Product Recommendation System Using Extreme Gradient Boosting and Encoding Approaches

With the spread of COVID-19, the “untact” culture in South Korea is expanding and customers are increasingly seeking for online services. A recommendation system serves as a decision-making indicator that helps users by suggesting items to be purchased in the future by exploring the symmetry between...

Full description

Bibliographic Details
Main Authors: Zeinab Shahbazi, Debapriya Hazra, Sejoon Park, Yung Cheol Byun
Format: Article
Language:English
Published: MDPI AG 2020-09-01
Series:Symmetry
Subjects:
Online Access:https://www.mdpi.com/2073-8994/12/9/1566
id doaj-2dfc3fd22bcf4083af91ff836a9853d4
record_format Article
spelling doaj-2dfc3fd22bcf4083af91ff836a9853d42020-11-25T01:29:30ZengMDPI AGSymmetry2073-89942020-09-01121566156610.3390/sym12091566Toward Improving the Prediction Accuracy of Product Recommendation System Using Extreme Gradient Boosting and Encoding ApproachesZeinab Shahbazi0Debapriya Hazra1Sejoon Park2Yung Cheol Byun3Department of Computer Engineering, Jeju National University, Jejusi 63243, KoreaDepartment of Computer Engineering, Jeju National University, Jejusi 63243, KoreaDepartment of Computer Engineering, Jeju National University, Jejusi 63243, KoreaDepartment of Computer Engineering, Jeju National University, Jejusi 63243, KoreaWith the spread of COVID-19, the “untact” culture in South Korea is expanding and customers are increasingly seeking for online services. A recommendation system serves as a decision-making indicator that helps users by suggesting items to be purchased in the future by exploring the symmetry between multiple user activity characteristics. A plethora of approaches are employed by the scientific community to design recommendation systems, including collaborative filtering, stereotyping, and content-based filtering, etc. The current paradigm of recommendation systems favors collaborative filtering due to its significant potential to closely capture the interest of a user as compared to other approaches. The collaborative filtering harnesses features like user-profile details, visited pages, and click information to determine the interest of a user, thereby recommending the items that are related to the user’s interest. The existing collaborative filtering approaches exploit implicit and explicit features and report either good classification or prediction outcome. These systems fail to exhibit good results for both measures at the same time. We believe that avoiding the recommendation of those items that have already been purchased could contribute to overcoming the said issue. In this study, we present a collaborative filtering-based algorithm to tackle big data of user with symmetric purchasing order and repetitive purchased products. The proposed algorithm relies on combining extreme gradient boosting machine learning architecture with word2vec mechanism to explore the purchased products based on the click patterns of users. Our algorithm improves the accuracy of predicting the relevant products to be recommended to the customers that are likely to be bought. The results are evaluated on the dataset that contains click-based features of users from an online shopping mall in Jeju Island, South Korea. We have evaluated Mean Absolute Error, Mean Square Error, and Root Mean Square Error for our proposed methodology and also other machine learning algorithms. Our proposed model generated the least error rate and enhanced the prediction accuracy of the recommendation system compared to other traditional approaches.https://www.mdpi.com/2073-8994/12/9/1566recommendation systemcollaborative filteringsymmetric purchasing orderpredictive analysisdata miningXGBoost
collection DOAJ
language English
format Article
sources DOAJ
author Zeinab Shahbazi
Debapriya Hazra
Sejoon Park
Yung Cheol Byun
spellingShingle Zeinab Shahbazi
Debapriya Hazra
Sejoon Park
Yung Cheol Byun
Toward Improving the Prediction Accuracy of Product Recommendation System Using Extreme Gradient Boosting and Encoding Approaches
Symmetry
recommendation system
collaborative filtering
symmetric purchasing order
predictive analysis
data mining
XGBoost
author_facet Zeinab Shahbazi
Debapriya Hazra
Sejoon Park
Yung Cheol Byun
author_sort Zeinab Shahbazi
title Toward Improving the Prediction Accuracy of Product Recommendation System Using Extreme Gradient Boosting and Encoding Approaches
title_short Toward Improving the Prediction Accuracy of Product Recommendation System Using Extreme Gradient Boosting and Encoding Approaches
title_full Toward Improving the Prediction Accuracy of Product Recommendation System Using Extreme Gradient Boosting and Encoding Approaches
title_fullStr Toward Improving the Prediction Accuracy of Product Recommendation System Using Extreme Gradient Boosting and Encoding Approaches
title_full_unstemmed Toward Improving the Prediction Accuracy of Product Recommendation System Using Extreme Gradient Boosting and Encoding Approaches
title_sort toward improving the prediction accuracy of product recommendation system using extreme gradient boosting and encoding approaches
publisher MDPI AG
series Symmetry
issn 2073-8994
publishDate 2020-09-01
description With the spread of COVID-19, the “untact” culture in South Korea is expanding and customers are increasingly seeking for online services. A recommendation system serves as a decision-making indicator that helps users by suggesting items to be purchased in the future by exploring the symmetry between multiple user activity characteristics. A plethora of approaches are employed by the scientific community to design recommendation systems, including collaborative filtering, stereotyping, and content-based filtering, etc. The current paradigm of recommendation systems favors collaborative filtering due to its significant potential to closely capture the interest of a user as compared to other approaches. The collaborative filtering harnesses features like user-profile details, visited pages, and click information to determine the interest of a user, thereby recommending the items that are related to the user’s interest. The existing collaborative filtering approaches exploit implicit and explicit features and report either good classification or prediction outcome. These systems fail to exhibit good results for both measures at the same time. We believe that avoiding the recommendation of those items that have already been purchased could contribute to overcoming the said issue. In this study, we present a collaborative filtering-based algorithm to tackle big data of user with symmetric purchasing order and repetitive purchased products. The proposed algorithm relies on combining extreme gradient boosting machine learning architecture with word2vec mechanism to explore the purchased products based on the click patterns of users. Our algorithm improves the accuracy of predicting the relevant products to be recommended to the customers that are likely to be bought. The results are evaluated on the dataset that contains click-based features of users from an online shopping mall in Jeju Island, South Korea. We have evaluated Mean Absolute Error, Mean Square Error, and Root Mean Square Error for our proposed methodology and also other machine learning algorithms. Our proposed model generated the least error rate and enhanced the prediction accuracy of the recommendation system compared to other traditional approaches.
topic recommendation system
collaborative filtering
symmetric purchasing order
predictive analysis
data mining
XGBoost
url https://www.mdpi.com/2073-8994/12/9/1566
work_keys_str_mv AT zeinabshahbazi towardimprovingthepredictionaccuracyofproductrecommendationsystemusingextremegradientboostingandencodingapproaches
AT debapriyahazra towardimprovingthepredictionaccuracyofproductrecommendationsystemusingextremegradientboostingandencodingapproaches
AT sejoonpark towardimprovingthepredictionaccuracyofproductrecommendationsystemusingextremegradientboostingandencodingapproaches
AT yungcheolbyun towardimprovingthepredictionaccuracyofproductrecommendationsystemusingextremegradientboostingandencodingapproaches
_version_ 1725096710598819840