Comparing performance of ensemble methods in predicting movie box office revenue
While many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble meth...
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
Elsevier
2020-06-01
|
Series: | Heliyon |
Subjects: | |
Online Access: | http://www.sciencedirect.com/science/article/pii/S240584402031104X |
id |
doaj-b297da8bb88347aca969f23ef43a2fd8 |
---|---|
record_format |
Article |
spelling |
doaj-b297da8bb88347aca969f23ef43a2fd82020-11-25T02:55:13ZengElsevierHeliyon2405-84402020-06-0166e04260Comparing performance of ensemble methods in predicting movie box office revenueSangjae Lee0Bikash KC1Joon Yeon Choeh2College of Business Administration, Sejong University, Seoul 05006, South Korea; Corresponding author.College of Business Administration, Sejong University, Seoul 05006, South KoreaDepartment of Software, Sejong University, Seoul 05006, South KoreaWhile many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble methods and the prediction performance of decision trees based on random forests, bagging and boosting are compared with that of k-NN and linear regression based on bagging and boosting using the sample of 1439 movies. The results indicate that ensemble methods based on decision trees (random forests, bagging, boosting) outperform ensemble methods based on k-NN (bagging, boosting) in predicting box office at week 1, 2, 3 after release. Decision trees using ensemble methods provide better prediction performance than ensemble methods based on linear regression analysis in the box office at week 1 after release. This is explained by the results that after comparing the prediction performance between ensemble methods and non-ensemble methods. For decision tree methods, unlike the other methods, the prediction performance of ensemble methods is greater than that of non-ensemble methods. This shows that decision trees using ensemble methods provide better application effectiveness of ensemble methods than k-NN and linear regression analysis.http://www.sciencedirect.com/science/article/pii/S240584402031104XMovie box office revenueEnsemble methodsPrediction of box office revenueDecision treesData analysisData analytics |
collection |
DOAJ |
language |
English |
format |
Article |
sources |
DOAJ |
author |
Sangjae Lee Bikash KC Joon Yeon Choeh |
spellingShingle |
Sangjae Lee Bikash KC Joon Yeon Choeh Comparing performance of ensemble methods in predicting movie box office revenue Heliyon Movie box office revenue Ensemble methods Prediction of box office revenue Decision trees Data analysis Data analytics |
author_facet |
Sangjae Lee Bikash KC Joon Yeon Choeh |
author_sort |
Sangjae Lee |
title |
Comparing performance of ensemble methods in predicting movie box office revenue |
title_short |
Comparing performance of ensemble methods in predicting movie box office revenue |
title_full |
Comparing performance of ensemble methods in predicting movie box office revenue |
title_fullStr |
Comparing performance of ensemble methods in predicting movie box office revenue |
title_full_unstemmed |
Comparing performance of ensemble methods in predicting movie box office revenue |
title_sort |
comparing performance of ensemble methods in predicting movie box office revenue |
publisher |
Elsevier |
series |
Heliyon |
issn |
2405-8440 |
publishDate |
2020-06-01 |
description |
While many business intelligence methods have been applied to predict movie box office revenue, the studies using an ensemble approach to predict box office revenue are almost nonexistent. In this study, we propose decision trees, k-nearest-neighbors (k-NN), and linear regression using ensemble methods and the prediction performance of decision trees based on random forests, bagging and boosting are compared with that of k-NN and linear regression based on bagging and boosting using the sample of 1439 movies. The results indicate that ensemble methods based on decision trees (random forests, bagging, boosting) outperform ensemble methods based on k-NN (bagging, boosting) in predicting box office at week 1, 2, 3 after release. Decision trees using ensemble methods provide better prediction performance than ensemble methods based on linear regression analysis in the box office at week 1 after release. This is explained by the results that after comparing the prediction performance between ensemble methods and non-ensemble methods. For decision tree methods, unlike the other methods, the prediction performance of ensemble methods is greater than that of non-ensemble methods. This shows that decision trees using ensemble methods provide better application effectiveness of ensemble methods than k-NN and linear regression analysis. |
topic |
Movie box office revenue Ensemble methods Prediction of box office revenue Decision trees Data analysis Data analytics |
url |
http://www.sciencedirect.com/science/article/pii/S240584402031104X |
work_keys_str_mv |
AT sangjaelee comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue AT bikashkc comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue AT joonyeonchoeh comparingperformanceofensemblemethodsinpredictingmovieboxofficerevenue |
_version_ |
1724717490780504064 |