Predicting attributes based movie success through ensemble machine learning

被引:0
作者
Vedika Gupta
Nikita Jain
Harshit Garg
Srishti Jhunthra
Senthilkumar Mohan
Abdullah Hisam Omar
Ali Ahmadian
机构
[1] Bharati Vidyapeeth’s College of Engineering,Department of Computer Science & Engineering
[2] Vellore Institute of Technology,School of Information Technology and Engineering
[3] Universiti Teknologi Malaysia,Faculty of Built Environment and Surveying
[4] Institute of IR 4.0,Department of Mathematics
[5] The National University of Malaysia,undefined
[6] UKM,undefined
[7] Near East University,undefined
来源
Multimedia Tools and Applications | 2023年 / 82卷
关键词
Boosting ensemble technique; IMDb; KNN; Machine learning; Movie success prediction; MLP-NN; Naive bayes; SVM; Voting ensemble technique;
D O I
暂无
中图分类号
学科分类号
摘要
The film industry has grown into a multi-billionaire industry in terms of entertainment. The success of the film industry depends on the criteria that how much profit a movie would make which gives the tag of a ‘hit’ or a ‘flop’. Predicting the success is guided by various factors like genre, date of release, actors, net gross and many more. Understanding the stakes involved with a movie release that can affect its success or a failure, before-hand can be a great step towards the expansion of the film industry business. Therefore, this study proposes an ensemble learning strategy as a solution to analyze such understanding where predictions from previously guided attribute calculations can be used to enhance future success/failure accuracy. This study shows various strategies used in the literature to analyze and compare the results obtained. The various machines learning algorithms SVM, KNN, Naive Bayes, Boosting Ensemble Technique, Stacking Ensemble Technique, Voting Ensemble Technique, and MLP Neural Network are applied on the dataset to predict the box office success of a movie. The paper uses various algorithms and their trends in predicting the outcome of a movie and shows that the proposed methodology outperforms the existing studies. The most effective algorithm in the study is Gradient Boosting with a success rate of 84.1297%.
引用
收藏
页码:9597 / 9626
页数:29
相关论文
共 47 条
  • [1] Bauer E(1999)An empirical comparison of voting classification algorithms: Bagging, boosting, and variants Mach Learn 36 105-139
  • [2] Kohavi R(2021)TLCoV-An automated Covid-19 screening model using transfer learning from chest X-ray images Chaos Solitons Fractals 144 110713-6433
  • [3] Das AK(2010)Predicting movie prices through dynamic social network analysis Procedia Soc Behav Sci 2 6423-1493
  • [4] Kalam S(2009)K nearest neighbours with mutual information for simultaneous classification and missing data imputation Neurocomputing 72 1483-313
  • [5] Kumar C(2013)Prediction of movie success using sentiment analysis of tweets Int J Soft Comput Softw Eng 3 308-903
  • [6] Sinha D(2016)Early predictions of movie success: the who, what, and when of profitability J Manag Inf Syst 33 874-22
  • [7] Doshi L(2016)Prediction of movies popularity using machine learning techniques Int J Comput Sci Netw Sec 16 127-81
  • [8] Krauss J(2002)Classification and regression by randomForest R News 2 18-368
  • [9] Nann S(2015)Comparison of four Adaboost algorithm based artificial neural networks in wind speed predictions Energy Convers Manage 92 67-3311
  • [10] Gloor P(2021)Prediction modelling of COVID using machine learning methods from B-cell dataset Results Phys 21 103813-7