Machine learning-based new approach to films review

被引:0
作者
Mustafa Abdalrassual Jassim
Dhafar Hamed Abd
Mohamed Nazih Omri
机构
[1] University of Sousse,MARS Research Laboratory
[2] University of Monastir,Monastir Faculty of Science
[3] Al-Muthanna University,Department of Computer Science
[4] Al-Maaref University College,undefined
来源
Social Network Analysis and Mining | / 13卷
关键词
Sentiment analysis; Movie review; Machine learning; Word selection; Decision-making; Text analysis; Data science;
D O I
暂无
中图分类号
学科分类号
摘要
The main purpose of Sentiment Analysis (SA) is to derive useful insights from large amounts of unstructured data compiled from various sources. This analysis helps to interpret and classify textual data using different techniques applied in machine learning (ML) models. In this paper, we compared simple and ensemble ML methods as classifiers for SA: Random Forest, K-Nearest Neighbor, Artificial Neural Network, Gradient Boosting, Support Vector Machine (SVM), AdaBoost, Extreme Gradient Boosting, Decision Tree, Light GBM, Stochastic Gradient Descent and Bagging. For this, we considered a test set database of 50,000 movie reviews, of which 25,000 were rated positive and 25,000 negatives. We have chosen 20,000 words that have an impact on the feelings of the documents. This work aims to propose a new rating prediction approach based on a textual customer review. We consider term frequency characteristics and term frequency-inverse document frequency from the large-scale and serial trials to compare the results obtained by various classifiers using feature extraction techniques. For the decision phase, we applied the Fuzzy Decision by Opinion Score Method, one of the most recent methods for multi-criteria decision-making. To evaluate and quantify the performance of the different ML methods we considered, we apply six standard measures namely precision, accuracy, recall, F-score, AUC, and Kappa-measure. The results we obtained, at the end of the experimental work that we conducted, indicated that the SVM classier is the best with 88,333% as a precision rate followed by the FDOSM method, with 0.800 for the same measurement.
引用
收藏
相关论文
共 156 条
[91]  
Li Z(undefined)undefined undefined undefined undefined-undefined
[92]  
Fan Y(undefined)undefined undefined undefined undefined-undefined
[93]  
Jiang B(undefined)undefined undefined undefined undefined-undefined
[94]  
Lei T(undefined)undefined undefined undefined undefined-undefined
[95]  
Liu W(undefined)undefined undefined undefined undefined-undefined
[96]  
Liu B(undefined)undefined undefined undefined undefined-undefined
[97]  
Mahdavi I(undefined)undefined undefined undefined undefined-undefined
[98]  
Mahdavi-Amiri N(undefined)undefined undefined undefined undefined-undefined
[99]  
Heidarzade A(undefined)undefined undefined undefined undefined-undefined
[100]  
Nourifar R(undefined)undefined undefined undefined undefined-undefined