A Multi-Criteria Approach for Arabic Dialect Sentiment Analysis for Online Reviews: Exploiting Optimal Machine Learning Algorithm Selection

被引:22
作者
Abo, Mohamed Elhag Mohamed [1 ]
Idris, Norisma [1 ]
Mahmud, Rohana [1 ]
Qazi, Atika [2 ]
Hashem, Ibrahim Abaker Targio [3 ]
Maitama, Jaafar Zubairu [1 ,4 ]
Naseem, Usman [5 ]
Khan, Shah Khalid [6 ]
Yang, Shuiqing [7 ]
机构
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Artificial Intelligence, Kuala Lumpur 50603, Malaysia
[2] Univ Brunei Darussalam, Ctr Lifelong Learning, BE-1410 Gadong, Brunei
[3] Univ Sharjah, Dept Comp Sci, Coll Comp & Informat, Sharjah 27272, U Arab Emirates
[4] Bayero Univ, Fac Comp Sci & Informat Technol, Dept Informat Technol, Kano 3011, Nigeria
[5] Univ Sydney, Sch Comp Sci, Sydney, NSW 2006, Australia
[6] RMIT Univ, Sch Engn, Carlton, Vic 3053, Australia
[7] Zhejiang Univ Finance & Econ, Sch Informat Management & Artificial Intelligence, Hangzhou 310018, Peoples R China
关键词
multiple-criteria; Arabic dialect; sentiment analysis; machine learning; performance evaluation; OF-THE-ART; CLASSIFICATION; NETWORKS; CRITERIA;
D O I
10.3390/su131810018
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
A sentiment analysis of Arabic texts is an important task in many commercial applications such as Twitter. This study introduces a multi-criteria method to empirically assess and rank classifiers for Arabic sentiment analysis. Prominent machine learning algorithms were deployed to build classification models for Arabic sentiment analysis classifiers. Moreover, an assessment of the top five machine learning classifiers' performances measures was discussed to rank the performance of the classifier. We integrated the top five ranking methods with evaluation metrics of machine learning classifiers such as accuracy, recall, precision, F-measure, CPU Time, classification error, and area under the curve (AUC). The method was tested using Saudi Arabic product reviews to compare five popular classifiers. Our results suggest that deep learning and support vector machine (SVM) classifiers perform best with accuracy 85.25%, 82.30%; precision 85.30, 83.87%; recall 88.41%, 83.89; F-measure 86.81, 83.87%; classification error 14.75, 17.70; and AUC 0.93, 0.90, respectively. They outperform decision trees, K-nearest neighbours (K-NN), and Naive Bayes classifiers.
引用
收藏
页数:20
相关论文
共 85 条
[51]  
Hathlian N.F. B., 2016, 2016 4th Saudi International Conference on Information Technology (Big Data Analysis)(KACSTIT), P1
[52]  
Khasawneh RT, 2013, INT CONF INTERNET, P101, DOI 10.1109/ICIST.2013.6747520
[53]   Machine learning for medical diagnosis: history, state of the art and perspective [J].
Kononenko, I .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2001, 23 (01) :89-109
[54]  
Kotthoff L, 2016, LECT NOTES COMPUT SC, V10101, P149, DOI 10.1007/978-3-319-50137-6_7
[55]   A survey of deep neural network architectures and their applications [J].
Liu, Weibo ;
Wang, Zidong ;
Liu, Xiaohui ;
Zeng, Nianyin ;
Liu, Yurong ;
Alsaadi, Fuad E. .
NEUROCOMPUTING, 2017, 234 :11-26
[56]   RNN for Solving Perturbed Time-Varying Underdetermined Linear System With Double Bound Limits on Residual Errors and State Variables [J].
Lu, Huiyan ;
Jin, Long ;
Luo, Xin ;
Liao, Bolin ;
Guo, Dongsheng ;
Xiao, Lin .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 15 (11) :5931-5942
[57]  
Michalski R., 2013, Machine Learning: An Artificial Intelligence Approach
[58]  
Mohammad A.H., 2018, GSTF J COMPUT, V5, P1, DOI [10.7603/s40601-016-0016-9, DOI 10.7603/S40601-016-0016-9]
[59]   More than words: Social networks' text mining for consumer brand sentiments [J].
Mostafa, Mohamed M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2013, 40 (10) :4241-4251
[60]  
Mountassir A, 2012, COLLOQ INF SCI TECH, P43, DOI 10.1109/CIST.2012.6388061