A Multi-Criteria Approach for Arabic Dialect Sentiment Analysis for Online Reviews: Exploiting Optimal Machine Learning Algorithm Selection

被引：22

作者：

Abo, Mohamed Elhag Mohamed ^{[1
]}

Idris, Norisma ^{[1
]}

Mahmud, Rohana ^{[1
]}

Qazi, Atika ^{[2
]}

Hashem, Ibrahim Abaker Targio ^{[3
]}

Maitama, Jaafar Zubairu ^{[1
,4
]}

Naseem, Usman ^{[5
]}

Khan, Shah Khalid ^{[6
]}

Yang, Shuiqing ^{[7
]}

机构：

[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Artificial Intelligence, Kuala Lumpur 50603, Malaysia

[2] Univ Brunei Darussalam, Ctr Lifelong Learning, BE-1410 Gadong, Brunei

[3] Univ Sharjah, Dept Comp Sci, Coll Comp & Informat, Sharjah 27272, U Arab Emirates

[4] Bayero Univ, Fac Comp Sci & Informat Technol, Dept Informat Technol, Kano 3011, Nigeria

[5] Univ Sydney, Sch Comp Sci, Sydney, NSW 2006, Australia

[6] RMIT Univ, Sch Engn, Carlton, Vic 3053, Australia

[7] Zhejiang Univ Finance & Econ, Sch Informat Management & Artificial Intelligence, Hangzhou 310018, Peoples R China

来源：

SUSTAINABILITY | 2021年 / 13卷 / 18期

关键词：

multiple-criteria; Arabic dialect; sentiment analysis; machine learning; performance evaluation; OF-THE-ART; CLASSIFICATION; NETWORKS; CRITERIA;

D O I：

10.3390/su131810018

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

A sentiment analysis of Arabic texts is an important task in many commercial applications such as Twitter. This study introduces a multi-criteria method to empirically assess and rank classifiers for Arabic sentiment analysis. Prominent machine learning algorithms were deployed to build classification models for Arabic sentiment analysis classifiers. Moreover, an assessment of the top five machine learning classifiers' performances measures was discussed to rank the performance of the classifier. We integrated the top five ranking methods with evaluation metrics of machine learning classifiers such as accuracy, recall, precision, F-measure, CPU Time, classification error, and area under the curve (AUC). The method was tested using Saudi Arabic product reviews to compare five popular classifiers. Our results suggest that deep learning and support vector machine (SVM) classifiers perform best with accuracy 85.25%, 82.30%; precision 85.30, 83.87%; recall 88.41%, 83.89; F-measure 86.81, 83.87%; classification error 14.75, 17.70; and AUC 0.93, 0.90, respectively. They outperform decision trees, K-nearest neighbours (K-NN), and Naive Bayes classifiers.

引用

页数：20

共 85 条

[51]

Hathlian N.F. B., 2016, 2016 4th Saudi International Conference on Information Technology (Big Data Analysis)(KACSTIT), P1

[52]

Khasawneh RT, 2013, INT CONF INTERNET, P101, DOI 10.1109/ICIST.2013.6747520

[53] Machine learning for medical diagnosis: history, state of the art and perspective [J].