An explainable machine learning model for sentiment analysis of online reviews

被引:1
作者
Mrabti, Soufiane El [1 ]
EL-Mekkaoui, Jaouad [1 ]
Hachmoud, Adil [1 ]
Lazaar, Mohamed [2 ]
机构
[1] Sidi Mohamed Ben Abdellah Univ, Innovat Technol Lab, EST, Fes, Morocco
[2] Mohammed V Univ Rabat, ENSIAS, Rabat, Morocco
关键词
Explainable machine learning; Sentiment analysis; Feature selection; Chi-square; Support vector machines; Hierarchical clustering; SUPPORT VECTOR MACHINE; NEURAL-NETWORK; CLASSIFICATION; SELECTION;
D O I
10.1016/j.knosys.2024.112348
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over the last two decades and with the widespread use of social media and e-commerce sites, scientific research in the field of sentiment analysis (SA) has made considerable progress in terms of obtained results and the number of published articles. The greatest part of this progress has been achieved by SA systems based on machine learning. However, most of these systems lack transparency and explainability, making it difficult to understand their internal processes and consequently to trust their decisions and predictions. To address this problem, we propose an easy-to-use machine learning model based on an intuitive geometric approach for SA of online reviews. For linearly separable data, we adopt an iterative algorithm called the explainable algorithm for binary linear classification (EABLC) to determine the maximum-margin separating hyperplane based on the geometric concept of the convex hull. As an extension of EABLC, two new algorithms are further proposed, namely, the soft explainable algorithm for binary classification and the explainable algorithm for binary polyhedral classification, to avoid outliers and deal with linearly nonseparable data. Aside from its simplicity and intuitiveness, experimental results on the Amazon product and movie review sentiment datasets demonstrate the efficiency and robustness of our model, which outperforms ten benchmark classification algorithms.
引用
收藏
页数:15
相关论文
共 63 条
[1]  
Aghaebrahimian A., 2019, 4 SWISS TEXT AN C SW
[2]   Deep Recurrent neural network vs. support vector machine for aspect-based sentiment analysis of Arabic hotels' reviews [J].
Al-Smadi, Mohammad ;
Qawasmeh, Omar ;
Al-Ayyoub, Mahmoud ;
Jararweh, Yaser ;
Gupta, Brij .
JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 27 :386-393
[3]   The impact of preprocessing steps on the accuracy of machine learning algorithms in sentiment analysis [J].
Alam, Saqib ;
Yao, Nianmin .
COMPUTATIONAL AND MATHEMATICAL ORGANIZATION THEORY, 2019, 25 (03) :319-335
[4]   Sentiment analysis on Twitter data integrating TextBlob and deep learning models: The case of US airline industry [J].
Aljedaani, Wajdi ;
Rustam, Furqan ;
Mkaouer, Mohamed Wiem ;
Ghallab, Abdullatif ;
Rupapara, Vaibhav ;
Washington, Patrick Bernard ;
Lee, Ernesto ;
Ashraf, Imran .
KNOWLEDGE-BASED SYSTEMS, 2022, 255
[5]   Semantic text classification: A survey of past and recent advances [J].
Altinel, Berna ;
Ganiz, Murat Can .
INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (06) :1129-1153
[6]  
[Anonymous], 2011, P 49 ANN M ASS COMP
[7]   A hybrid approach to the sentiment analysis problem at the sentence level [J].
Appel, Orestes ;
Chiclana, Francisco ;
Carter, Jenny ;
Fujita, Hamido .
KNOWLEDGE-BASED SYSTEMS, 2016, 108 :110-124
[8]   Polyhedral separability through successive LP [J].
Astorino, A ;
Gaudioso, M .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2002, 112 (02) :265-293
[9]  
Bennett K.P., 2000, P 17 INT C MACH LEAR, P5764
[10]   A comprehensive survey on sentiment analysis: Approaches, challenges and trends [J].
Birjali, Marouane ;
Kasri, Mohammed ;
Beni-Hssane, Abderrahim .
KNOWLEDGE-BASED SYSTEMS, 2021, 226