ASA: A framework for Arabic sentiment analysis

被引:62
作者
Oussous, Ahmed [1 ]
Benjelloun, Fatima-Zahra [1 ]
Lahcen, Ayoub Ait [1 ,2 ]
Belfkih, Samir [1 ]
机构
[1] Ibn Tofail Univ, Natl Sch Appl Sci ENSA, Kenitra, Morocco
[2] Mohammed V Agdal Univ, Unite Associee CNRST URAC 29, LRIT, Rabat, Morocco
关键词
Arabic; convolutional neural network; deep learning; long short-term memory; machine-learning; sentiment analysis; CLASSIFICATION; REVIEWS; MODEL;
D O I
10.1177/0165551519849516
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Sentiment analysis (SA), also known as opinion mining, is a growing important research area. Generally, it helps to automatically determine if a text expresses a positive, negative or neutral sentiment. It enables to mine the huge increasing resources of shared opinions such as social networks, review sites and blogs. In fact, SA is used by many fields and for various languages such as English and Arabic. However, since Arabic is a highly inflectional and derivational language, it raises many challenges. In fact, SA of Arabic text should handle such complex morphology. To better handle these challenges, we decided to provide the research community and Arabic users with a new efficient framework for Arabic Sentiment Analysis (ASA). Our primary goal is to improve the performance of ASA by exploiting deep learning while varying the preprocessing techniques. For that, we implement and evaluate two deep learning models namely convolutional neural network (CNN) and long short-term memory (LSTM) models. The framework offers various preprocessing techniques for ASA (including stemming, normalisation, tokenization and stop words). As a result of this work, we first provide a new rich and publicly available Arabic corpus called Moroccan Sentiment Analysis Corpus (MSAC). Second, the proposed framework demonstrates improvement in ASA. In fact, the experimental results prove that deep learning models have a better performance for ASA than classical approaches (support vector machines, naive Bayes classifiers and maximum entropy). They also show the key role of morphological features in Arabic Natural Language Processing (NLP).
引用
收藏
页码:544 / 559
页数:16
相关论文
共 71 条
[1]  
Al Sallab A., 2015, Deep learning models for sentiment analysis in Arabic, P9
[2]   Deep learning for Arabic NLP: A survey [J].
Al-Ayyoub, Mahmoud ;
Nuseir, Aya ;
Alsmearat, Kholoud ;
Jararweh, Yaser ;
Gupta, Brij .
JOURNAL OF COMPUTATIONAL SCIENCE, 2018, 26 :522-531
[3]   The impact of indexing approaches on Arabic text classification [J].
Al-Badarneh, Amer ;
Al-Shawakfa, Emad ;
Bani-Ismail, Basel ;
Al-Rababah, Khaleel ;
Shatnawi, Safwan .
JOURNAL OF INFORMATION SCIENCE, 2017, 43 (02) :159-173
[4]   Arabic senti-lexicon: Constructing publicly available language resources for Arabic sentiment analysis [J].
Al-Moslmi, Tareq ;
Albared, Mohammed ;
Al-Shabi, Adel ;
Omar, Nazlia ;
Abdullah, Salwani .
JOURNAL OF INFORMATION SCIENCE, 2018, 44 (03) :345-362
[5]  
Al-Shammari ET., 2008, Proceeding of the 2nd ACM Workshop on Improving Non English Web Searching, P9, DOI DOI 10.1145/1460027.1460030
[6]   A Comparison Study of Some Arabic Root Finding Algorithms [J].
Al-Shawakfa, Emad ;
Al-Badarneh, Amer ;
Shatnawi, Safwan ;
Al-Rabab'ah, Khaleel ;
Bani-Ismail, Basel .
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2010, 61 (05) :1015-1024
[7]  
Alomari Khaled Mohammad, 2017, Advances in Artificial Intelligence: from Theory to Practice. 30th International Conference on Industrial Engineering and Other Applications of Applied Intelligent Systems, IEA/AIE 2017. Proceedings: LNAI 10350, P602, DOI 10.1007/978-3-319-60042-0_66
[8]  
Alshalabi R., 2005, Information Technology Journal, V4, P38
[9]  
[Anonymous], 2015, International Journal of Computer Applications
[10]  
[Anonymous], 2018, GSTF J COMPUT JOC