Which configuration works best? An Experimental Study on Supervised Arabic Twitter Sentiment Analysis

被引:8
作者
Khalil, Talaat [1 ]
Halaby, Amal [1 ]
Hammad, Muhammad [1 ]
El-Beltagy, Samhaa R. [1 ]
机构
[1] Nile Univ, Ctr Informat Sci, Giza, Egypt
来源
2015 FIRST INTERNATIONAL CONFERENCE ON ARABIC COMPUTATIONAL LINGUISTICS (ACLING 2015): ADVANCES IN ARABIC COMPUTATIONAL LINGUISTICS | 2015年
关键词
Sentiment Analysis; Arabic Tweets;
D O I
10.1109/ACLing.2015.19
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Arabic Twitter Sentiment Analysis has been gaining a lot of attention lately with supervised approaches being exploited widely. However, to date, there has not been an experimental study that examines how different configurations of the Bag of Words model, text representation scheme, can affect various supervised machine learning methods. The goal of the presented work is to do exactly that. Specifically, this work examines which configurations work best for each of three machine learning approaches that have shown good results when applied on the task of sentiment analysis, namely: Support Vector Machines, Compliment Naive Bayes, and Multinomial Naive Bayes. Experimenting with different datasets has shown that each of these classifiers has a Bag of Words configuration in conjunction with which, it consistently performs best. It also showed that some features are dataset dependent.
引用
收藏
页码:86 / 93
页数:8
相关论文
共 16 条
[1]  
Abdulla N., 2013, 2013 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT), P1, DOI DOI 10.1109/AEECT.2013.6716448
[2]  
[Anonymous], 2014, M BINRASHID SCH GOVT
[3]  
[Anonymous], Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
[4]  
[Anonymous], 2011, ACM Transactions on Speech and Language Processing
[5]  
Duwairi RM, 2014, 2014 5TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS)
[6]  
El-Beltagy SR, 2013, IEEE INT CONF INNOV
[7]  
Liu B, 2011, DATA CENTRIC SYST AP, P459, DOI 10.1007/978-3-642-19460-3_11
[8]  
McCallum A., 1998, AAAI-98 Workshop on Learning for Text Categorization, V752, P41, DOI [DOI 10.1109/TSMC.1985.6313426, 10.1.1.46]
[9]  
Refaee E., 2015, P 8 SAUD SCI C LOND, P2
[10]  
Refaee E., 2014, 9 ED LANG RES EV C L