A framework for Arabic sentiment analysis using supervised classification

被引:25
作者
Duwairi, Rehab M. [1 ]
Qarqaz, Islam [2 ]
机构
[1] Jordan Univ Sci & Technol, Dept Comp Informat Syst, Irbid 22110, Jordan
[2] Jordan Univ Sci & Technol, Dept Comp Sci, Irbid 22110, Jordan
关键词
sentiment analysis; sentiment classification; opinion mining; polarity detection; supervised learning; text mining; Arabic language;
D O I
10.1504/IJDMMM.2016.081247
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis aims to determine the polarity that is embedded in people comments and reviews. Sentiment analysis is important for companies and organisations which are interested in evaluating their products or services. The current paper deals with sentiment analysis in Arabic reviews. Three classifiers were applied on an in-house developed dataset of tweets/comments. In particular, the Naive Bayes, SVM and K-nearest neighbour classifiers were employed. This paper also addresses the effects of term weighting schemes on the accuracy of the results. The binary model, term frequency and term frequency inverse document frequency were used to assign weights to the tokens of tweets/comments. The results show that alternating between the three weighting schemes slightly affects the accuracies. The results also clarify that the classifiers were able to remove false examples (high precision) but were not that successful in identifying all correct examples (low recall).
引用
收藏
页码:369 / 381
页数:13
相关论文
共 28 条
  • [1] Abdul-Mageed M., 2012, P LREC IST TURK
  • [2] Abdul-Mageed Muhammad, 2011, P 49 ANN M ASS COMP, V2
  • [3] [Anonymous], P 2 SIAM INT C DAT M
  • [4] [Anonymous], 2 WORKSH COMP APPR A
  • [5] [Anonymous], 2002, P C EMP METH NAT LAN
  • [6] [Anonymous], P 22 INT C COMP LING
  • [7] [Anonymous], 2001, IJCAI 2001 WORKSH EM
  • [8] [Anonymous], IJCSI INT J COMPUTER
  • [9] [Anonymous], 2006, TEXTGRAPHS WORKSH HL
  • [10] [Anonymous], P JOINT C 47 ANN M A