Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm

被引:17
作者
Al-Saffar, Ahmed [1 ]
Awang, Suryanti [1 ]
Tao, Hai [1 ]
Omar, Nazlia [2 ]
Al-Saiagh, Wafaa [2 ]
Al-bared, Mohammed [2 ]
机构
[1] Univ Malaysia Pahang, Fac Comp Syst & Software Engn, Pahang, Malaysia
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi, Selangor, Malaysia
来源
PLOS ONE | 2018年 / 13卷 / 04期
关键词
OPINION;
D O I
10.1371/journal.pone.0194852
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sentiment analysis techniques are increasingly exploited to categorize the opinion text to one or more predefined sentiment classes for the creation and automated maintenance of review-aggregation websites. In this paper, a Malay sentiment analysis classification model is proposed to improve classification performances based on the semantic orientation and machine learning approaches. First, a total of 2,478 Malay sentiment-lexicon phrases and words are assigned with a synonym and stored with the help of more than one Malay native speaker, and the polarity is manually allotted with a score. In addition, the supervised machine learning approaches and lexicon knowledge method are combined for Malay sentiment classification with evaluating thirteen features. Finally, three individual classifiers and a combined classifier are used to evaluate the classification accuracy. In experimental results, a wide-range of comparative experiments is conducted on a Malay Reviews Corpus (MRC), and it demonstrates that the feature extraction improves the performance of Malay sentiment analysis based on the combined classification. However, the results depend on three factors, the features, the number of features and the classification approach.
引用
收藏
页数:18
相关论文
共 56 条
  • [1] Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums
    Abbasi, Ahmed
    Chen, Hsinchun
    Salem, Arab
    [J]. ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2008, 26 (03)
  • [2] Alfred R, 2016, INT C SOFT COMP DAT, P289
  • [3] [Anonymous], 2005, Proceedings of HLT/EMNLP on Interactive Demonstrations
  • [4] [Anonymous], 2012, WORKSH SENT AN SDAD
  • [5] [Anonymous], 2012, Mining text data
  • [6] [Anonymous], 2016, ARTIF INTELL REV
  • [7] [Anonymous], 2015, J COMPUT SCI-NETH
  • [8] A Unified Framework for Creating Domain Dependent Polarity Lexicons from User Generated Reviews
    Asghar, Muhammad Zubair
    Khan, Aurangzeb
    Ahmad, Shakeel
    Khan, Imran Ali
    Kundi, Fazal Masud
    [J]. PLOS ONE, 2015, 10 (10):
  • [9] Sentiment Analysis using Sentiment Features
    Bahrainian, Seyed-Ali
    Dengel, Andreas
    [J]. 2013 IEEE/WIC/ACM INTERNATIONAL JOINT CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY - WORKSHOPS (WI-IAT), VOL 3, 2013, : 26 - 29
  • [10] Sentiment Analysis Is a Big Suitcase
    Cambria, Erik
    Poria, Soujanya
    Gelbukh, Alexander
    Thelwall, Mike
    [J]. IEEE INTELLIGENT SYSTEMS, 2017, 32 (06) : 74 - 80