Malay sentiment analysis based on combined classification approaches and Senti-lexicon algorithm

被引:17
作者
Al-Saffar, Ahmed [1 ]
Awang, Suryanti [1 ]
Tao, Hai [1 ]
Omar, Nazlia [2 ]
Al-Saiagh, Wafaa [2 ]
Al-bared, Mohammed [2 ]
机构
[1] Univ Malaysia Pahang, Fac Comp Syst & Software Engn, Pahang, Malaysia
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi, Selangor, Malaysia
来源
PLOS ONE | 2018年 / 13卷 / 04期
关键词
OPINION;
D O I
10.1371/journal.pone.0194852
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Sentiment analysis techniques are increasingly exploited to categorize the opinion text to one or more predefined sentiment classes for the creation and automated maintenance of review-aggregation websites. In this paper, a Malay sentiment analysis classification model is proposed to improve classification performances based on the semantic orientation and machine learning approaches. First, a total of 2,478 Malay sentiment-lexicon phrases and words are assigned with a synonym and stored with the help of more than one Malay native speaker, and the polarity is manually allotted with a score. In addition, the supervised machine learning approaches and lexicon knowledge method are combined for Malay sentiment classification with evaluating thirteen features. Finally, three individual classifiers and a combined classifier are used to evaluate the classification accuracy. In experimental results, a wide-range of comparative experiments is conducted on a Malay Reviews Corpus (MRC), and it demonstrates that the feature extraction improves the performance of Malay sentiment analysis based on the combined classification. However, the results depend on three factors, the features, the number of features and the classification approach.
引用
收藏
页数:18
相关论文
共 56 条
  • [11] New avenues in knowledge bases for natural language processing
    Cambria, Erik
    Schuller, Bjorn
    Xia, Yunqing
    White, Bebo
    [J]. KNOWLEDGE-BASED SYSTEMS, 2016, 108 : 1 - 4
  • [12] Computational Intelligence for Big Social Data Analysis
    Cambria, Erik
    Howard, Newton
    Xia, Yunqing
    Chua, Tat-Seng
    [J]. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2016, 11 (03) : 9 - 10
  • [13] Affective Computing and Sentiment Analysis
    Cambria, Erik
    [J]. IEEE INTELLIGENT SYSTEMS, 2016, 31 (02) : 102 - 107
  • [14] Chaovalit P., 2005, HAWAII INT C SYSTEM, V4, p112c, DOI DOI 10.1109/HICSS.2005.445
  • [15] Chaturvedi I, 2017, INFORM FUSION, P65
  • [16] COMPOSITE CLASSIFIER SYSTEM-DESIGN - CONCEPTS AND METHODOLOGY
    DASARATHY, BV
    SHEELA, BV
    [J]. PROCEEDINGS OF THE IEEE, 1979, 67 (05) : 708 - 713
  • [17] Adaptation and Use of Subjectivity Lexicons for Domain Dependent Sentiment Classification
    Dehkharghani, Rahim
    Yanikoglu, Berrin
    Tapucu, Dilek
    Saygin, Yucel
    [J]. 12TH IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2012), 2012, : 669 - 673
  • [18] Diversity techniques improve the performance of the best imbalance learning ensembles
    Diez-Pastor, Jose F.
    Rodriguez, Juan J.
    Garcia-Osorio, Cesar I.
    Kuncheva, Ludmila I.
    [J]. INFORMATION SCIENCES, 2015, 325 : 98 - 117
  • [19] Dzeroski S, 2004, MACH LEARN, V54, P255, DOI 10.1023/B.MAC.0000015881.36452.6e
  • [20] ESULI A., 2005, P ACM INT C INFORM K, P617, DOI DOI 10.1145/1099554.1099713