The Impact of Sentiment Features on the Sentiment Polarity Classification in Persian Reviews

被引:19
作者
Asgarian, Ehsan [1 ]
Kahani, Mohsen [1 ]
Sharifi, Shahla [2 ]
机构
[1] Ferdowsi Univ Mashhad, Dept Comp Engn, Azadi Sq, Mashhad, Iran
[2] Ferdowsi Univ Mashhad, Fac Letters & Humanities, Dept Linguist, Mashhad, Iran
关键词
Opinion mining; Persian sentiment word miner; Feature engineering; Comprehensive Persian WordNet; FEATURE-SELECTION; LEXICON;
D O I
10.1007/s12559-017-9513-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural language processing (NLP) techniques can prove relevant to a variety of specialties in the field of cognitive science, including sentiment analysis. This paper investigates the impact of NLP tools, various sentiment features, and sentiment lexicon generation approaches to sentiment polarity classification of internet reviews written in Persian language. For this purpose, a comprehensive Persian WordNet (FerdowsNet), with high recall and proper precision (based on Princeton WordNet), was developed. Using FerdowsNet and a generated corpus of reviews, a Persian sentiment lexicon was developed using (i) mapping to the SentiWordNet and (ii) a semi-supervised learning method, after which the results of both methods were compared. In addition to sentiment words, a set of various features were extracted and applied to the sentiment classification. Then, by employing various well-known feature selection approaches and state-of-the art machine learning methods, a sentiment classification for Persian text reviews was carried out. The obtained results demonstrate the critical role of sentiment lexicon quality in improving the quality of sentiment classification in Persian language.
引用
收藏
页码:117 / 135
页数:19
相关论文
共 106 条
  • [1] Agarwal B., 2016, Prominent Feature Extraction for Sentiment Analysis
  • [2] Concept-Level Sentiment Analysis with Dependency-Based Semantic Parsing: A Novel Approach
    Agarwal, Basant
    Poria, Soujanya
    Mittal, Namita
    Gelbukh, Alexander
    Hussain, Amir
    [J]. COGNITIVE COMPUTATION, 2015, 7 (04) : 487 - 499
  • [3] Semantic Feature Clustering for Sentiment Analysis of English Reviews
    Agarwal, Basant
    Mittal, Namita
    [J]. IETE JOURNAL OF RESEARCH, 2014, 60 (06) : 414 - 422
  • [4] Arabic tweets sentiment analysis - a hybrid scheme
    Aldayel, Haifa K.
    Azmi, Aqil M.
    [J]. JOURNAL OF INFORMATION SCIENCE, 2016, 42 (06) : 782 - 797
  • [5] Hamshahri: A standard Persian text collection
    AleAhmad, Abolfazl
    Amiri, Hadi
    Darrudi, Ehsan
    Rahgozar, Masoud
    Oroumchian, Farhad
    [J]. KNOWLEDGE-BASED SYSTEMS, 2009, 22 (05) : 382 - 387
  • [6] Ali-Mardani S, 2015, J INFO TECH MANAGE, V7, P345
  • [7] Amiri F, 2015, RECENT ADV NATURAL L
  • [8] [Anonymous], ACL
  • [9] [Anonymous], 2012, Mining text data
  • [10] [Anonymous], 2008, Introduction to information retrieval