Feature selection based on genetic algorithm and hybrid model for sentiment polarity classification

Cited by: 4
Authors
Kalaivani, P. [1 ]
Shunmuganathan, K. L. [2 ]
Affiliations
[1] Sathyabama Univ, Dept Comp Sci & Engn, St Josephs Coll Engn, Madras, Tamil Nadu, India
[2] RMK Engn Coll, Dept Comp Sci & Engn, Madras, Tamil Nadu, India
Keywords
sentiment classification; supervised machine learning algorithm; feature selection; genetic algorithm; review; information gain; bagging;
DOI
10.1504/IJDMMM.2016.081242
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Sentiment classification finds the polarity of product or user reviews. Supervised machine learning algorithms such as naive Bayes, K-nearest neighbour (KNN), decision trees, maximum entropy, hidden Markov models and support vector machines are used for opinion mining. KNN is a simple algorithm, but a less efficient classifier. In this paper, we propose an improved KNN algorithm. It uses an optimised feature selection step, a genetic algorithm that incorporates information gain, combined with the bagging technique and KNN to improve the accuracy of sentiment classification. Specifically, we compare two approaches with traditional KNN for sentiment classification of movie reviews and product reviews. The same approach is applied to other machine learning algorithms, such as support vector machines and naive Bayes, and the results are compared with a POS-based feature set method. Experimental results show that information gain and the genetic algorithm with the bagging technique achieve higher performance, with an accuracy of 87.50% on the movie reviews, and better accuracy, precision and recall on the movie, DVD, electronics and kitchen reviews.
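The pipeline named in the abstract (information gain, a genetic algorithm, bagging and KNN) can be illustrated with a minimal sketch. This is not the authors' implementation: the toy vocabulary and reviews, the Hamming distance, the use of information gain as a pre-filter before the genetic search, the leave-one-out fitness, and every parameter value are assumptions chosen only to keep the example small and self-contained.

```python
import math
import random
from collections import Counter

def entropy(labels):
    """Shannon entropy (bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def information_gain(X, y, j):
    """IG of binary feature j: H(y) minus the weighted entropy of each split."""
    gain = entropy(y)
    for v in (0, 1):
        subset = [label for row, label in zip(X, y) if row[j] == v]
        if subset:
            gain -= len(subset) / len(y) * entropy(subset)
    return gain

def knn_predict(train_X, train_y, x, feats, k=3):
    """Majority vote of the k nearest rows (Hamming distance on feats)."""
    dists = sorted((sum(row[j] != x[j] for j in feats), i)
                   for i, row in enumerate(train_X))
    return Counter(train_y[i] for _, i in dists[:k]).most_common(1)[0][0]

def loo_accuracy(X, y, feats, k=3):
    """Leave-one-out KNN accuracy on feats; used here as the GA fitness."""
    if not feats:
        return 0.0
    hits = sum(knn_predict(X[:i] + X[i + 1:], y[:i] + y[i + 1:], X[i], feats, k) == y[i]
               for i in range(len(X)))
    return hits / len(X)

def ga_select(X, y, candidates, rng, pop_size=8, generations=10, p_mut=0.1):
    """Tiny GA over bit masks: truncation selection, one-point crossover, bit-flip mutation."""
    n = len(candidates)
    decode = lambda mask: [c for c, bit in zip(candidates, mask) if bit]
    fitness = lambda mask: loo_accuracy(X, y, decode(mask))
    pop = [[rng.randint(0, 1) for _ in range(n)] for _ in range(pop_size)]
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        parents = pop[:pop_size // 2]
        children = []
        while len(parents) + len(children) < pop_size:
            a, b = rng.sample(parents, 2)
            cut = rng.randrange(1, n)
            children.append([bit ^ (rng.random() < p_mut) for bit in a[:cut] + b[cut:]])
        pop = parents + children
    return decode(max(pop, key=fitness))

def bagged_knn_predict(X, y, x, feats, rng, n_bags=7, k=3):
    """Bagging: train KNN on bootstrap resamples and take the majority vote."""
    votes = Counter()
    for _ in range(n_bags):
        idx = [rng.randrange(len(X)) for _ in range(len(X))]
        votes[knn_predict([X[i] for i in idx], [y[i] for i in idx], x, feats, k)] += 1
    return votes.most_common(1)[0][0]

# Hypothetical toy data: one row per review, 1 if the word is present.
VOCAB = ["good", "great", "bad", "boring", "movie", "the"]
X = [[1, 1, 0, 0, 1, 1], [1, 0, 0, 0, 0, 1], [0, 1, 0, 0, 1, 1], [1, 1, 0, 0, 0, 1],
     [0, 0, 1, 1, 1, 1], [0, 0, 1, 0, 0, 1], [0, 0, 0, 1, 1, 1], [0, 0, 1, 1, 0, 1]]
y = ["pos"] * 4 + ["neg"] * 4

rng = random.Random(0)
gains = [information_gain(X, y, j) for j in range(len(VOCAB))]
top = sorted(range(len(VOCAB)), key=lambda j: gains[j], reverse=True)[:4]  # IG pre-filter
selected = ga_select(X, y, top, rng)                                       # GA wrapper
query = [1, 1, 0, 0, 1, 1]
prediction = bagged_knn_predict(X, y, query, selected or top, rng)
print([VOCAB[j] for j in selected], prediction)
```

In this sketch, words that appear equally often in both classes ("movie", "the") receive zero information gain and are dropped before the genetic search, so the search only has to explore subsets of the sentiment-bearing words.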
Pages: 315-329
Number of pages: 15
Related papers
50 in total
  • [31] Hybrid Ensemble Learning With Feature Selection for Sentiment Classification in Social Media
    Sharma, Sanur
    Jain, Anurag
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2020, 10 (02) : 40 - 58
  • [32] Hybrid feature selection based on SLI and genetic algorithm for microarray datasets
    Abasabadi, Sedighe
    Nematzadeh, Hossein
    Motameni, Homayun
    Akbari, Ebrahim
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (18) : 19725 - 19753
  • [33] A hybrid genetic algorithm for feature selection wrapper based on mutual information
    Huang, Jinjie
    Cai, Yunze
    Xu, Xiaoming
    PATTERN RECOGNITION LETTERS, 2007, 28 (13) : 1825 - 1844
  • [34] Polarity Analysis Based on an Improved Feature Selection Algorithm
    Tian Weixin
    Zheng Sheng
    Wang Anhui
    2010 THE 3RD INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION (PACIIA2010), VOL I, 2010, : 129 - 132
  • [35] Polarity Analysis Based on an Improved Feature Selection Algorithm
    Tian Weixin
    Zheng Sheng
    Wang Anhui
    APPLIED INFORMATICS AND COMMUNICATION, PT I, 2011, 224 : 207 - +
  • [36] A filter model for feature subset selection based on genetic algorithm
    Elalami, M. E.
    KNOWLEDGE-BASED SYSTEMS, 2009, 22 (05) : 356 - 362
  • [37] A Novel Hybrid Feature Selection Algorithm for Hierarchical Classification
    Lima, Helen C. S. C.
    Otero, Fernando E. B.
    Merschmann, Luiz H. C.
    Souza, Marcone J. F.
    IEEE ACCESS, 2021, 9 : 127278 - 127292
  • [38] An Ensemble Classification Algorithm of Micro-Blog Sentiment Based on Feature Selection and Differential Evolution
    Li, Hongchan
    Ma, Zishuai
    Zhu, Haodong
    Ma, Yu
    Chang, Zhifang
    IEEE ACCESS, 2022, 10 : 70467 - 70475
  • [39] Bio inspired Boolean artificial bee colony based feature selection algorithm for sentiment classification
    Anuradha, K.
    Krishna, M. Vamsi
    Mallik, Banitamani
    Measurement: Sensors, 2024, 32
  • [40] Trajectory Classification Using Feature Selection by Genetic Algorithm
    Saini, Rajkumar
    Kumar, Pradeep
    Roy, Partha Pratim
    Pal, Umapada
    PROCEEDINGS OF 3RD INTERNATIONAL CONFERENCE ON COMPUTER VISION AND IMAGE PROCESSING, CVIP 2018, VOL 2, 2020, 1024 : 377 - 388