Ordinal-based and frequency-based integration of feature selection methods for sentiment analysis

被引:36
|
作者
Yousefpour, Alireza [1 ]
Ibrahim, Roliana [1 ]
Hamed, Haza Nuzly Abdel [1 ]
机构
[1] Univ Teknologi Malaysia, Fac Comp, Software Engn Res Grp, Skudai 81310, Malaysia
关键词
Feature selection; Ordinal-based integration; Frequency-based integration; Feature vectors integration; Feature subsets integration; Sentiment analysis; FEATURE-EXTRACTION; CLASSIFICATION; REDUCTION;
D O I
10.1016/j.eswa.2017.01.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature subset selection with the aim of reducing, dependency of feature selection techniques and obtaining a high-quality minimal feature subset from a real-world domain is the main task of this research. For this end, firstly, two types of feature representation are presented for feature sets, namely unigram-based and part-of-speech based feature sets. Secondly, five methods of feature ranking are employed for creating feature vectors. Finally, we propose two methods for the integration feature vectors and feature subsets. An ordinal-based integration of different feature vectors (OIFV) is proposed in order to obtain a new feature vector. The new feature vector depends on the order of features in the old vectors. A frequency based integration of different feature subsets (FIFS) with most effective features, which are obtained from a hybrid filter and wrapper methods in the feature selection task, is then proposed. In addition, four wellknown text classification algorithms are employed as classifiers in the wrapper method for the selection of the feature subsets. A wide range of comparative experiments on five widely-used datasets in sentiment analysis were carried out. The experiments demonstrate that proposed methods can effectively improve the performance of sentiment classification. These results also show that proposed part-of-speech patterns are more effective in their classification accuracy compared to unigram-based features. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:80 / 93
页数:14
相关论文
共 50 条
  • [1] IWD Based Feature Selection Algorithm for Sentiment Analysis
    Parlar, Tuba
    Sarac, Esra
    ELEKTRONIKA IR ELEKTROTECHNIKA, 2019, 25 (01) : 54 - 58
  • [2] Comparison of Feature Selection Methods for Sentiment Analysis
    El Mrabti, Soufiane
    Al Achhab, Mohammed
    Lazaar, Mohamed
    BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018, 2018, 872 : 261 - 272
  • [3] A Review on Feature Selection Methods for Sentiment Analysis
    Hung, Lai Po
    Alfred, Rayner
    Hijazi, Mohd Hanafi Ahmad
    ADVANCED SCIENCE LETTERS, 2015, 21 (10) : 2952 - 2956
  • [4] Feature Selection Methods in Sentiment Analysis : A Review
    Khairi, Nurilhami Izzatie
    Mohamed, Azlinah
    Yusof, Nor Nadiah
    3RD INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEM & SECURITY (NISS'20), 2020,
  • [5] An Effective Feature Selection Based Classification model using Firefly with Levy and Multilayer Perceptron based Sentiment Analysis
    Elangovan, D.
    Subedha, V
    PROCEEDINGS OF THE 5TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT-2020), 2020, : 376 - 380
  • [6] Comparison of Feature Selection Methods for Sentiment Analysis
    Nicholls, Chris
    Song, Fei
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2010, 6085 : 286 - 289
  • [7] Evolutionary Multiobjective Feature Selection for Sentiment Analysis
    Deniz, Ayca
    Angin, Merih
    Angin, Pelin
    IEEE ACCESS, 2021, 9 : 142982 - 142996
  • [8] Text Sentiment Analysis Using Frequency-Based Vigorous Features
    Abdul Razzaq
    Muhammad Asim
    Zulqrnain Ali
    Salman Qadri
    Imran Mumtaz
    Dost Muhammad Khan
    Qasim Niaz
    中国通信, 2019, 16 (12) : 145 - 153
  • [9] Text Sentiment Analysis Using Frequency-Based Vigorous Features
    Razzaq, Abdul
    Asim, Muhammad
    Ali, Zulqrnain
    Qadri, Salman
    Mumtaz, Imran
    Khan, Dost Muhammad
    Niaz, Qasim
    CHINA COMMUNICATIONS, 2019, 16 (12) : 145 - 153
  • [10] Feature selection for sentiment analysis based on content and syntax models
    Duric, Adnan
    Song, Fei
    DECISION SUPPORT SYSTEMS, 2012, 53 (04) : 704 - 711