Ordinal-based and frequency-based integration of feature selection methods for sentiment analysis

被引:36
作者
Yousefpour, Alireza [1 ]
Ibrahim, Roliana [1 ]
Hamed, Haza Nuzly Abdel [1 ]
机构
[1] Univ Teknologi Malaysia, Fac Comp, Software Engn Res Grp, Skudai 81310, Malaysia
关键词
Feature selection; Ordinal-based integration; Frequency-based integration; Feature vectors integration; Feature subsets integration; Sentiment analysis; FEATURE-EXTRACTION; CLASSIFICATION; REDUCTION;
D O I
10.1016/j.eswa.2017.01.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature subset selection with the aim of reducing, dependency of feature selection techniques and obtaining a high-quality minimal feature subset from a real-world domain is the main task of this research. For this end, firstly, two types of feature representation are presented for feature sets, namely unigram-based and part-of-speech based feature sets. Secondly, five methods of feature ranking are employed for creating feature vectors. Finally, we propose two methods for the integration feature vectors and feature subsets. An ordinal-based integration of different feature vectors (OIFV) is proposed in order to obtain a new feature vector. The new feature vector depends on the order of features in the old vectors. A frequency based integration of different feature subsets (FIFS) with most effective features, which are obtained from a hybrid filter and wrapper methods in the feature selection task, is then proposed. In addition, four wellknown text classification algorithms are employed as classifiers in the wrapper method for the selection of the feature subsets. A wide range of comparative experiments on five widely-used datasets in sentiment analysis were carried out. The experiments demonstrate that proposed methods can effectively improve the performance of sentiment classification. These results also show that proposed part-of-speech patterns are more effective in their classification accuracy compared to unigram-based features. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:80 / 93
页数:14
相关论文
共 50 条
  • [21] Comparison of Feature Selection Methods for Sentiment Analysis on Turkish Twitter Data
    Parlar, Tuba
    Sarac, Esra
    Ozel, Selma Ayse
    2017 25TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2017,
  • [22] SA-MSVM: Hybrid Heuristic Algorithm-based Feature Selection for Sentiment Analysis in Twitter
    Selvi, C. P. Thamil
    PushpaLaksmi, R.
    COMPUTER SYSTEMS SCIENCE AND ENGINEERING, 2023, 44 (03): : 2439 - 2456
  • [23] Twitter Sentiment Analysis Based on Ordinal Regression
    Elbagir, Shihab
    Yang, Jing
    IEEE ACCESS, 2019, 7 : 163677 - 163685
  • [24] Comparison of feature selection methods based on discrimination and reliability for fMRI decoding analysis
    Xu, Wenyan
    Li, Qing
    Liu, Xingyu
    Zhen, Zonglei
    Wu, Xia
    JOURNAL OF NEUROSCIENCE METHODS, 2020, 335
  • [25] QER: a new feature selection method for sentiment analysis
    Parlar, Tuba
    Ozel, Selma Ayse
    Song, Fei
    HUMAN-CENTRIC COMPUTING AND INFORMATION SCIENCES, 2018, 8
  • [26] Optimized Swarm Search-based Feature Selection for Text Mining in Sentiment Analysis
    Fong, Simon
    Gao, Elisa
    Wong, Raymond
    2015 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2015, : 1153 - 1162
  • [27] A novel sentiment analysis for amazon data with TSA based feature selection
    DANIEL D. A.J.
    M. J.M.
    Scalable Computing, 2021, 22 (01): : 53 - 66
  • [28] Firefly with Levy Based Feature Selection with Multilayer Perceptron for Sentiment Analysis
    Elangovan, D.
    Subedha, V.
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2023, 14 (02) : 342 - 349
  • [29] Sentiment Analysis for Assessment of Hotel Services Review using Feature Selection Approach based-on Decision Tree
    Apriliani, Dyah
    Abidin, Taufiq
    Sutanta, Edhy
    Hamzah, Amir
    Somantri, Oman
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2020, 11 (04) : 240 - 245
  • [30] Word Sentiment Orientation Computing with Feature Selection Methods
    Li, Shoushan
    Huang, Chu-Ren
    11TH CHINESE LEXICAL SEMANTICS WORKSHOP (CKSW2010), 2010, : 368 - 373