Ordinal-based and frequency-based integration of feature selection methods for sentiment analysis

被引:36
作者
Yousefpour, Alireza [1 ]
Ibrahim, Roliana [1 ]
Hamed, Haza Nuzly Abdel [1 ]
机构
[1] Univ Teknologi Malaysia, Fac Comp, Software Engn Res Grp, Skudai 81310, Malaysia
关键词
Feature selection; Ordinal-based integration; Frequency-based integration; Feature vectors integration; Feature subsets integration; Sentiment analysis; FEATURE-EXTRACTION; CLASSIFICATION; REDUCTION;
D O I
10.1016/j.eswa.2017.01.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature subset selection with the aim of reducing, dependency of feature selection techniques and obtaining a high-quality minimal feature subset from a real-world domain is the main task of this research. For this end, firstly, two types of feature representation are presented for feature sets, namely unigram-based and part-of-speech based feature sets. Secondly, five methods of feature ranking are employed for creating feature vectors. Finally, we propose two methods for the integration feature vectors and feature subsets. An ordinal-based integration of different feature vectors (OIFV) is proposed in order to obtain a new feature vector. The new feature vector depends on the order of features in the old vectors. A frequency based integration of different feature subsets (FIFS) with most effective features, which are obtained from a hybrid filter and wrapper methods in the feature selection task, is then proposed. In addition, four wellknown text classification algorithms are employed as classifiers in the wrapper method for the selection of the feature subsets. A wide range of comparative experiments on five widely-used datasets in sentiment analysis were carried out. The experiments demonstrate that proposed methods can effectively improve the performance of sentiment classification. These results also show that proposed part-of-speech patterns are more effective in their classification accuracy compared to unigram-based features. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:80 / 93
页数:14
相关论文
共 50 条
[31]   Word Sentiment Orientation Computing with Feature Selection Methods [J].
Li, Shoushan ;
Huang, Chu-Ren .
11TH CHINESE LEXICAL SEMANTICS WORKSHOP (CKSW2010), 2010, :368-373
[32]   A Comparative Study of Evolutionary Methods for Feature Selection in Sentiment Analysis [J].
Garg, Shikhar ;
Verma, Sukriti .
IJCCI: PROCEEDINGS OF THE 11TH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL INTELLIGENCE, 2019, :131-138
[33]   Sentiment Analysis of Twitter Data based on Ordinal Classification [J].
Elbagir, Shihab ;
Yang, Jing .
2018 INTERNATIONAL CONFERENCE ON ALGORITHMS, COMPUTING AND ARTIFICIAL INTELLIGENCE (ACAI 2018), 2018,
[34]   A Frequency-Based Gene Selection Method with Random Forests for Gene Data Analysis [J].
Thanh Trinh ;
Wu, DingMing ;
Salloum, Salman ;
Tung Nguyen ;
Huang, Joshua Zhexue .
2016 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES, RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2016, :193-198
[35]   A New Feature Selection Method for Sentiment Analysis in Short Text [J].
Kumar, H. M. Keerthi ;
Harish, B. S. .
JOURNAL OF INTELLIGENT SYSTEMS, 2020, 29 (01) :1122-1134
[36]   Deep learning-based hybrid sentiment analysis with feature selection using optimization algorithm [J].
Daniel, D. Anand Joseph ;
Meena, M. Janaki .
MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (28) :43273-43296
[37]   Sentiment Classification of Spanish Reviews: An Approach based on Feature Selection and Machine Learning Methods [J].
del Pilar Salas-Zarate, Maria ;
Andres Paredes-Valverde, Mario ;
Limon-Romero, Jorge ;
Tlapa, Diego ;
Baez-Lopez, Yolanda .
JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2016, 22 (05) :691-708
[38]   Feature selection and ensemble construction: A two-step method for aspect based sentiment analysis [J].
Akhtar, Md Shad ;
Gupta, Deepak ;
Ekbal, Asif ;
Bhattacharyya, Pushpak .
KNOWLEDGE-BASED SYSTEMS, 2017, 125 :116-135
[39]   Sentiment Analysis on Movie Reviews Using Ensemble Features and Pearson Correlation Based Feature Selection [J].
Rangkuti, Fachrul Rozy Saputra ;
Fauzi, M. Ali ;
Sari, Yuita Arum ;
Sari, Eka Dewi Lukmana .
PROCEEDINGS OF 2018 3RD INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY (SIET 2018), 2018, :88-91
[40]   Efficient feature selection techniques for sentiment analysis [J].
Avinash Madasu ;
Sivasankar Elango .
Multimedia Tools and Applications, 2020, 79 :6313-6335