A Review on Feature Selection Methods for Sentiment Analysis

被引:6
作者
Hung, Lai Po [1 ]
Alfred, Rayner [1 ,2 ]
Hijazi, Mohd Hanafi Ahmad [1 ,2 ]
机构
[1] Univ Malaysia Sabah, Fac Comp & Informat, Kota Kinabalu 88400, Sabah, Malaysia
[2] Univ Malaysia Sabah, AIRU, Kota Kinabalu 88400, Sabah, Malaysia
关键词
Feature Selection; Sentiment Analysis; Filter; Wrapper; Embedded;
D O I
10.1166/asl.2015.6475
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Text documents are normally represented as a feature-document matrix in sentiment analysis. Features can be single words from the text document or more complex pairs extracted by different schemes that adds information in order to enrich the feature-document matrix representation. Having diverse feature types however creates a problem of high dimensionality due to the vast number of features and relations they hold. Thus, feature selection helps in ensuring that effective and efficient sentiment analysis applications can be developed by selecting features that are relevant and informative to assist classifiers to perform better and to reduce the processing load by narrowing down the feature set. This paper highlights methods used for feature selection, namely filter, wrapper and embedded. Prior to feature selection, preprocessing techniques are performed to reduce the amount of features first. This paper is concluded by summarizing this review and outlining the challenges faced and proposing the ensemble feature selection method for sentiment analysis data.
引用
收藏
页码:2952 / 2956
页数:5
相关论文
共 43 条
[1]  
Agarwal B., 2012, P 2 WORKSH SENT AN A, P17
[2]  
[Anonymous], 2013, INT J COMPUTER APPL, DOI DOI 10.5120/14573-2697
[3]  
[Anonymous], 2009, P 14 AUSTR DOC COMP
[4]  
[Anonymous], 2024, P INT SCI CONFERENCE
[5]  
[Anonymous], 1997, ICML
[6]  
[Anonymous], INT J INNOVATIVE COM
[7]  
[Anonymous], P SIAM INT C DAT MIN
[8]  
[Anonymous], 2011, Modern Information Retrieval-the Concepts and Technology Behind Search
[9]  
Brank J., 2005, TECH REP
[10]   On feature selection through clustering [J].
Butterworth, R ;
Piatetsky-Shapiro, G ;
Simovici, DA .
FIFTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2005, :581-584