Feature Selection Methods in Sentiment Analysis : A Review

被引:1
作者
Khairi, Nurilhami Izzatie [1 ]
Mohamed, Azlinah [2 ]
Yusof, Nor Nadiah [1 ]
机构
[1] Univ Teknol MARA UiTM, Fac Comp & Math Sci, Shah Alam, Malaysia
[2] Univ Teknol MARA UiTM, Adv Analyt Engn Ctr, Fac Comp & Math Sci, Shah Alam, Malaysia
来源
3RD INTERNATIONAL CONFERENCE ON NETWORKING, INFORMATION SYSTEM & SECURITY (NISS'20) | 2020年
关键词
Feature Selection; Filter; Wrapper; Embedded; Hybrid; Sentiment Analysis;
D O I
10.1145/3386723.3387840
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The development of digital tecnnologies nowadays assists people by suggesting opinion, choices, preferences and feelings. This opinion is useful for company's engagement to make certain analysis to know their potential users and personalized their need. However, the information needs extraction to make further analysis. Thus, sentiment analysis is used to extract opinion and others and transform it into meaningful data. During the process of analysis, feature selection method is required to select a subset which consists of relevant features to construct a predictive model. This process requires some conditions during the selection of feature subset. The required conditions for feature selection are that the selected feature subset must be small and relevant for a high dimensional dataset which considers the presence of noise plus there are no redundant features. However, some of the feature selection methods unable to fulfill all conditions. In this research, 40 papers were collected, classified and reviewed. We discussed on the feature selection methods in sentiment analysis based on its level of analysis and make comparison between these methods to know its limitation and advantages. The comparison made between methods are based on its accuracy and CPU performance. Finally, suggest the best/benchmark method for feature selection. The findings obtained from this research shows that hybrid methods obtain the best accuracy and CPU performance compared to the other methods.
引用
收藏
页数:7
相关论文
共 39 条
[31]   A feature selection method based on improved fisher's discriminant ratio for text sentiment classification [J].
Wang, Suge ;
Li, Deyu ;
Song, Xiaolei ;
Wei, Yingjie ;
Li, Hongxia .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) :8696-8702
[32]  
Wang SH, 2015, AAAI CONF ARTIF INTE, P470
[33]  
Xue B., 2012, INFORM SYST RES, P1
[34]  
Yusof Nor Nadiah, 2018, International Journal of Machine Learning and Computing, V8, DOI 10.18178/ijmlc.2018.8.4.719
[35]  
Yusof N.N., 2019, P SOFT COMP DAT SCI, P325, DOI 10.1007/978-981-15-0399-3_26
[36]   Reviewing Classification Approaches in Sentiment Analysis [J].
Yusof, Nor Nadiah ;
Mohamed, Azlinah ;
Abdul-Rahman, Shuzlina .
SOFT COMPUTING IN DATA SCIENCE, SCDS 2015, 2015, 545 :43-53
[37]   Feature selection for optimizing traffic classification [J].
Zhang, Hongli ;
Lu, Gang ;
Qassrawi, Mahmoud T. ;
Zhang, Yu ;
Yu, Xiangzhan .
COMPUTER COMMUNICATIONS, 2012, 35 (12) :1457-1471
[38]   Multi-stage convex relaxation for feature selection [J].
Zhang, Tong .
BERNOULLI, 2013, 19 (5B) :2277-2293
[39]   Heartbeat classification using disease-specific feature selection [J].
Zhang, Zhancheng ;
Dong, Jun ;
Luo, Xiaoqing ;
Choi, Kup-Sze ;
Wu, Xiaojun .
COMPUTERS IN BIOLOGY AND MEDICINE, 2014, 46 :79-89