Comparison of Feature Selection Methods for Sentiment Analysis

被引:9
作者
El Mrabti, Soufiane [1 ]
Al Achhab, Mohammed [1 ]
Lazaar, Mohamed [1 ]
机构
[1] Abdelmalek Essaadi Univ, ENSA, Tetouan, Morocco
来源
BIG DATA, CLOUD AND APPLICATIONS, BDCA 2018 | 2018年 / 872卷
关键词
Sentiment analysis; Feature selection; Text classification; Natural language processing; ALGORITHMS;
D O I
10.1007/978-3-319-96292-4_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment analysis is process of deriving the opinion or attitude expressed in input text. For the classification problem, feature selection aims to select features that are capable of discriminating samples that belong to different classes. This paper evaluates the performance of three feature selection methods (MI, CHI and ANOVA) combined with three machine learning based classification techniques (NB, SVM and KNN) for sentiment analysis on online movie reviews dataset. The paper shows that feature selection is important task for sentiment based classification.
引用
收藏
页码:261 / 272
页数:12
相关论文
共 40 条
[1]   Sentiment analysis in multiple languages: Feature selection for opinion classification in Web forums [J].
Abbasi, Ahmed ;
Chen, Hsinchun ;
Salem, Arab .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2008, 26 (03)
[2]  
Annett M, 2008, LECT NOTES ARTIF INT, V5032, P25
[3]  
[Anonymous], 2004, Proceedings of the 42nd annual meeting on Association for Computational Linguistics, DOI DOI 10.3115/1218955.1218990
[4]  
[Anonymous], P ANN M ASS COMP LIN
[5]  
[Anonymous], 1997, INT C MACHINE LEARNI
[6]  
[Anonymous], 2004, COLING 2004 P 20 INT
[7]  
[Anonymous], 2012, Mining text data
[8]  
[Anonymous], 2010, Journal of Machine Learning Research, DOI DOI 10.5555/1756006.1859899
[9]  
Boiy Erik, 2007, 11th International Conference on Electronic Publishing. Openness in Digital Publishing: Awareness, Discovery and Access, P349
[10]  
Bronshtein Adi., 2017, A Quick Introduction to K-Nearest Neighbors Algorithm