Sentiment Analysis of Movie Reviews: A study on Feature Selection & Classification Algorithms

被引:0
作者
Sahu, Tirath Prasad [1 ]
Ahuja, Sanjeev [1 ]
机构
[1] Natl Inst Technol, Raipur, Madhya Pradesh, India
来源
2016 INTERNATIONAL CONFERENCE ON MICROELECTRONICS, COMPUTING AND COMMUNICATIONS (MICROCOM) | 2016年
关键词
Feature selection; Movie Review; Sentiment Analysis; Information Retrieval; Opinion Mining; Classifier;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Sentiment analysis is a sub-domain of opinion mining where the analysis is focused on the extraction of emotions and opinions of the people towards a particular topic from a structured, semi-structured or unstructured textual data. In this paper, we try to focus our task of sentiment analysis on IMDB movie review database. We examine the sentiment expression to classify the polarity of the movie review on a scale of 0(highly disliked) to 4(highly liked) and perform feature extraction and ranking and use these features to train our multi-label classifier to classify the movie review into its correct label. Due to lack of strong grammatical structures in movie reviews which follow the informal jargon, an approach based on structured N-grams has been followed. In addition, a comparative study on different classification approaches has been performed to determine the most suitable classifier to suit our problem domain. We conclude that our proposed approach to sentiment classification supplements the existing rating movie rating systems used across the web and will serve as base to future researches in this domain. "Our approach using classification techniques has the best accuracy of 88.95%."
引用
收藏
页数:6
相关论文
共 12 条
[1]  
Andreevskaia Alina, 2006, EACL, V6
[2]  
Annett M, 2008, LECT NOTES ARTIF INT, V5032, P25
[3]  
[Anonymous], LREC
[4]  
[Anonymous], 2004, Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics, DOI 10.3115/1218955.1218990
[5]  
[Anonymous], 2010, LREC
[6]  
[Anonymous], 2007, ICWSM
[7]  
[Anonymous], 2002, P ACL 02 C EMP METH
[8]  
Mullen Tony., 2004, EMNLP, V4
[9]   Sentiment analysis: A combined approach [J].
Prabowo, Rudy ;
Thelwall, Mike .
JOURNAL OF INFORMETRICS, 2009, 3 (02) :143-157
[10]  
Singh V. K., 2013, AUT COMP COMM CONTR