Sentiment classification: a lexical similarity based approach for extracting subjectivity in documents

被引:0
作者
Kiran Sarvabhotla
Prasad Pingali
Vasudeva Varma
机构
[1] International Institute of Information Technology,Search and Information Extraction Lab
来源
Information Retrieval | 2011年 / 14卷
关键词
Social media; Sentiment classification; Subjectivity; Linguistic resources; RSUMM;
D O I
暂无
中图分类号
学科分类号
摘要
With the growth of social media, document sentiment classification has become an active area of research in this decade. It can be viewed as a special case of topical classification applied only to subjective portions of a document (sources of sentiment). Hence, the key task in document sentiment classification is extracting subjectivity. Existing approaches to extract subjectivity rely heavily on linguistic resources such as sentiment lexicons and complex supervised patterns based on part-of-speech (POS) information. This makes the task of subjective feature extraction complex and resource dependent. In this work, we try to minimize the dependency on linguistic resources in sentiment classification. We propose a simple and statistical methodology called review summary (RSUMM) and use it in combination with well-known feature selection methods to extract subjectivity. Our experimental results on a movie review dataset prove the effectiveness of the proposed methodology.
引用
收藏
页码:337 / 353
页数:16
相关论文
empty
未找到相关数据