Sentiment classification: a lexical similarity based approach for extracting subjectivity in documents

被引：0

作者：

Kiran Sarvabhotla

Prasad Pingali

Vasudeva Varma

机构：

[1] International Institute of Information Technology,Search and Information Extraction Lab

来源：

Information Retrieval | 2011年 / 14卷

关键词：

Social media; Sentiment classification; Subjectivity; Linguistic resources; RSUMM;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

With the growth of social media, document sentiment classification has become an active area of research in this decade. It can be viewed as a special case of topical classification applied only to subjective portions of a document (sources of sentiment). Hence, the key task in document sentiment classification is extracting subjectivity. Existing approaches to extract subjectivity rely heavily on linguistic resources such as sentiment lexicons and complex supervised patterns based on part-of-speech (POS) information. This makes the task of subjective feature extraction complex and resource dependent. In this work, we try to minimize the dependency on linguistic resources in sentiment classification. We propose a simple and statistical methodology called review summary (RSUMM) and use it in combination with well-known feature selection methods to extract subjectivity. Our experimental results on a movie review dataset prove the effectiveness of the proposed methodology.

引用

页码：337 / 353

页数：16