Document sentiment classification by exploring description model of topical terms

被引:35
作者
Hu, Yi [1 ,2 ]
Li, Wenjie [2 ]
机构
[1] Tencent Commun Corp, Dept Search Platform, Shenzhen, Peoples R China
[2] Hong Kong Polytech Univ, Dept Comp, Hong Kong, Hong Kong, Peoples R China
关键词
Sentiment classification; Topical term; Topical Term Description Model; Maximum spanning tree;
D O I
10.1016/j.csl.2010.07.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment classification is used to identify whether the opinion expressed in a document is positive or negative. In this paper, we present an approach to do documentary-level sentiment classification by modeling description of topical terms. The motivation of this work stems from the observation that the global document classification will benefit greatly by examining the way of a topical term to give opinion in its local sentence context. Two sentence-level sentiment description models, namely positive and negative Topical Term Description Models, are constructed for each topical term. When analyzing a document, the Topical Term Description Models generate divergence to support the classification of its sentiment at the sentence-level which in turn can be used to decide the whole document classification collectively. The results of the experiments prove that our proposed method is effective. It is also shown that our results are comparable to the state-of-art results on a publicly available movie review corpus and a Chinese digital product review corpus. This is quite encouraging to us and motivates us to have further investigation on the development of a more effective topical term related description model in the future. (C) 2010 Elsevier Ltd. All rights reserved.
引用
收藏
页码:386 / 403
页数:18
相关论文
共 28 条
  • [1] [Anonymous], 2002, P EMNLP
  • [2] [Anonymous], P 20 INT C COMPUTATI, DOI DOI 10.3115/1220355.1220555
  • [3] [Anonymous], 2009, P JOINT C 47 ANN M A
  • [4] [Anonymous], P AAAI 2006 SPRING S
  • [5] BAI X, 2004, P INT WORKSH MIN SEM
  • [6] Banea C., 2008, P EMNLP
  • [7] BASILI R, 1999, P IJCAI MACH LEARN I
  • [8] Bo Pang, 2008, Foundations and Trends in Information Retrieval, V2, P1, DOI 10.1561/1500000001
  • [9] CHEN S.F., 1998, EMPIRICAL STUDY SMOO
  • [10] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38