A short text sentiment-topic model for product review analysis

被引:0
|
作者
Xiong S.-F. [1 ,2 ]
Ji D.-H. [1 ]
机构
[1] Computer School of Wuhan University, Wuhan
[2] Pingdingshan University, Pingdingshan
来源
基金
中国国家自然科学基金;
关键词
Sentiment classification; Sentiment topic model; Short text topic mode; Text sparse; Topic model;
D O I
10.16383/j.aas.2016.c150591
中图分类号
学科分类号
摘要
Topic and sentiment joint modelling has been successfully used in sentiment analysis for opinion text. However, we have to face the text sparse problem in opinion text when the length of text becomes shorter and shorter with popularity of smart devices. In this paper, we propose a joint sentiment-topic model SSTM (short-text sentiment-topic model) for short text. Unlike the topic model which models the generative process of each document, we directly model the generation of the whole review set. In the generation process of corpus, we sample a word-pair each time, in which the two words have the same sentiment label and topic. We apply SSTM to two real life social media datasets with three tasks. In the experiment, we demonstrate the effectiveness of the model on topic discovery by qualitative analysis. On the quantitative analysis of document level sentiment classification, SSTM model achieves better performance compared with the existing approaches. Copyright © 2016 Acta Automatica Sinica. All rights reserved.
引用
收藏
页码:1227 / 1237
页数:10
相关论文
共 36 条
  • [31] Li C.T., Zhang J.W., Sun J.T., Chen Z., Sentiment topic model with decomposed prior, Proceedings of the 2013 SIAM International Conference on Data Mining, pp. 767-775, (2013)
  • [32] Wang X.R., McCallum A., Topics over time: a non-Markov continuous-time model of topical trends, Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 424-433, (2006)
  • [33] Phan X.H., Nguyen L.M., Horiguchi S., Learning to classify short and sparse text & web with hidden topics from large-scale data collections, Proceedings of the 17th International Conference onWorld WideWeb, pp. 91-100, (2008)
  • [34] Lim K.W., Buntine W., Twitter opinion topic model: extracting product opinions from tweets by leveraging hashtags and sentiment lexicon, Proceedings of the 23rd ACM International Conference on Conference on Information and Knowledge Management, pp. 1319-1328, (2014)
  • [35] Chang J., Boyd-Graber J.L., Gerrish S., Wang C., Blei D.M., Reading tea leaves: how humans interpret topic models, Proceedings of the 2009 Advances in Neural Information Processing Systems, pp. 288-296, (2009)
  • [36] Xie P.T., Xing E.P., Integrating document clustering and topic modeling, Proceedings of the 29th Conference on Uncertainty in Artificial Intelligence, (2013)