Integrating word status for joint detection of sentiment and aspect in reviews

Cited by: 7
Authors
Bagheri, Ayoub [1, 2]
Affiliations
[1] Univ Utrecht, Fac Social Sci, Dept Methodol & Stat, NL-3508 TC Utrecht, Netherlands
[2] Univ Med Ctr Utrecht, Dept Cardiol, Div Heart & Lungs, Utrecht, Netherlands
Keywords
Aspect-based sentiment analysis; joint sentiment aspect; latent Dirichlet allocation; online reviews; sentiment analysis; topic modelling; MODEL; LDA;
DOI
10.1177/0165551518811458
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
A crucial task in sentiment analysis is aspect detection: the step of selecting the aspects on which opinions are expressed. This step precedes the step of determining whether the opinions on those aspects are positive or negative. This article proposes a novel probabilistic generative topic model for aspect-based sentiment analysis that can discover the latent structure of a large collection of review documents. The proposed joint sentiment-aspect detection model (SAM) is a generative topic model that incorporates the structure of review sentences to detect aspects and sentiments simultaneously. SAM rests on three ideas: generating documents from latent single- and multi-word topics, modelling the word distribution for each topic, and learning the prior distribution over topics in the sentences of documents. SAM introduces a word status so that the model can decide when to sample from a bigram distribution and when to sample from a unigram distribution, and it integrates all these components into one combined model for aspect-based sentiment analysis. We evaluate SAM both qualitatively and quantitatively to show that the model performs the task effectively and improves significantly over standard joint sentiment-aspect models. The proposed model can easily be transferred between domains or languages and can detect the polarity of text data at various levels. For the quantitative analysis, however, we mainly present results for document-level sentiment classification.
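
The word-status idea in the abstract can be illustrated with a toy generative sketch. This is not the SAM model from the article: the function name `generate_sentence`, the parameter `p_bigram`, and the word distributions below are all hypothetical, and the real model additionally places priors on sentiments and aspects and infers the status rather than fixing its probability. The sketch only shows how a binary status per word could switch between a bigram and a unigram distribution.

```python
import random


def generate_sentence(length, unigram, bigram, p_bigram, rng):
    """Sample words; a binary status per word picks bigram or unigram sampling.

    All arguments are illustrative assumptions, not parameters from the article.
    """
    words, statuses = [], []
    for _ in range(length):
        prev = words[-1] if words else None
        # Status 1 (continue a multi-word term) is only possible when the
        # previous word has an entry in the bigram table.
        can_bigram = prev is not None and prev in bigram
        status = 1 if can_bigram and rng.random() < p_bigram else 0
        dist = bigram[prev] if status == 1 else unigram
        vocab, weights = zip(*dist.items())
        words.append(rng.choices(vocab, weights=weights, k=1)[0])
        statuses.append(status)
    return words, statuses


# Hypothetical distributions for a single (sentiment, aspect) pair.
unigram = {"battery": 0.4, "great": 0.3, "screen": 0.3}
bigram = {"battery": {"life": 1.0}}

rng = random.Random(0)
words, statuses = generate_sentence(6, unigram, bigram, p_bigram=0.8, rng=rng)
print(list(zip(words, statuses)))
```

In this sketch a word with status 1 is drawn conditionally on the previous word, so "battery" can be followed by "life" to form a multi-word aspect term, while a word with status 0 is drawn from the topic's unigram distribution alone.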
Pages: 736-755
Number of pages: 20