Learning multiple layers of knowledge representation for aspect based sentiment analysis

被引:89
作者
Duc-Hong Pham [1 ,3 ]
Anh-Cuong Le [2 ]
机构
[1] Elect Power Univ, Fac Informat Technol, Hanoi, Vietnam
[2] Ton Duc Thang Univ, Fac Informat Technol, NLP KD Lab, Ho Chi Minh City, Vietnam
[3] Vietnam Natl Univ, Univ Engn & Technol, Fac Informat Technol, Hanoi, Vietnam
关键词
Sentiment analysis; Aspect based sentiment analysis; Representation learning; Multiple layer representation; Compositional vector models; Word embeddings;
D O I
10.1016/j.datak.2017.06.001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Sentiment Analysis is the task of automatically discovering the exact sentimental ideas about a product (or service, social event, etc.) from customer textual comments (i.e. reviews) crawled from various social media resources. Recently, we can see the rising demand of aspect-based sentiment analysis, in which we need to determine sentiment ratings and importance degrees of product aspects. In this paper we propose a novel multi-layer architecture for representing customer reviews. We observe that the overall sentiment for a product is composed from sentiments of its aspects, and in turn each aspect has its sentiments expressed in related sentences which are also the compositions from their words. This observation motivates us to design a multiple layer architecture of knowledge representation for representing the different sentiment levels for an input text. This representation is then integrated into a neural network to form a model for prediction of product overall ratings. We will use the representation learning techniques including word embeddings and compositional vector models, and apply a back propagation algorithm based on gradient descent to learn the model. This model consequently generates the aspect ratings as well as aspect weights (i.e. aspect importance degrees). Our experiment is conducted on a data set of reviews from hotel domain, and the obtained results show that our model outperforms the well-known methods in previous studies.
引用
收藏
页码:26 / 39
页数:14
相关论文
共 52 条
  • [1] Alghunaim Abdulaziz., 2015, VS@HLT-NAACL, P116
  • [2] [Anonymous], 2004, Proceedings of the 42nd annual meeting on Association for Computational Linguistics, DOI DOI 10.3115/1218955.1218990
  • [3] [Anonymous], 2015, P 19 C COMP NAT LANG
  • [4] [Anonymous], 2008, P ACL 08 HLT ASS COM
  • [5] [Anonymous], 2013, P WORKSHOP INT C LEA
  • [6] [Anonymous], 2007, Hlt-naacl
  • [7] [Anonymous], 2011, Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP '11
  • [8] [Anonymous], 2005, Proceedings of the ACM international conference on world wide web
  • [9] [Anonymous], 2014, P INT C INT C MACH L
  • [10] [Anonymous], 2002, P 8 ACM SIGKDD INT C, DOI [DOI 10.1145/775047.775098, 10.1145/775047.775098]