Contextual Sentiment Neural Network for Document Sentiment Analysis

被引:1
作者
Tomoki Ito
Kota Tsubouchi
Hiroki Sakaji
Tatsuo Yamashita
Kiyoshi Izumi
机构
[1] The University of Tokyo,Graduate School of Engineering
[2] Yahoo Japan Corporation,undefined
来源
Data Science and Engineering | 2020年 / 5卷
关键词
Interpretable neural networks; Text mining; Support system;
D O I
暂无
中图分类号
学科分类号
摘要
Although deep neural networks are excellent for text sentiment analysis, their applications in real-world practice are occasionally limited owing to their black-box property. In this study, we propose a novel neural network model called contextual sentiment neural network (CSNN) model that can explain the process of its sentiment analysis prediction in a way that humans find natural and agreeable and can catch up the summary of the contents. The CSNN has the following interpretable layers: the word-level original sentiment layer, word-level sentiment shift layer, word-level global importance layer, word-level contextual sentiment layer, and concept-level contextual sentiment layer. Because of these layers, this network can explain the process of its document-level sentiment analysis results in a human-like way using these layers. Realizing the interpretability of each layer in the CSNN is a crucial problem in the development of this CSNN because the general back-propagation method cannot realize such interpretability. To realize this interpretability, we propose a novel learning strategy called initialization propagation (IP) learning. Using real textual datasets, we experimentally demonstrate that the proposed IP learning is effective for improving the interpretability of each layer in CSNN. We then experimentally demonstrate that the CSNN has both the high predictability and high explanation ability.
引用
收藏
页码:180 / 192
页数:12
相关论文
共 41 条
[1]  
Li S(2013)Sentiment classification with polarity shifting detection IALP 2013 129-132
[2]  
Wang Z(2017)Towards bootstrapping a polarity shifter lexicon using linguistic features IJCNLP 2017 624-633
[3]  
Lee SYM(2016)Don’t count, predict! an automatic approach to learning sentiment lexicons for short text ACL 2016 219-224
[4]  
Huang C-R(2017)Learning stock market sentiment lexicon and sentiment-oriented word vector from stocktwits CoNLL 2017 301-310
[5]  
Schulder M(2017)On pixel-wise explanations for nonlinear classifier decisions by layer-wise relevance propagation PLoS ONE 10 1-46
[6]  
Wiegand M(1997)Bidirectional recurrent neural networks IEEE Trans Signal Process 45 2673-2681
[7]  
Ruppenhofer J(2012)Spherical k-means clustering J Stat Softw 50 1-22
[8]  
Roth B(2014)Dropout: a simple way to prevent neural networks from overfitting JMLR 15 1929-1958
[9]  
Vo DT(2010)Sentiment classification and polarity shifting COLING 2010 635-643
[10]  
Zhang Y(2005)Recognizing contextual polarity in phrase level sentiment analysis EMNLP 2005 347-354