A discourse-aware neural network-based text model for document-level text classification

被引:9
作者
Lee, Kangwook [1 ]
Han, Sanggyu [1 ]
Myaeng, Sung-Hyon [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Sch Comp, 291 Daehak Ro, Daejeon 34141, South Korea
关键词
Deep learning; discourse analysis; neural network; sarcasm detection; sentiment analysis; text classification; text model; RHETORICAL STRUCTURE;
D O I
10.1177/0165551517743644
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Capturing semantics scattered across entire text is one of the important issues for Natural Language Processing (NLP) tasks. It would be particularly critical with long text embodying a flow of themes. This article proposes a new text modelling method that can handle thematic flows of text with Deep Neural Networks (DNNs) in such a way that discourse information and distributed representations of text are incorporate. Unlike previous DNN-based document models, the proposed model enables discourse-aware analysis of text and composition of sentence-level distributed representations guided by the discourse structure. More specifically, our method identifies Elementary Discourse Units (EDUs) and their discourse relations in a given document by applying Rhetorical Structure Theory (RST)-based discourse analysis. The result is fed into a tree-structured neural network that reflects the discourse information including the structure of the document and the discourse roles and relation types. We evaluate the document model for two document-level text classification tasks, sentiment analysis and sarcasm detection, with comparisons against the reference systems that also utilise discourse information. In addition, we conduct additional experiments to evaluate the impact of neural network types and adopted discourse factors on modelling documents vis-a-vis the two classification tasks. Furthermore, we investigate the effects of various learning methods, input units on the quality of the proposed discourse-aware document model.
引用
收藏
页码:715 / 735
页数:21
相关论文
共 38 条
[21]  
Ji YF, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P13
[22]  
Johnson R., 2015, P 2015 C N AM CHAPT, P103, DOI DOI 10.3115/V1/N15-1011
[23]   A Convolutional Neural Network for Modelling Sentences [J].
Kalchbrenner, Nal ;
Grefenstette, Edward ;
Blunsom, Phil .
PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2014, :655-665
[24]  
Kim Y., 2014, P C EMP METH NAT LAN, P1746, DOI DOI 10.3115/V1/D14-1181
[25]  
Li J., 2015, P C EMPIRICAL METHOD, P2304
[26]  
Lin R., 2015, EMNLP, P899
[27]   Exploring performance of clustering methods on document sentiment analysis [J].
Ma, Baojun ;
Yuan, Hua ;
Wu, Ye .
JOURNAL OF INFORMATION SCIENCE, 2017, 43 (01) :54-74
[28]  
Maas A., 2011, P 49 ANN M ASS COMPU, P142
[29]  
Mann W., 1988, Text Interdisciplinary Journal for the Study of Discourse, V8, P243, DOI [10.1515/text.1.1988.8.3.243, DOI 10.1515/TEXT.1.1988.8.3.243]
[30]  
Manning CD., 1999, Foundations of statistical natural language processing