Document-level relation extraction using evidence reasoning on RST-GRAPH

被引:38
作者
Wang, Hailin [1 ,2 ]
Qin, Ke [1 ,2 ]
Lu, Guoming [1 ,2 ]
Yin, Jin [1 ,2 ]
Zakari, Rufai Yusuf [1 ,2 ]
Owusu, Jim Wilson [1 ,2 ]
机构
[1] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, 2006 Xiyuan Ave, Chengdu 611731, Sichuan, Peoples R China
[2] Trusted Cloud Comp & Big Data Key Lab Sichuan Pro, Chengdu, Peoples R China
关键词
Relation extraction; Rhetorical structure theory; Document-level; Evidence reasoning;
D O I
10.1016/j.knosys.2021.107274
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document-level relation extraction (RE) is a more challenging task providing a new perspective to solve larger and more complex text mining work. Recent document-level RE research following traditional sentence-level methods focus on learning a character representation of one sentence, highlighting the importance of partial words. When applied to a document with a longer text, more entities and more complicated semantics, these traditional methods lacking the ability to select evidence and further reason on them, may not be sufficient to identify all potential relation in the document in an intuitive way. However, some textual semantic associations between entities as well as logical structure of document could provide evidence to the potential relation and explain the reasoning process. Hence, by introducing Rhetorical Structure Theory (RST) as an external knowledge, this article attempts to select appropriate evidence and show reasoning process on a new document-graph, RST-GRAPH, which indicators valid semantic associations between multiple text units through RST and incorporates a set of reasoning modules to capture efficient evidence. Our experimental result shows that the RST-GRAPH, as the first work introducing RST, builds associations in an interpretable manner along discourse relation link between any entities compared to previous graph-oriented models, exhibits a clear process of evidence reasoning with competitive performance on DocRED dataset, and outperforms most existing models on the value of Ign F1. (C) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页数:12
相关论文
共 49 条
[21]  
Ji YF, 2014, PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P13
[22]  
Jin LF, 2020, AAAI CONF ARTIF INTE, V34, P8034
[23]   Global Stock Market Prediction Based on Stock Chart Images Using Deep Q-Network [J].
Lee, Jinho ;
Kim, Raehyun ;
Koh, Yookyung ;
Kang, Jaewoo .
IEEE ACCESS, 2019, 7 :167260-167277
[24]  
Ma XZ, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P1064
[25]  
Mann William C, 1988, Text-interdisciplinary Journal for the Study of Discourse, V8, P243, DOI DOI 10.1515/TEXT.1.1988.8.3.243
[26]  
McCallum A., 2018, ARXIV PREPRINT ARXIV
[27]  
Peng Nanyun, 2017, Transactions of the Association for Computational Linguistics, V5, P101
[28]  
Peng Yan, 2013, Advances in Data Mining. Applications and Theoretical Aspects. 13th Industrial Conference, ICDM 2013. Proceedings: LNCS 7987, P70, DOI 10.1007/978-3-642-39736-3_6
[29]  
Pennington Jeffrey, 2014, Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), P1532, DOI DOI 10.3115/V1/D14-1162
[30]  
Quirk C, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1171