COREX: Document-level Relation Extraction Framework with Consistent Two-Hop Reasoning and Evidence Sentence Prediction

被引:0
作者
Zhao, Silan [1 ]
Li, Chunping [1 ]
机构
[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China
来源
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024 | 2024年
关键词
Natural Language Processing; Document-level Relation Extraction; Evidence-Enhanced Framework; Multi-Hop Reasoning; Class Imbalance;
D O I
10.1109/IJCNN60899.2024.10650330
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document-level relation extraction (DocRE) focuses on identifying relationships between entities within a document. It differs from sentence-level extraction as it involves understanding relationships over longer distances, including across sentences or paragraphs. A major challenge in DocRE is to focus on crucial evidence and bridge entities, defined as entities that link others in relationships spanning multiple sentences. Therefore, we propose a novel, efficient representation to support two-hop reasoning. Supervised by evidence prediction and relation extraction, this explicit multi-hop reasoning representation notably improves long-distance reasoning capabilities in DocRE. To address the issue of class imbalance in DocRE, we revise the Adaptive Focal Loss (AFL) and incorporate it into our framework. Moreover, we develop a Weighted Fusion Layer, designed to optimize the utilization of evidence, thereby ensuring a comprehensive yet focused analytical perspective. Our framework, named COREX, has shown superior performance in extensive experiments on three benchmark datasets. Its performance exceeds the baseline by 1.8/1.58 in Ign F1/Inter F1 score on the DocRED leaderboard.
引用
收藏
页数:8
相关论文
共 28 条
[1]  
Cheng Q, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, P2819
[2]  
Huang K, 2021, REPL4NLP 2021: PROCEEDINGS OF THE 6TH WORKSHOP ON REPRESENTATION LEARNING FOR NLP, P307
[3]   BioCreative V CDR task corpus: a resource for chemical disease relation extraction [J].
Li, Jiao ;
Sun, Yueping ;
Johnson, Robin J. ;
Sciaky, Daniela ;
Wei, Chih-Hsuan ;
Leaman, Robert ;
Davis, Allan Peter ;
Mattingly, Carolyn J. ;
Wiegers, Thomas C. ;
Lu, Zhiyong .
DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
[4]  
Li JY, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, P1359
[5]  
Nan GS, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P1546
[6]  
Peng H, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P3661
[7]  
Quirk C, 2017, 15TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2017), VOL 1: LONG PAPERS, P1171
[8]   The Graph Neural Network Model [J].
Scarselli, Franco ;
Gori, Marco ;
Tsoi, Ah Chung ;
Hagenbuchner, Markus ;
Monfardini, Gabriele .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (01) :61-80
[9]  
Soares Livio Baldini, 2019, P 57 ANN M ASS COMP
[10]  
Tan QY, 2022, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), P1672