Document-Level Relation Extraction with Cross-sentence Reasoning Graph

Cited by: 12

Authors:

Liu, Hongfei [1]

Kang, Zhao [1]

Zhang, Lizong [1]

Tian, Ling [1]

Hua, Fujun [2]

Affiliations:

[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China

[2] TROY Informat Technol Co Ltd, Res & Dev Ctr, Chengdu, Peoples R China

Source:

ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT I | 2023 / Vol. 13935

Funding:

National Natural Science Foundation of China;

Keywords:

Deep learning; Relation extraction; Document-level RE;

DOI:

10.1007/978-3-031-33374-3_25

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Relation extraction (RE) has recently moved from the sentence level to the document level, which requires aggregating document information and reasoning over entities and mentions. Existing works place entity nodes and mention nodes with similar representations in the same document-level graph, whose complex edges may introduce redundant information. Furthermore, existing studies focus only on entity-level reasoning paths and do not consider global interactions among entities across sentences. To address these issues, we propose a novel document-level RE model with a GRaph information Aggregation and Cross-sentence Reasoning network (GRACR). Specifically, a simplified document-level graph is constructed to model the semantic information of all mentions and sentences in a document, and an entity-level graph is designed to explore relations between long-distance cross-sentence entity pairs. Experimental results show that GRACR achieves excellent performance on two public document-level RE datasets and is especially effective at extracting potential relations between cross-sentence entity pairs. Our code is available at https://github.com/UESTC-LHF/GRACR.

Pages: 316-328

Page count: 13

