Document-Level Relation Extraction with Cross-sentence Reasoning Graph

被引:12
作者
Liu, Hongfei [1 ]
Kang, Zhao [1 ]
Zhang, Lizong [1 ]
Tian, Ling [1 ]
Hua, Fujun [2 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[2] TROY Informat Technol Co Ltd, Res & Dev Ctr, Chengdu, Peoples R China
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT I | 2023年 / 13935卷
基金
中国国家自然科学基金;
关键词
Deep learning; Relation extraction; Document-level RE;
D O I
10.1007/978-3-031-33374-3_25
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relation extraction (RE) has recently moved from the sentence-level to document-level, which requires aggregating document information and using entities and mentions for reasoning. Existing works put entity nodes and mention nodes with similar representations in a document-level graph, whose complex edges may incur redundant information. Furthermore, existing studies only focus on entity-level reasoning paths without considering global interactions among entities cross-sentence. To these ends, we propose a novel document-level RE model with a GRaph information Aggregation and Cross-sentence Reasoning network (GRACR). Specifically, a simplified document-level graph is constructed to model the semantic information of all mentions and sentences in a document, and an entity-level graph is designed to explore relations of long-distance cross-sentence entity pairs. Experimental results show that GRACR achieves excellent performance on two public datasets of document-level RE. It is especially effective in extracting potential relations of cross-sentence entity pairs. Our code is available at https://github.com/UESTC-LHF/GRACR.
引用
收藏
页码:316 / 328
页数:13
相关论文
共 26 条
  • [1] Christopoulou F, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P4925
  • [2] Graph Fusion Network for Text Classification
    Dai, Yong
    Shou, Linjun
    Gong, Ming
    Xia, Xiaolin
    Kang, Zhao
    Xu, Zenglin
    Jiang, Daxin
    [J]. KNOWLEDGE-BASED SYSTEMS, 2022, 236
  • [3] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [4] Structure-Preserving Graph Representation Learning
    Fang, Ruiyi
    Wen, Liangjian
    Kang, Zhao
    Liu, Jianzhuang
    [J]. 2022 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2022, : 927 - 932
  • [5] Jia R, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P3693
  • [6] BioBERT: a pre-trained biomedical language representation model for biomedical text mining
    Lee, Jinhyuk
    Yoon, Wonjin
    Kim, Sungdong
    Kim, Donghyeon
    Kim, Sunkyu
    So, Chan Ho
    Kang, Jaewoo
    [J]. BIOINFORMATICS, 2020, 36 (04) : 1234 - 1240
  • [7] Li B., 2020, P 28 INT C COMP LING, P1551
  • [8] BioCreative V CDR task corpus: a resource for chemical disease relation extraction
    Li, Jiao
    Sun, Yueping
    Johnson, Robin J.
    Sciaky, Daniela
    Wei, Chih-Hsuan
    Leaman, Robert
    Davis, Allan Peter
    Mattingly, Carolyn J.
    Wiegers, Thomas C.
    Lu, Zhiyong
    [J]. DATABASE-THE JOURNAL OF BIOLOGICAL DATABASES AND CURATION, 2016,
  • [9] Li JY, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, P1359
  • [10] Multilayer graph contrastive clustering network
    Liu, Liang
    Kang, Zhao
    Ruan, Jiajia
    He, Xixu
    [J]. INFORMATION SCIENCES, 2022, 613 : 256 - 267