Dual-Channel and Hierarchical Graph Convolutional Networks for document-level relation extraction

被引:14
作者
Sun, Qi [1 ]
Xu, Tiancheng [2 ]
Zhang, Kun [1 ]
Huang, Kun [1 ]
Lv, Laishui [1 ]
Li, Xun [1 ]
Zhang, Ting [1 ]
Dore-Natteh, Doris [1 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Chinese Med, Key Lab Acupuncture & Med Res, Minist Educ, Nanjing 210023, Peoples R China
关键词
Document-level relation extraction; Graph Convolutional Network; Clinical data;
D O I
10.1016/j.eswa.2022.117678
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Document-level relation extraction aims to infer complex semantic relations among entities in an entire document. Compared with the sentence-level relation extraction, document-level relational facts are expressed by multiple mentions across the sentences in a long-distance, requiring excellent reasoning. In this paper, we propose Dual-Channel and Hierarchical Graph Convolutional Networks (DHGCN), which constructs three graphs in token-level, mention-level, and entity-level to model complex interactions among different semantic representations across the document. Based on the multi-level graphs, we apply the Graph Convolutional Network (GCN) for each level to aggregate the relevant information scattered throughout the document for better inferring the implicit relations. Moreover, we propose a dual-channel encoder to capture structural and contextual information simultaneously, which also supplies the contextual representation for the higher layer to avoid losing low-dimension information. Our DHGCN yields significant improvements over the state-of-the-art methods by 2.75, 5.5, and 3.5 F, on DocRED, CDR, and GDA, respectively, which are popular document-level relation extraction datasets. Furthermore, to demonstrate the effectiveness of our method, we evaluate DHGCN on a fine gained clinical document-level dataset Symptom-Acupoint Relation (SAR) proposed by ourselves and available at https://github.com/QiSun123/SAR. The experimental results illustrate that DHGCN is able to infer more valuable relations among entities in the document.
引用
收藏
页数:10
相关论文
共 55 条
  • [1] Cai R, 2016, PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, P756
  • [2] Chen Guimin, 2021, 59 ANN M ASS COMP, V1, P4458
  • [3] Enhancing structure modeling for relation extraction with fine-grained gating and co-attention
    Chen, Yubo
    Wu, Chuhan
    Huang, Yongfeng
    [J]. NEUROCOMPUTING, 2022, 467 : 282 - 291
  • [4] Christopoulou F, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P4925
  • [5] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [6] Eberts M, 2021, 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), P3650
  • [7] Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1007/978-3-642-24797-2, 10.1162/neco.1997.9.1.1]
  • [8] He ZQ, 2018, AAAI CONF ARTIF INTE, P5795
  • [9] Ji GL, 2017, AAAI CONF ARTIF INTE, P3060
  • [10] Jia R, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P3693