Document-level relation extraction via graph transformer networks and temporal convolutional networks

被引:18
作者
Shi, Yong [2 ,4 ]
Xiao, Yang [1 ,2 ]
Quan, Pei [1 ,2 ]
Lei, MingLong [3 ]
Niu, Lingfeng [2 ,4 ]
机构
[1] Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 101408, Peoples R China
[2] Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
[3] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[4] Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Document-level relation extraction; Heterogeneous graph; Temporal convolutional networks; REPRESENTATION; MODEL;
D O I
10.1016/j.patrec.2021.06.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Relation Extraction (RE) aims at extracting meaningful relation facts between entities in texts. It is an important semantic processing task in the field of natural language processing (NLP) and has many ap-plications. Traditional RE focuses on extracting entity relationships from a single input sentence. Recently, the research scope has been extended from sentence level to document level. However, compared with sentence-level RE, document-level RE, which needs to identify the inter-sentence relations from entities scattered in different sentences, is more complex and still lacks of solutions. To solve this problem, we propose a novel document-level RE method based on Heterogeneous Graph Neural Networks in this pa -per. Concretely, to obtain token embeddings containing long-distance dependency signals well, we encode the document with Temporal Convolutional Networks, whose dilated convolution and residual structure allow the effective and efficient preservation of historical information. To better describe the interaction between different elements, we construct the input documents as heterogeneous graphs with different node and edge types and utilize Graph Transformer Networks to generate semantic paths. Numerical experiments on two document-level biomedical datasets demonstrate the effectiveness of the proposed method. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:150 / 156
页数:7
相关论文
共 48 条
[1]  
Bai S., 2018, arXiv1803.01271
[2]  
Christopoulou F., 2019, EMNLP IJCNLP, P4927
[3]  
Christopoulou F, 2018, PROCEEDINGS OF THE 56TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, P81
[4]   KBQA: Learning Question Answering over QA Corpora and Knowledge Bases [J].
Cui, Wanyun ;
Xiao, Yanghua ;
Wang, Haixun ;
Song, Yangqiu ;
Hwang, Seung-won ;
Wang, Wei .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (05) :565-576
[5]  
Nguyen DQ, 2018, SIGBIOMED WORKSHOP ON BIOMEDICAL NATURAL LANGUAGE PROCESSING (BIONLP 2018), P129
[6]  
De Cao N, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P2306
[7]   metapath2vec: Scalable Representation Learning for Heterogeneous Networks [J].
Dong, Yuxiao ;
Chawla, Nitesh V. ;
Swami, Ananthram .
KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, :135-144
[8]   Distributed representation learning for knowledge graphs with entity descriptions [J].
Fan, Miao ;
Zhou, Qiang ;
Zheng, Thomas Fang ;
Grishman, Ralph .
PATTERN RECOGNITION LETTERS, 2017, 93 :31-37
[9]   Graph based model for information retrieval using a stochastic local search [J].
Farhi, Sidali Hocine ;
Boughaci, Dalila .
PATTERN RECOGNITION LETTERS, 2018, 105 :234-239
[10]   HIN2Vec: Explore Meta-paths in Heterogeneous Information Networks for Representation Learning [J].
Fu, Tao-yang ;
Lee, Wang-Chien ;
Lei, Zhen .
CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, :1797-1806