Understanding More Knowledge Makes the Transformer Perform Better in Document-level Relation Extraction

Cited by: 0
Authors
Chen, Haotian [1 ]
Chen, Yijiang [1 ]
Zhou, Xiangdong [1 ]
Affiliations
[1] Fudan Univ, Sch Comp Sci, Shanghai, Peoples R China
Source
ASIAN CONFERENCE ON MACHINE LEARNING, 2023, Vol. 222
Keywords
Document-level relation extraction; graph-based method; weighted multi-channel Transformer
DOI
Not available
Chinese Library Classification (CLC)
TP18 [Theory of Artificial Intelligence]
Discipline codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Relation extraction plays a vital role in knowledge graph construction. In contrast with traditional relation extraction over a single sentence, extracting relations from multiple sentences as a whole harvests more valuable and richer knowledge. Recently, Transformer-based pre-trained language models (TPLMs) have been widely adopted to tackle document-level relation extraction (DocRE). Graph-based methods, which acquire knowledge between entities to form entity-level relation graphs, have accelerated the development of DocRE by infusing that knowledge into their models. Beyond entity-level knowledge, however, we identify many other kinds of knowledge that help humans extract relations. It remains unclear whether and how they can be adopted to improve the performance of the Transformer, which limits the maximum performance gain of Transformer-based methods. In this paper, we propose a novel weighted multi-channel Transformer (WMCT) that can infuse arbitrarily many kinds of knowledge into the vanilla Transformer. Based on WMCT, we also explore five kinds of knowledge to enhance both its reasoning ability and its expressive power. Our extensive experimental results demonstrate that (1) more knowledge improves the performance of the Transformer and (2) more informative knowledge leads to larger performance gains. We encourage future Transformer-based work to explore more informative knowledge to further improve the Transformer.
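The abstract does not spell out the WMCT architecture, so the sketch below is only a minimal illustration of the general idea it describes: attention logits augmented by a learned, weighted combination of several knowledge channels (e.g., entity-pair or coreference indicator matrices). The class name WeightedMultiChannelAttention, the knowledge_biases tensor layout, and the choice of per-channel scalar weights are assumptions for illustration, not the paper's implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class WeightedMultiChannelAttention(nn.Module):
    """Illustrative sketch (not the paper's method): scaled dot-product
    attention whose logits are shifted by a softmax-weighted sum of
    several knowledge-channel bias matrices."""

    def __init__(self, hidden_dim: int, num_channels: int):
        super().__init__()
        self.q_proj = nn.Linear(hidden_dim, hidden_dim)
        self.k_proj = nn.Linear(hidden_dim, hidden_dim)
        self.v_proj = nn.Linear(hidden_dim, hidden_dim)
        # One learnable scalar weight per knowledge channel.
        self.channel_logits = nn.Parameter(torch.zeros(num_channels))
        self.scale = hidden_dim ** -0.5

    def forward(self, x: torch.Tensor, knowledge_biases: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq_len, hidden_dim)
        # knowledge_biases: (batch, num_channels, seq_len, seq_len)
        q, k, v = self.q_proj(x), self.k_proj(x), self.v_proj(x)
        logits = torch.matmul(q, k.transpose(-1, -2)) * self.scale

        # Normalize channel weights and mix the knowledge channels.
        w = F.softmax(self.channel_logits, dim=0)              # (num_channels,)
        bias = torch.einsum("c,bcij->bij", w, knowledge_biases)

        attn = F.softmax(logits + bias, dim=-1)
        return torch.matmul(attn, v)


if __name__ == "__main__":
    # Five channels mirrors the five kinds of knowledge mentioned in the
    # abstract; the tensors here are random placeholders.
    batch, seq_len, hidden, channels = 2, 8, 64, 5
    layer = WeightedMultiChannelAttention(hidden, channels)
    x = torch.randn(batch, seq_len, hidden)
    biases = torch.randn(batch, channels, seq_len, seq_len)
    print(layer(x, biases).shape)  # torch.Size([2, 8, 64])
```

The per-channel scalar weighting is just one plausible reading of "weighted multi-channel"; the actual model may weight channels per head, per layer, or per token pair.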
Pages: 16