Biomedical document-level relation extraction with thematic capture and localized entity pooling

Cited by: 0
Authors
Li, Yuqing [1]
Shao, Xinhui [1]
Affiliations
[1] Northeastern Univ, Coll Sci, Dept Math, Shenyang, Peoples R China
Keywords
Document-level relation extraction; Local entity pooling; Thematic capture
DOI
10.1016/j.jbi.2024.104756
Chinese Library Classification number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Compared with sentence-level relation extraction, document-level relation extraction is more challenging because a document typically contains multiple entities, and one entity may be related to several others. Existing methods often rely on graph structures to capture path representations between entity pairs. This paper instead introduces local entity pooling, an approach that relies solely on the pre-trained model to identify the bridge entity related to the current entity pair and to generate the reasoning path representation. This technique effectively mitigates the multi-entity problem. Additionally, the model leverages the multi-entity and multi-label characteristics of the document to acquire the document's thematic representation, thereby enhancing the document-level relation extraction task. Experimental evaluations were conducted on two biomedical datasets, CDR and GDA. Our TCLEP (Thematic Capture and Localized Entity Pooling) model achieved Macro-F1 scores of 71.7% and 85.3%, respectively. We also incorporated the local entity pooling and thematic capture modules into the state-of-the-art model, which improved its performance by 1.5% and 0.2% on the respective datasets. These results demonstrate the effectiveness of the proposed approach.
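For readers unfamiliar with the metric reported in the abstract, the following is a minimal sketch of the generic Macro-F1 definition (the unweighted mean of per-class F1 scores). It is not the authors' evaluation script: the function name `macro_f1` and the example labels `CID` and `NA` are assumptions made for illustration, and the abstract does not say whether any class is excluded from the average.

```python
from collections import Counter


def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over every label seen in gold or pred.

    Generic definition only; the paper's exact protocol is not given in the abstract.
    """
    labels = sorted(set(gold) | set(pred))
    if not labels:
        return 0.0
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1  # correct prediction for class g
        else:
            fp[p] += 1  # predicted p, but it was wrong
            fn[g] += 1  # gold class g was missed
    scores = []
    for label in labels:
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical labels for four entity pairs (illustrative, not from the paper)
gold = ["CID", "CID", "NA", "NA"]
pred = ["CID", "NA", "NA", "NA"]
score = macro_f1(gold, pred)
```

Because every class contributes equally to the average, a rare relation type influences the score as much as a frequent one.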
Pages: 9