ADPG: Biomedical entity recognition based on Automatic Dependency Parsing Graph

Cited by: 1
Authors
Yang, Yumeng [1]
Lin, Hongfei [1]
Yang, Zhihao [1]
Zhang, Yijia [2]
Zhao, Di [3]
Huai, Shuaiheng [2]
Affiliations
[1] Dalian Univ Technol, Sch Comp Sci & Technol, Dalian, Peoples R China
[2] Dalian Maritime Univ, Sch Informat Sci & Technol, Dalian, Peoples R China
[3] Dalian Minzu Univ, Sch Comp Sci & Engn, Dalian, Peoples R China
Funding
China Postdoctoral Science Foundation
Keywords
NER; Tree-transformer; Dependency parsing; Biomedical;
DOI
10.1016/j.jbi.2023.104317
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification codes
081203; 0835
Abstract
Named entity recognition is a key task in text mining. In the biomedical field, entity recognition focuses on extracting key information from large-scale biomedical texts for downstream information extraction tasks. Biomedical literature contains a large amount of text with long-range dependencies, and previous studies have used external syntactic parsing tools to capture word dependencies within sentences in order to recognize nested biomedical entities. However, adding external parsing tools often introduces unnecessary noise into the auxiliary task and cannot improve entity recognition performance in an end-to-end way. Therefore, we propose a novel automatic dependency parsing approach, the ADPG model, which fuses syntactic structure information in an end-to-end way to recognize biomedical entities. Specifically, the method uses a multilayer Tree-Transformer structure to automatically extract the semantic representation and syntactic structure of sentences with long-range dependencies, and then applies a multilayer graph attention network (GAT) to extract the dependency paths between words in that syntactic structure, improving the performance of biomedical entity recognition. We evaluated the ADPG model on three biomedical-domain datasets and one news-domain dataset. The experimental results show that the model achieves state-of-the-art results on all four datasets and generalizes to some degree beyond the biomedical domain. Our model is released on GitHub: https://github.com/Yumeng-Y/ADPG.
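The graph attention step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it is a generic single-head graph attention layer in plain Python, and it assumes the dependency structure is given as a hand-written list of head indices, whereas ADPG derives the structure automatically from the Tree-Transformer. The names `heads_to_adj` and `gat_layer`, and the toy inputs, are hypothetical.

```python
import math

def heads_to_adj(heads):
    """Build undirected neighbor lists with self-loops from head indices.
    heads[i] is the index of token i's head, or -1 for the root (assumed encoding)."""
    adj = [{i} for i in range(len(heads))]
    for child, head in enumerate(heads):
        if head >= 0:
            adj[child].add(head)
            adj[head].add(child)
    return [sorted(neighbors) for neighbors in adj]

def matvec(W, h):
    """Multiply matrix W (list of rows) by vector h."""
    return [sum(w * x for w, x in zip(row, h)) for row in W]

def leaky_relu(x, slope=0.2):
    return x if x > 0 else slope * x

def gat_layer(H, adj, W, a):
    """One single-head graph attention step.
    H: node feature vectors; adj: neighbor index lists;
    W: projection matrix; a: attention vector of length 2 * out_dim."""
    Z = [matvec(W, h) for h in H]  # project every node
    out = []
    for i, neighbors in enumerate(adj):
        # Unnormalized score for each edge (i, j): LeakyReLU(a . [z_i ; z_j])
        scores = [leaky_relu(sum(p * q for p, q in zip(a, Z[i] + Z[j])))
                  for j in neighbors]
        # Softmax restricted to the neighbors of node i
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        alphas = [e / total for e in exps]
        # Attention-weighted sum of the projected neighbor features
        out.append([sum(al * Z[j][k] for al, j in zip(alphas, neighbors))
                    for k in range(len(Z[i]))])
    return out

# Toy usage: three tokens, token 1 is the root and heads tokens 0 and 2.
adj = heads_to_adj([1, -1, 1])
H = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
W = [[1.0, 0.0], [0.0, 1.0]]   # identity projection, for illustration only
a = [0.0, 0.0, 0.0, 0.0]       # zero attention vector gives uniform weights
updated = gat_layer(H, adj, W, a)
```

Restricting the softmax to dependency neighbors is what lets each token's updated representation draw only on syntactically related words; stacking several such layers lets information travel along longer dependency paths.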
Pages: 11