Reinforcement learning based distantly supervised biomedical named entity recognition

被引:0
作者
Bali, Manish [1 ]
Anandaraj, S. P. [1 ]
机构
[1] Presidency Univ, Dept Comp Sci & Engn, Bengaluru 560064, Karnataka, India
来源
INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS | 2023年 / 17卷 / 02期
关键词
Named entity recognition; reinforcement learning; neural network; Markov decision process; graphical processing unit;
D O I
10.3233/IDT-220205
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Data used by current Biomedical named entity recognition (BioNER) systems has mostly been manually labelled for supervision. However, it might be difficult to find large amounts of annotated data, especially in fields with a high level of specialization, such as biomedical, bioinformatics, and so on. When dictionaries and ontologies are available, which are domain-specific knowledge resources, automatically tagged distantly supervised biomedical training data can be developed. However, any such distantly supervised NER result is normally noisy. The prevalence of false positives and false negatives with this type of autonomously generated data is the main problem that directly affects efficiency. This research investigates distant supervision to detect false positive occurrences in BioNER task. A reinforcement learning technique is employed that is modelled as a graphical processing unit (GPU) accelerated Markov decision process (MDP) with a neural network policy. To deal with false negative cases, we employ a partial annotation conditional random field (CRF) technique. Results on two benchmark datasets show a cutting-edge methodology that can enhance the functionality of the neural NER system. It goes on to show how the proposed approach cuts down on human annotated data for BioNER tasks in Natural Language Processing (NLP).
引用
收藏
页码:317 / 330
页数:14
相关论文
共 39 条
[1]  
Akbik A., 2018, P 27 INT C COMP LING, P1638
[2]  
Alfonseca E., 2012, P 50 ANN M ASS COMP, V2, P54
[3]  
Augenstein I, 2014, LECT NOTES ARTIF INT, V8876, P26, DOI 10.1007/978-3-319-13704-9_3
[4]  
Beltagy I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P3615
[5]  
Boutilier C, PRIORITIZED GOAL DEC
[6]  
Cassandra AR, 1996, IROS 96 - PROCEEDINGS OF THE 1996 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS - ROBOTIC INTELLIGENCE INTERACTING WITH DYNAMIC WORLDS, VOLS 1-3, P963, DOI 10.1109/IROS.1996.571080
[7]  
Chen P, 2013, MARKOV DECISION PROC, P299, DOI [10.2991/isca-13.2013.51, DOI 10.2991/ISCA-13.2013.51]
[8]  
Peters ME, 2018, Arxiv, DOI [arXiv:1802.05365, DOI 10.48550/ARXIV.1802.05365, DOI 10.18653/V1/N18-1202]
[9]  
Feng J, 2018, arXiv
[10]  
Foka AF, 2002, 2002 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-3, PROCEEDINGS, P490, DOI 10.1109/IRDS.2002.1041438