A BERT-Span model for Chinese named entity recognition in rehabilitation medicine

被引:5
作者
Zhong, Jinhong [1 ]
Xuan, Zhanxiang [1 ]
Wang, Kang [1 ]
Cheng, Zhou [1 ]
机构
[1] Hefei Univ Technol, Sch Management, Hefei, Anhui, Peoples R China
关键词
Rehabilitation medicine; Named entity recognition; Text annotation; BERT-Span;
D O I
10.7717/peerj-cs.1535
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background: Due to various factors such as the increasing aging of the population and the upgrading of people's health consumption needs, the demand group for rehabilitation medical care is expanding. Currently, China's rehabilitation medical care encounters several challenges, such as inadequate awareness and a scarcity of skilled professionals. Enhancing public awareness about rehabilitation and improving the quality of rehabilitation services are particularly crucial. Named entity recognition is an essential first step in information processing as it enables the automated extraction of rehabilitation medical entities. These entities play a crucial role in subsequent tasks, including information decision systems and the construction of medical knowledge graphs.Methods: In order to accomplish this objective, we construct the BERT-Span model to complete the Chinese rehabilitation medicine named entity recognition task. First, we collect rehabilitation information from multiple sources to build a corpus in the field of rehabilitation medicine, and fine-tune Bidirectional Encoder Representation from Transformers (BERT) with the rehabilitation medicine corpus. For the rehabilitation medicine corpus, we use BERT to extract the feature vectors of rehabilitation medicine entities in the text, and use the span model to complete the annotation of rehabilitation medicine entities.Result: Compared to existing baseline models, our model achieved the highest F1 value for the named entity recognition task in the rehabilitation medicine corpus. The experimental results demonstrate that our method outperforms in recognizing both long medical entities and nested medical entities in rehabilitation medical texts.Conclusion: The BERT-Span model can effectively identify and extract entity knowledge in the field of rehabilitation medicine in China, which supports the construction of the knowledge graph of rehabilitation medicine and the development of the decision-making system of rehabilitation medicine.
引用
收藏
页数:16
相关论文
共 40 条
[1]   Chinese clinical named entity recognition via multi-head self-attention based BiLSTM-CRF [J].
An, Ying ;
Xia, Xianyun ;
Chen, Xianlai ;
Wu, Fang-Xiang ;
Wang, Jianxin .
ARTIFICIAL INTELLIGENCE IN MEDICINE, 2022, 127
[2]  
Bhatia Parminder, 2020, PRECISION HLTH MED D, P69, DOI DOI 10.1007/978-3-030-24409-5_7
[3]  
Committee NHaW, 2012, China health and wellness statistical yearbook
[4]  
Committee NHaW, 2021, Opinions on accelerating the development of rehabilitation medical work
[5]  
Devlin J, 2019, Arxiv, DOI arXiv:1810.04805
[6]   Deep learning for named entity recognition on Chinese electronic medical records: Combining deep transfer learning with multitask bi-directional LSTM RNN [J].
Dong, Xishuang ;
Chowdhury, Shanta ;
Qian, Lijun ;
Li, Xiangfang ;
Guan, Yi ;
Yang, Jinfeng ;
Yu, Qiubin .
PLOS ONE, 2019, 14 (05)
[7]  
Dong XS, 2016, 2016 NEW YORK SCIENTIFIC DATA SUMMIT (NYSDS)
[8]  
Feng ZL, 2020, LANCET, V396, P1362, DOI 10.1016/S0140-6736(20)32136-X
[9]  
Florez E., 2018, International Workshop on Medication and Adverse Drug Event Detection, P7
[10]  
Fukuda K, 1998, Pac Symp Biocomput, P707