An imConvNet-based deep learning model for Chinese medical named entity recognition

Cited by: 4

Authors:

Zheng, Yuchen [1]

Han, Zhenggong [2]

Cai, Yimin [1]

Duan, Xubo [1]

Sun, Jiangling [3]

Yang, Wei [1]

Huang, Haisong [2]

Affiliations:

[1] Guizhou Univ, Med Coll, Guiyang 550025, Guizhou, Peoples R China

[2] Guizhou Univ, Key Lab Adv Mfg Technol, Minist Educ, Guiyang 550025, Guizhou, Peoples R China

[3] Guiyang Hosp Stomatol, Guiyang 550002, Guizhou, Peoples R China

Source:

BMC Medical Informatics and Decision Making | 2022 / Volume 22 / Issue 01

Keywords:

Named entity recognition; Convolutional neural network; Chinese electronic medical records; BiLSTM-CRF; BERT; Big data; Health; Care

DOI:

10.1186/s12911-022-02049-4

Chinese Library Classification (CLC) number:

R-058

Abstract:

Background: As medical technology has developed, information management in the medical field has matured. Medical big data analysis draws on the large volume of medical and health data stored in electronic medical systems, such as electronic medical records and medical reports. How to fully exploit the information contained in these data has long been a subject of research. Named entity recognition (NER) is the basis for text mining, and it has particular difficulties in the medical field: scarce text resources and a large number of specialized domain terms remain significant challenges for medical NER.

Methods: We improved the convolutional neural network model (imConvNet) to obtain additional text features, while retaining the classical BERT pre-trained model and the BiLSTM model for named entity recognition. The imConvNet model extracts additional word vector features to improve named entity recognition accuracy. The proposed model, BERT-imConvNet-BiLSTM-CRF, is composed of four layers: a BERT embedding layer, which produces the word embedding vectors; an imConvNet layer, which captures the context features of each character; a BiLSTM (Bidirectional Long Short-Term Memory) layer, which captures long-distance dependencies; and a CRF (Conditional Random Field) layer, which labels characters based on their features and transition rules.

Results: The average F1 score on the public medical data set yidu-s4k reached 91.38% when imConvNet was combined with the classical model; on real electronic medical record text on impacted wisdom teeth, the model's F1 score was 93.89%. Both results are better than those of the classical models.

Conclusions: The proposed model (imConvNet) significantly improves the recognition accuracy of Chinese medical named entities and applies to various medical corpora.
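The abstract describes the final CRF layer as labeling characters from their features and transition rules. As a rough illustration of that decoding step only, the following is a minimal Viterbi sketch in plain Python. It is not the authors' implementation: the function name `viterbi_decode`, the tag set, and all scores are hypothetical, and in the described model the per-character scores would come from the BiLSTM layer rather than being written by hand.

```python
from typing import Dict, List

def viterbi_decode(
    emissions: List[Dict[str, float]],
    transitions: Dict[str, Dict[str, float]],
    tags: List[str],
) -> List[str]:
    """Return the highest-scoring tag sequence.

    emissions[i][t]   : score of tag t for character i (hypothetical values)
    transitions[p][t] : score of moving from tag p to tag t
    """
    if not emissions:
        return []
    # best[i][t] = best total score of any path ending in tag t at position i
    best = [{t: emissions[0][t] for t in tags}]
    back = []  # back[i-1][t] = best previous tag when position i has tag t
    for i in range(1, len(emissions)):
        scores, pointers = {}, {}
        for t in tags:
            prev = max(tags, key=lambda p: best[-1][p] + transitions[p][t])
            scores[t] = best[-1][prev] + transitions[prev][t] + emissions[i][t]
            pointers[t] = prev
        best.append(scores)
        back.append(pointers)
    # Follow the back-pointers from the best final tag to recover the path.
    path = [max(tags, key=lambda t: best[-1][t])]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path

# Hypothetical BIO example: every transition scores 0 except O -> I,
# which is penalized because an entity cannot continue from outside one.
TAGS = ["B", "I", "O"]
TRANSITIONS = {p: {t: 0.0 for t in TAGS} for p in TAGS}
TRANSITIONS["O"]["I"] = -10.0
EMISSIONS = [
    {"B": 1.0, "I": 0.0, "O": 2.0},
    {"B": 0.0, "I": 3.0, "O": 0.5},
    {"B": 0.0, "I": 0.0, "O": 1.0},
]
decoded = viterbi_decode(EMISSIONS, TRANSITIONS, TAGS)
```

In this toy input, choosing the top-scoring tag per character independently would give O, I, O, which breaks the BIO convention. Decoding over the whole sequence with the transition penalty instead selects B, I, O, which is the kind of constraint a CRF layer is meant to enforce.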


Pages: 12
