An imConvNet-based deep learning model for Chinese medical named entity recognition

被引:4
|
作者
Zheng, Yuchen [1 ]
Han, Zhenggong [2 ]
Cai, Yimin [1 ]
Duan, Xubo [1 ]
Sun, Jiangling [3 ]
Yang, Wei [1 ]
Huang, Haisong [2 ]
机构
[1] Guizhou Univ, Med Coll, Guiyang 550025, Guizhou, Peoples R China
[2] Guizhou Univ, Key Lab Adv Mfg Technol, Minist Educ, Guiyang 550025, Guizhou, Peoples R China
[3] Guiyang Hosp Stomatol, Guiyang 550002, Guizhou, Peoples R China
关键词
Named entity recognition; Convolutional neural network; Chinese electronic medical records; BiLSTM-CRF; BERT; BIG DATA; HEALTH; CARE;
D O I
10.1186/s12911-022-02049-4
中图分类号
R-058 [];
学科分类号
摘要
Background With the development of current medical technology, information management becomes perfect in the medical field. Medical big data analysis is based on a large amount of medical and health data stored in the electronic medical system, such as electronic medical records and medical reports. How to fully exploit the resources of information included in these medical data has always been the subject of research by many scholars. The basis for text mining is named entity recognition (NER), which has its particularities in the medical field, where issues such as inadequate text resources and a large number of professional domain terms continue to face significant challenges in medical NER. Methods We improved the convolutional neural network model (imConvNet) to obtain additional text features. Concurrently, we continue to use the classical Bert pre-training model and BiLSTM model for named entity recognition. We use imConvNet model to extract additional word vector features and improve named entity recognition accuracy. The proposed model, named BERT-imConvNet-BiLSTM-CRF, is composed of four layers: BERT embedding layer-getting word embedding vector; imConvNet layer-capturing the context feature of each character; BiLSTM (Bidirectional Long Short-Term Memory) layer-capturing the long-distance dependencies; CRF (Conditional Random Field) layer-labeling characters based on their features and transfer rules. Results The average F1 score on the public medical data set yidu-s4k reached 91.38% when combined with the classical model; when real electronic medical record text in impacted wisdom teeth is used as the experimental object, the model's F1 score is 93.89%. They all show better results than classical models. Conclusions The suggested novel model (imConvNet) significantly improves the recognition accuracy of Chinese medical named entities and applies to various medical corpora.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] A Named Entity Recognition Model Based on Entity Trigger Reinforcement Learning
    Wang, Ping
    Si, Nong
    Tong, Haopeng
    2022 IEEE 2ND INTERNATIONAL CONFERENCE ON COMPUTER COMMUNICATION AND ARTIFICIAL INTELLIGENCE (CCAI 2022), 2022, : 43 - 48
  • [22] Turkish Named Entity Recognition with Deep Learning
    Gunes, Asim
    Tantug, A. Cuneyd
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [23] Deep learning for named entity recognition: a survey
    Hu Z.
    Hou W.
    Liu X.
    Neural Comput. Appl., 16 (8995-9022): : 8995 - 9022
  • [24] A Chinese Medical Named Entity Recognition Method Based on Glyph Features
    Meng, Wei-Lun
    Guo, Jing-Feng
    Xing, Ke-Xuan
    Wei, Ning
    Wang, Qiao-Suo
    Liu, Bin
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2024, 52 (06): : 1945 - 1954
  • [25] Named Entity Recognition in Chinese Electronic Medical Records Based on CRF
    Liu, Kaixin
    Hu, Qingcheng
    Liu, Jianwei
    Xing, Chunxiao
    2017 14TH WEB INFORMATION SYSTEMS AND APPLICATIONS CONFERENCE (WISA 2017), 2017, : 105 - 110
  • [26] A hybrid model for Chinese named entity recognition
    Sun, Xiao
    Huang, Degen
    RECENT ADVANCE OF CHINESE COMPUTING TECHNOLOGIES, 2007, : 232 - 237
  • [27] A self-attention based neural architecture for Chinese medical named entity recognition
    Wan, Qian
    Liu, Jie
    Wei, Luona
    Ji, Bin
    MATHEMATICAL BIOSCIENCES AND ENGINEERING, 2020, 17 (04) : 3498 - 3511
  • [28] Chinese Named Entity Recognition Based on BERT and Lightweight Feature Extraction Model
    Yang, Ruisen
    Gan, Yong
    Zhang, Chenfang
    INFORMATION, 2022, 13 (11)
  • [29] A hybrid approach for named entity recognition in Chinese electronic medical record
    Bin Ji
    Rui Liu
    Shasha Li
    Jie Yu
    Qingbo Wu
    Yusong Tan
    Jiaju Wu
    BMC Medical Informatics and Decision Making, 19
  • [30] A Research Toward Chinese Named Entity Recognition Based on Transfer Learning
    Kang, Hui
    Xiao, Jingwu
    Zhang, Yunpeng
    Zhang, Lei
    Zhao, Xu
    Feng, Tie
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2023, 16 (01)