An imConvNet-based deep learning model for Chinese medical named entity recognition

被引:4
|
作者
Zheng, Yuchen [1 ]
Han, Zhenggong [2 ]
Cai, Yimin [1 ]
Duan, Xubo [1 ]
Sun, Jiangling [3 ]
Yang, Wei [1 ]
Huang, Haisong [2 ]
机构
[1] Guizhou Univ, Med Coll, Guiyang 550025, Guizhou, Peoples R China
[2] Guizhou Univ, Key Lab Adv Mfg Technol, Minist Educ, Guiyang 550025, Guizhou, Peoples R China
[3] Guiyang Hosp Stomatol, Guiyang 550002, Guizhou, Peoples R China
关键词
Named entity recognition; Convolutional neural network; Chinese electronic medical records; BiLSTM-CRF; BERT; BIG DATA; HEALTH; CARE;
D O I
10.1186/s12911-022-02049-4
中图分类号
R-058 [];
学科分类号
摘要
Background With the development of current medical technology, information management becomes perfect in the medical field. Medical big data analysis is based on a large amount of medical and health data stored in the electronic medical system, such as electronic medical records and medical reports. How to fully exploit the resources of information included in these medical data has always been the subject of research by many scholars. The basis for text mining is named entity recognition (NER), which has its particularities in the medical field, where issues such as inadequate text resources and a large number of professional domain terms continue to face significant challenges in medical NER. Methods We improved the convolutional neural network model (imConvNet) to obtain additional text features. Concurrently, we continue to use the classical Bert pre-training model and BiLSTM model for named entity recognition. We use imConvNet model to extract additional word vector features and improve named entity recognition accuracy. The proposed model, named BERT-imConvNet-BiLSTM-CRF, is composed of four layers: BERT embedding layer-getting word embedding vector; imConvNet layer-capturing the context feature of each character; BiLSTM (Bidirectional Long Short-Term Memory) layer-capturing the long-distance dependencies; CRF (Conditional Random Field) layer-labeling characters based on their features and transfer rules. Results The average F1 score on the public medical data set yidu-s4k reached 91.38% when combined with the classical model; when real electronic medical record text in impacted wisdom teeth is used as the experimental object, the model's F1 score is 93.89%. They all show better results than classical models. Conclusions The suggested novel model (imConvNet) significantly improves the recognition accuracy of Chinese medical named entities and applies to various medical corpora.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Deep learning with language models improves named entity recognition for PharmaCoNER
    Cong Sun
    Zhihao Yang
    Lei Wang
    Yin Zhang
    Hongfei Lin
    Jian Wang
    BMC Bioinformatics, 22
  • [42] Deep learning with language models improves named entity recognition for PharmaCoNER
    Sun, Cong
    Yang, Zhihao
    Wang, Lei
    Zhang, Yin
    Lin, Hongfei
    Wang, Jian
    BMC BIOINFORMATICS, 2021, 22 (SUPPL 1)
  • [43] A Sequence Transformation Model for Chinese Named Entity Recognition
    Wang, Qingyue
    Song, Yanjing
    Liu, Hao
    Cao, Yanan
    Liu, Yanbing
    Guo, Li
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 491 - 502
  • [44] Chinese Clinical Named Entity Recognition Based on Stroke ELMo and Multi-Task Learning
    Luo L.
    Yang Z.-H.
    Song Y.-W.
    Li N.
    Lin H.-F.
    Yang, Zhi-Hao (yangzh@dlut.edu.cn), 1943, Science Press (43): : 1943 - 1957
  • [45] Combined Attention Mechanism for Named Entity Recognition in Chinese Electronic Medical Records
    Li, Luqi
    Hou, Li
    2019 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2019, : 476 - 477
  • [46] Research on Named Entity Recognition for Chinese Medical Case Reports
    Wang, Yue
    Zhang, Xi
    PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023, 2023, : 1165 - 1169
  • [47] Local and global character representation enhanced model for Chinese medical named entity recognition
    Xiang, Yan
    Liu, Wei
    Guo, Junjun
    Zhang, Li
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (03) : 3779 - 3790
  • [48] A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity
    Dasgupta, Soham
    Piplai, Aritran
    Kotal, Anantaa
    Joshi, Anupam
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2596 - 2604
  • [49] Combining self learning and active learning for Chinese Named Entity Recognition
    Yao L.
    Sun C.
    Wang X.
    Wang X.
    Journal of Software, 2010, 5 (05) : 530 - 537
  • [50] Chinese Named Entity Recognition and Disambiguation Based on Wikipedia
    Yu Miao
    Lv Yajuan
    Liu Qun
    Su Jinsong
    Xiong Hao
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, 2012, 333 : 272 - 283