An imConvNet-based deep learning model for Chinese medical named entity recognition

被引：4

作者：

Zheng, Yuchen ^{[1
]}

Han, Zhenggong ^{[2
]}

Cai, Yimin ^{[1
]}

Duan, Xubo ^{[1
]}

Sun, Jiangling ^{[3
]}

Yang, Wei ^{[1
]}

Huang, Haisong ^{[2
]}

机构：

[1] Guizhou Univ, Med Coll, Guiyang 550025, Guizhou, Peoples R China

[2] Guizhou Univ, Key Lab Adv Mfg Technol, Minist Educ, Guiyang 550025, Guizhou, Peoples R China

[3] Guiyang Hosp Stomatol, Guiyang 550002, Guizhou, Peoples R China

来源：

BMC MEDICAL INFORMATICS AND DECISION MAKING | 2022年 / 22卷 / 01期

关键词：

Named entity recognition; Convolutional neural network; Chinese electronic medical records; BiLSTM-CRF; BERT; BIG DATA; HEALTH; CARE;

D O I：

10.1186/s12911-022-02049-4

中图分类号：

R-058 [];

学科分类号：

摘要：

Background With the development of current medical technology, information management becomes perfect in the medical field. Medical big data analysis is based on a large amount of medical and health data stored in the electronic medical system, such as electronic medical records and medical reports. How to fully exploit the resources of information included in these medical data has always been the subject of research by many scholars. The basis for text mining is named entity recognition (NER), which has its particularities in the medical field, where issues such as inadequate text resources and a large number of professional domain terms continue to face significant challenges in medical NER. Methods We improved the convolutional neural network model (imConvNet) to obtain additional text features. Concurrently, we continue to use the classical Bert pre-training model and BiLSTM model for named entity recognition. We use imConvNet model to extract additional word vector features and improve named entity recognition accuracy. The proposed model, named BERT-imConvNet-BiLSTM-CRF, is composed of four layers: BERT embedding layer-getting word embedding vector; imConvNet layer-capturing the context feature of each character; BiLSTM (Bidirectional Long Short-Term Memory) layer-capturing the long-distance dependencies; CRF (Conditional Random Field) layer-labeling characters based on their features and transfer rules. Results The average F1 score on the public medical data set yidu-s4k reached 91.38% when combined with the classical model; when real electronic medical record text in impacted wisdom teeth is used as the experimental object, the model's F1 score is 93.89%. They all show better results than classical models. Conclusions The suggested novel model (imConvNet) significantly improves the recognition accuracy of Chinese medical named entities and applies to various medical corpora.

引用

页数：12

共 50 条

[41] Deep learning with language models improves named entity recognition for PharmaCoNER
Cong Sun
Zhihao Yang
Lei Wang
Yin Zhang
Hongfei Lin
Jian Wang
BMC Bioinformatics, 22
[42] Deep learning with language models improves named entity recognition for PharmaCoNER
Sun, Cong
Yang, Zhihao
Wang, Lei
Zhang, Yin
Lin, Hongfei
Wang, Jian
BMC BIOINFORMATICS, 2021, 22 (SUPPL 1)
[43] A Sequence Transformation Model for Chinese Named Entity Recognition
Wang, Qingyue
Song, Yanjing
Liu, Hao
Cao, Yanan
Liu, Yanbing
Guo, Li
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2018), PT I, 2018, 11061 : 491 - 502
[44] Chinese Clinical Named Entity Recognition Based on Stroke ELMo and Multi-Task Learning
Luo L.
Yang Z.-H.
Song Y.-W.
Li N.
Lin H.-F.
Yang, Zhi-Hao (yangzh@dlut.edu.cn), 1943, Science Press (43): : 1943 - 1957
[45] Combined Attention Mechanism for Named Entity Recognition in Chinese Electronic Medical Records
Li, Luqi
Hou, Li
2019 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2019, : 476 - 477
[46] Research on Named Entity Recognition for Chinese Medical Case Reports
Wang, Yue
Zhang, Xi
PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023, 2023, : 1165 - 1169
[47] Local and global character representation enhanced model for Chinese medical named entity recognition
Xiang, Yan
Liu, Wei
Guo, Junjun
Zhang, Li
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (03) : 3779 - 3790
[48] A Comparative Study of Deep Learning based Named Entity Recognition Algorithms for Cybersecurity
Dasgupta, Soham
Piplai, Aritran
Kotal, Anantaa
Joshi, Anupam
2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2596 - 2604
[49] Combining self learning and active learning for Chinese Named Entity Recognition
Yao L.
Sun C.
Wang X.
Wang X.
Journal of Software, 2010, 5 (05) : 530 - 537
[50] Chinese Named Entity Recognition and Disambiguation Based on Wikipedia
Yu Miao
Lv Yajuan
Liu Qun
Su Jinsong
Xiong Hao
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, 2012, 333 : 272 - 283

← 1 2 3 4 5 →