Multi-task learning for Chinese clinical named entity recognition with external knowledge

被引:6
作者
Cheng, Ming [1 ]
Xiong, Shufeng [2 ,3 ]
Li, Fei [4 ]
Liang, Pan [5 ]
Gao, Jianbo [5 ]
机构
[1] Zhengzhou Univ, Dept Med Informat, Affiliated Hosp 1, Zhengzhou, Peoples R China
[2] Henan Agr Univ, Coll Informat, Zhengzhou, Peoples R China
[3] Henan Agr Univ, Coll Management Sci, Zhengzhou, Peoples R China
[4] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan, Peoples R China
[5] Zhengzhou Univ, Dept Radiol, Affiliated Hosp 1, Zhengzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Chinese clinical named entity recognition; Multi-task learning; Deep neural network; Dictionary features; MODEL; TEXT;
D O I
10.1186/s12911-021-01717-1
中图分类号
R-058 [];
学科分类号
摘要
Background Named entity recognition (NER) on Chinese electronic medical/healthcare records has attracted significantly attentions as it can be applied to building applications to understand these records. Most previous methods have been purely data-driven, requiring high-quality and large-scale labeled medical data. However, labeled data is expensive to obtain, and these data-driven methods are difficult to handle rare and unseen entities. Methods To tackle these problems, this study presents a novel multi-task deep neural network model for Chinese NER in the medical domain. We incorporate dictionary features into neural networks, and a general secondary named entity segmentation is used as auxiliary task to improve the performance of the primary task of named entity recognition. Results In order to evaluate the proposed method, we compare it with other currently popular methods, on three benchmark datasets. Two of the datasets are publicly available, and the other one is constructed by us. Experimental results show that the proposed model achieves 91.07% average f-measure on the two public datasets and 87.05% f-measure on private dataset. Conclusions The comparison results of different models demonstrated the effectiveness of our model. The proposed model outperformed traditional statistical models.
引用
收藏
页数:11
相关论文
共 41 条
  • [1] Aguilar G., 2017, P 3 WORKSH NOIS US G, P148, DOI DOI 10.18653/V1/W17-4419
  • [2] Prediction of blood culture outcome using hybrid neural network model based on electronic health records
    Cheng, Ming
    Zhao, Xiaolei
    Ding, Xianfei
    Gao, Jianbo
    Xiong, Shufeng
    Ren, Yafeng
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 3)
  • [3] A Hybrid Method to Extract Clinical Information From Chinese Electronic Medical Records
    Cheng, Ming
    Li, Liming
    Ren, Yafeng
    Lou, Yinxia
    Gao, Jianbo
    [J]. IEEE ACCESS, 2019, 7 : 70624 - 70633
  • [4] Curran J. R., 2003, P 7 C NAT LANG LEARN, P164, DOI DOI 10.3115/1119176.1119200
  • [5] Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition
    Dong, Chuanhai
    Zhang, Jiajun
    Zong, Chengqing
    Hattori, Masanori
    Di, Hui
    [J]. NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 : 239 - 250
  • [6] Dispatched attention with multi-task learning for nested mention recognition
    Fei, Hao
    Ren, Yafeng
    Ji, Donghong
    [J]. INFORMATION SCIENCES, 2020, 513 : 241 - 251
  • [7] Transfer learning for biomedical named entity recognition with neural networks
    Giorgi, John M.
    Bader, Gary D.
    [J]. BIOINFORMATICS, 2018, 34 (23) : 4087 - 4094
  • [8] Hu J, P CCKS 2017
  • [9] A hybrid approach for named entity recognition in Chinese electronic medical record
    Ji, Bin
    Liu, Rui
    Li, Shasha
    Yu, Jie
    Wu, Qingbo
    Tan, Yusong
    Wu, Jiaju
    [J]. BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (Suppl 2)
  • [10] Khan MAA, 2019, IEEE INT C BIOINFORM, P2541, DOI 10.1109/BIBM47256.2019.8983421