Multi-task learning for Chinese clinical named entity recognition with external knowledge

被引:8
作者
Cheng, Ming [1 ]
Xiong, Shufeng [2 ,3 ]
Li, Fei [4 ]
Liang, Pan [5 ]
Gao, Jianbo [5 ]
机构
[1] Zhengzhou Univ, Dept Med Informat, Affiliated Hosp 1, Zhengzhou, Peoples R China
[2] Henan Agr Univ, Coll Informat, Zhengzhou, Peoples R China
[3] Henan Agr Univ, Coll Management Sci, Zhengzhou, Peoples R China
[4] Wuhan Univ, Sch Cyber Sci & Engn, Wuhan, Peoples R China
[5] Zhengzhou Univ, Dept Radiol, Affiliated Hosp 1, Zhengzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
Chinese clinical named entity recognition; Multi-task learning; Deep neural network; Dictionary features; MODEL; TEXT;
D O I
10.1186/s12911-021-01717-1
中图分类号
R-058 [];
学科分类号
摘要
Background Named entity recognition (NER) on Chinese electronic medical/healthcare records has attracted significantly attentions as it can be applied to building applications to understand these records. Most previous methods have been purely data-driven, requiring high-quality and large-scale labeled medical data. However, labeled data is expensive to obtain, and these data-driven methods are difficult to handle rare and unseen entities. Methods To tackle these problems, this study presents a novel multi-task deep neural network model for Chinese NER in the medical domain. We incorporate dictionary features into neural networks, and a general secondary named entity segmentation is used as auxiliary task to improve the performance of the primary task of named entity recognition. Results In order to evaluate the proposed method, we compare it with other currently popular methods, on three benchmark datasets. Two of the datasets are publicly available, and the other one is constructed by us. Experimental results show that the proposed model achieves 91.07% average f-measure on the two public datasets and 87.05% f-measure on private dataset. Conclusions The comparison results of different models demonstrated the effectiveness of our model. The proposed model outperformed traditional statistical models.
引用
收藏
页数:11
相关论文
共 41 条
[1]  
[Anonymous], 2003, Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons
[2]  
[Anonymous], 2003, P 7 C NAT LANG LEARN, DOI DOI 10.3115/1119176.1119200
[3]  
[Anonymous], 2017, P 3 WORKSH NOIS US G, DOI DOI 10.18653/V1/W17-4419
[4]   Prediction of blood culture outcome using hybrid neural network model based on electronic health records [J].
Cheng, Ming ;
Zhao, Xiaolei ;
Ding, Xianfei ;
Gao, Jianbo ;
Xiong, Shufeng ;
Ren, Yafeng .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2020, 20 (Suppl 3)
[5]   A Hybrid Method to Extract Clinical Information From Chinese Electronic Medical Records [J].
Cheng, Ming ;
Li, Liming ;
Ren, Yafeng ;
Lou, Yinxia ;
Gao, Jianbo .
IEEE ACCESS, 2019, 7 :70624-70633
[6]   Character-Based LSTM-CRF with Radical-Level Features for Chinese Named Entity Recognition [J].
Dong, Chuanhai ;
Zhang, Jiajun ;
Zong, Chengqing ;
Hattori, Masanori ;
Di, Hui .
NATURAL LANGUAGE UNDERSTANDING AND INTELLIGENT APPLICATIONS (NLPCC 2016), 2016, 10102 :239-250
[7]   Dispatched attention with multi-task learning for nested mention recognition [J].
Fei, Hao ;
Ren, Yafeng ;
Ji, Donghong .
INFORMATION SCIENCES, 2020, 513 :241-251
[8]   Transfer learning for biomedical named entity recognition with neural networks [J].
Giorgi, John M. ;
Bader, Gary D. .
BIOINFORMATICS, 2018, 34 (23) :4087-4094
[9]  
Hu J, P CCKS 2017
[10]   A hybrid approach for named entity recognition in Chinese electronic medical record [J].
Ji, Bin ;
Liu, Rui ;
Li, Shasha ;
Yu, Jie ;
Wu, Qingbo ;
Tan, Yusong ;
Wu, Jiaju .
BMC MEDICAL INFORMATICS AND DECISION MAKING, 2019, 19 (Suppl 2)