Incorporating dictionaries into deep neural networks for the Chinese clinical named entity recognition

被引:82
作者
Wang, Qi [1 ]
Zhou, Yangming [1 ]
Ruan, Tong [1 ]
Gao, Daqi [1 ]
Xia, Yuhang [1 ]
He, Ping [2 ]
机构
[1] East China Univ Sci & Technol, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
[2] Shanghai Hosp, Dev Ctr, Shanghai 200040, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Clinical named entity recognition; Electronic health records; Deep neural network; Dictionary features; TEXT;
D O I
10.1016/j.jbi.2019.103133
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Clinical named entity recognition aims to identify and classify clinical terms such as diseases, symptoms, treatments, exams, and body parts in electronic health records, which is a fundamental and crucial task for clinical and translational research. In recent years, deep neural networks have achieved significant success in named entity recognition and many other natural language processing tasks. Most of these algorithms are trained end to end, and can automatically learn features from large scale labeled datasets. However, these data-driven methods typically lack the capability of processing rare or unseen entities. Previous statistical methods and feature engineering practice have demonstrated that human knowledge can provide valuable information for handling rare and unseen cases. In this paper, we propose a new model which combines data-driven deep learning approaches and knowledge-driven dictionary approaches. Specifically, we incorporate dictionaries into deep neural networks. In addition, two different architectures that extend the bi-directional long short-term memory neural network and five different feature representation schemes are also proposed to handle the task. Computational results on the CCKS-2017 Task 2 benchmark dataset show that the proposed method achieves the highly competitive performance compared with the state-of-the-art deep learning methods.
引用
收藏
页数:9
相关论文
共 43 条
[1]  
[Anonymous], BIOINFORMATICS
[2]  
[Anonymous], 2017, CEUR WORKSHOP PROC
[3]  
Cho Kyunghyun, 2014, C EMPIRICAL METHODS, P1724
[4]  
Duan JB, 2011, IEEE INT C BIO BIO W, P3, DOI 10.1109/BIBMW.2011.6112348
[5]  
Finkel J., 2004, JOINT WORKSHOP NATUR, P88
[6]   A GENERAL NATURAL-LANGUAGE TEXT PROCESSOR FOR CLINICAL RADIOLOGY [J].
FRIEDMAN, C ;
ALDERSON, PO ;
AUSTIN, JHM ;
CIMINO, JJ ;
JOHNSON, SB .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1994, 1 (02) :161-174
[7]  
Fukuda K, 1998, Pac Symp Biocomput, P707
[8]  
Gaizauskas R., 2000, 2 INT CONFERENCEON N, P37
[9]  
Graves A, 2005, IEEE IJCNN, P2047
[10]  
Graves A, 2012, STUD COMPUT INTELL, V385, P1, DOI [10.1162/neco.1997.9.1.1, 10.1007/978-3-642-24797-2]