Entity recognition in Chinese clinical text using attention-based CNN-LSTM-CRF

被引:35
作者
Tang, Buzhou [1 ]
Wang, Xiaolong [1 ]
Yan, Jun [2 ]
Chen, Qingcai [1 ]
机构
[1] Harbin Inst Technol, Key Lab Network Oriented Intelligent Computat, Shenzhen 518055, Peoples R China
[2] Yidu Cloud Beijing Technol Co Ltd, Beijing 100191, Peoples R China
关键词
Chinese clinical entity recognition; Neural network; Convolutional neural network; Long-short term memory; Conditional random field;
D O I
10.1186/s12911-019-0787-y
中图分类号
R-058 [];
学科分类号
摘要
BackgroundClinical entity recognition as a fundamental task of clinical text processing has been attracted a great deal of attention during the last decade. However, most studies focus on clinical text in English rather than other languages. Recently, a few researchers have began to study entity recognition in Chinese clinical text.MethodsIn this paper, a novel deep neural network, called attention-based CNN-LSTM-CRF, is proposed to recognize entities in Chinese clinical text. Attention-based CNN-LSTM-CRF is an extension of LSTM-CRF by introducing a CNN (convolutional neural network) layer after the input layer to capture local context information of words of interest and an attention layer before the CRF layer to select relevant words in the same sentence.ResultsIn order to evaluate the proposed method, we compare it with other two currently popular methods, CRF (conditional random field) and LSTM-CRF, on two benchmark datasets. One of the datasets is publically available and only contains contiguous clinical entities, and the other one is constructed by us and contains contiguous and discontiguous clinical entities. Experimental results show that attention-based CNN-LSTM-CRF outperforms CRF and LSTM-CRF.ConclusionsCNN and attention mechanism are individually beneficial to LSTM-CRF-based Chinese clinical entity recognition system, no matter whether contiguous clinical entities are considered. The conribution of attention mechanism is greater than CNN.
引用
收藏
页数:9
相关论文
共 50 条
  • [41] SKELETON-BASED ACTION RECOGNITION USING LSTM AND CNN
    Li, Chuankun
    Wang, Pichao
    Wang, Shuang
    Hou, Yonghong
    Li, Wanqing
    2017 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO WORKSHOPS (ICMEW), 2017,
  • [42] Named Entity Recognition for Biomedical Patent Text using Bi-LSTM Variants
    Saad, Farag
    IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 617 - 621
  • [43] Attention-Based Deep Neural Network and Its Application to Scene Text Recognition
    He, Haizhen
    Li, Jiehan
    2019 IEEE 11TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN 2019), 2019, : 672 - 677
  • [44] Attention-based hybrid CNN-LSTM and spectral data augmentation for COVID-19 diagnosis from cough sound
    Hamdi, Skander
    Oussalah, Mourad
    Moussaoui, Abdelouahab
    Saidi, Mohamed
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2022, 59 (02) : 367 - 389
  • [45] Attention-based hybrid CNN-LSTM and spectral data augmentation for COVID-19 diagnosis from cough sound
    Skander Hamdi
    Mourad Oussalah
    Abdelouahab Moussaoui
    Mohamed Saidi
    Journal of Intelligent Information Systems, 2022, 59 : 367 - 389
  • [46] A named entity recognition method towards product reviews based on BiLSTM-attention-CRF
    Zhang, Shunxiang
    Zhu, Haiyang
    Xu, Hanqing
    Zhu, Guangli
    Li, Kuan Ching
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (05) : 479 - 489
  • [47] Multisource learning for skeleton-based action recognition using deep LSTM and CNN
    Cui, Ran
    Zhu, Aichun
    Hua, Gang
    Yin, Hongsheng
    Liu, Haiqiang
    JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (04)
  • [48] Automatic cardiac arrhythmias classification using CNN and attention-based RNN network
    Sun, Jie
    HEALTHCARE TECHNOLOGY LETTERS, 2023, 10 (03) : 53 - 61
  • [49] An Attention Based Bi-LSTM DenseNet Model for Named Entity Recognition in English Texts
    VeeraSekharReddy, B.
    Rao, Koppula Srinivas
    Koppula, Neerja
    WIRELESS PERSONAL COMMUNICATIONS, 2023, 130 (02) : 1435 - 1448
  • [50] An Attention Based Bi-LSTM DenseNet Model for Named Entity Recognition in English Texts
    B. VeeraSekharReddy
    Koppula Srinivas Rao
    Neerja Koppula
    Wireless Personal Communications, 2023, 130 : 1435 - 1448