Entity recognition in Chinese clinical text using attention-based CNN-LSTM-CRF

被引:30
作者
Tang, Buzhou [1 ]
Wang, Xiaolong [1 ]
Yan, Jun [2 ]
Chen, Qingcai [1 ]
机构
[1] Harbin Inst Technol, Key Lab Network Oriented Intelligent Computat, Shenzhen 518055, Peoples R China
[2] Yidu Cloud Beijing Technol Co Ltd, Beijing 100191, Peoples R China
关键词
Chinese clinical entity recognition; Neural network; Convolutional neural network; Long-short term memory; Conditional random field;
D O I
10.1186/s12911-019-0787-y
中图分类号
R-058 [];
学科分类号
摘要
BackgroundClinical entity recognition as a fundamental task of clinical text processing has been attracted a great deal of attention during the last decade. However, most studies focus on clinical text in English rather than other languages. Recently, a few researchers have began to study entity recognition in Chinese clinical text.MethodsIn this paper, a novel deep neural network, called attention-based CNN-LSTM-CRF, is proposed to recognize entities in Chinese clinical text. Attention-based CNN-LSTM-CRF is an extension of LSTM-CRF by introducing a CNN (convolutional neural network) layer after the input layer to capture local context information of words of interest and an attention layer before the CRF layer to select relevant words in the same sentence.ResultsIn order to evaluate the proposed method, we compare it with other two currently popular methods, CRF (conditional random field) and LSTM-CRF, on two benchmark datasets. One of the datasets is publically available and only contains contiguous clinical entities, and the other one is constructed by us and contains contiguous and discontiguous clinical entities. Experimental results show that attention-based CNN-LSTM-CRF outperforms CRF and LSTM-CRF.ConclusionsCNN and attention mechanism are individually beneficial to LSTM-CRF-based Chinese clinical entity recognition system, no matter whether contiguous clinical entities are considered. The conribution of attention mechanism is greater than CNN.
引用
收藏
页数:9
相关论文
共 50 条
  • [1] Entity recognition in Chinese clinical text using attention-based CNN-LSTM-CRF
    Buzhou Tang
    Xiaolong Wang
    Jun Yan
    Qingcai Chen
    BMC Medical Informatics and Decision Making, 19
  • [2] Chinese Clinical Entity Recognition via Attention-based CNN-LSTM-CRF
    Liu, Zengjian
    Wang, Xiaolong
    Chen, Qingcai
    Tang, Buzhou
    2018 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS WORKSHOPS (ICHI-W), 2018, : 68 - 69
  • [3] Neural Chinese Named Entity Recognition via CNN-LSTM-CRF and Joint Training with Word Segmentation
    Wu, Fangzhao
    Liu, Junxin
    Wu, Chuhan
    Huang, Yongfeng
    Xie, Xing
    WEB CONFERENCE 2019: PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE (WWW 2019), 2019, : 3342 - 3348
  • [4] Attention-based LSTM, GRU and CNN for short text classification
    Yu, Shujuan
    Liu, Danlei
    Zhu, Wenfeng
    Zhang, Yun
    Zhao, Shengmei
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (01) : 333 - 340
  • [5] Named Entity Recognition From Biomedical Texts Using a Fusion Attention-Based BiLSTM-CRF
    Wei, Hao
    Gao, Mingyuan
    Zhou, Ai
    Chen, Fei
    Qu, Wen
    Wang, Chunli
    Lu, Mingyu
    IEEE ACCESS, 2019, 7 : 73627 - 73636
  • [6] Chinese News Text Classification based on Attention-based CNN-BiLSTM
    Wang, Meng
    Cai, Qiong
    Wang, Liya
    Li, Jun
    Wang, Xiaoke
    MIPPR 2019: PATTERN RECOGNITION AND COMPUTER VISION, 2020, 11430
  • [7] An Attention-Based ID-CNNs-CRF Model for Named Entity Recognition on Clinical Electronic Medical Records
    Gao, Ming
    Xiao, Qifeng
    Wu, Shaochun
    Deng, Kun
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 : 231 - 242
  • [8] Recognition of Ironic Sentences in Twitter using Attention-Based LSTM
    Martini, Andrianarisoa Tojo
    Farrukh, Makhmudov
    Ge, Hongwei
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2018, 9 (08) : 7 - 11
  • [9] Driver stress detection via multimodal fusion using attention-based CNN-LSTM
    Mou, Luntian
    Zhou, Chao
    Zhao, Pengfei
    Nakisa, Bahareh
    Rastgoo, Mohammad Naim
    Jain, Ramesh
    Gao, Wen
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 173
  • [10] A Korean named entity recognition method using Bi-LSTM-CRF and masked self-attention
    Jin, Guozhe
    Yu, Zhezhou
    Yu, Zhezhou (yuzz@jlu.edu.cn), 1600, Academic Press (65):