Entity recognition in Chinese clinical text using attention-based CNN-LSTM-CRF

Cited by: 30
Authors
Tang, Buzhou [1]
Wang, Xiaolong [1]
Yan, Jun [2]
Chen, Qingcai [1]
Affiliations
[1] Harbin Inst Technol, Key Lab Network Oriented Intelligent Computat, Shenzhen 518055, Peoples R China
[2] Yidu Cloud Beijing Technol Co Ltd, Beijing 100191, Peoples R China
Keywords
Chinese clinical entity recognition; Neural network; Convolutional neural network; Long-short term memory; Conditional random field
DOI
10.1186/s12911-019-0787-y
Chinese Library Classification
R-058
Abstract
Background: Clinical entity recognition, a fundamental task in clinical text processing, has attracted a great deal of attention over the last decade. However, most studies focus on clinical text in English rather than other languages. Recently, a few researchers have begun to study entity recognition in Chinese clinical text.
Methods: In this paper, a novel deep neural network, called attention-based CNN-LSTM-CRF, is proposed to recognize entities in Chinese clinical text. Attention-based CNN-LSTM-CRF extends LSTM-CRF by introducing a CNN (convolutional neural network) layer after the input layer to capture local context information of words of interest, and an attention layer before the CRF layer to select relevant words in the same sentence.
Results: To evaluate the proposed method, we compare it with two other currently popular methods, CRF (conditional random field) and LSTM-CRF, on two benchmark datasets. One dataset is publicly available and contains only contiguous clinical entities; the other was constructed by us and contains both contiguous and discontiguous clinical entities. Experimental results show that attention-based CNN-LSTM-CRF outperforms CRF and LSTM-CRF.
Conclusions: The CNN and the attention mechanism are each beneficial to the LSTM-CRF-based Chinese clinical entity recognition system, regardless of whether only contiguous clinical entities are considered. The contribution of the attention mechanism is greater than that of the CNN.
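To make the two added layers concrete, here is a minimal sketch in plain Python of a convolution step that mixes each token with its neighbours, followed by a sentence-level attention step. Everything specific in it is an assumption for illustration: the abstract does not state the kernel, the attention scoring function, or any dimensions, so the scalar kernel, the scaled dot-product scoring, and the toy vectors are hypothetical choices. The LSTM and CRF layers are omitted.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def conv1d_local_context(vectors, kernel):
    """Slide an odd-length scalar kernel along the token axis (zero padding).

    Assumption: one scalar weight per offset, shared across dimensions.
    A trained CNN layer would use learned filters instead.
    """
    half = len(kernel) // 2
    dim = len(vectors[0])
    out = []
    for i in range(len(vectors)):
        acc = [0.0] * dim
        for k, w in enumerate(kernel):
            j = i + k - half
            if 0 <= j < len(vectors):  # positions outside the sentence count as zero
                for d in range(dim):
                    acc[d] += w * vectors[j][d]
        out.append(acc)
    return out


def self_attention(hidden):
    """For each token, weight every token in the sentence by scaled dot product.

    Assumption: the scoring function is not given in the abstract;
    scaled dot-product scoring is used here purely as an example.
    """
    dim = len(hidden[0])
    contexts, all_weights = [], []
    for q in hidden:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in hidden]
        weights = softmax(scores)
        ctx = [sum(w * h[d] for w, h in zip(weights, hidden)) for d in range(dim)]
        contexts.append(ctx)
        all_weights.append(weights)
    return contexts, all_weights


# Toy 2-dimensional vectors for a 4-token sentence (made-up values).
embeddings = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
local = conv1d_local_context(embeddings, kernel=[0.25, 0.5, 0.25])
# A full model would run an LSTM over `local` here; this sketch skips it.
contexts, weights = self_attention(local)
```

Each row of `weights` sums to 1 and indicates how strongly one token attends to every other token in the same sentence. In the architecture the abstract describes, the attended representations would then be passed to the CRF layer for label decoding.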
Pages: 9