Deep neural models for ICD-10 coding of death certificates and autopsy reports in free-text

被引:37
作者
Duarte, Francisco [1 ]
Martins, Bruno [1 ]
Pinto, Catia Sousa [2 ]
Silva, Mario J. [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, Lisbon, Portugal
[2] Direcao Geral Saude, Lisbon, Portugal
关键词
Automated ICD coding; Clinical text mining; Deep learning; Natural language processing; Artificial intelligence in medicine;
D O I
10.1016/j.jbi.2018.02.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We address the assignment of ICD-10 codes for causes of death by analyzing free-text descriptions in death certificates, together with the associated autopsy reports and clinical bulletins, from the Portuguese Ministry of Health. We leverage a deep neural network that combines word embeddings, recurrent units, and neural attention, for the generation of intermediate representations of the textual contents. The neural network also explores the hierarchical nature of the input data, by building representations from the sequences of words within individual fields, which are then combined according to the sequences of fields that compose the inputs. Moreover, we explore innovative mechanisms for initializing the weights of the final nodes of the network, leveraging co-occurrences between classes together with the hierarchical structure of ICD-10. Experimental results attest to the contribution of the different neural network components. Our best model achieves accuracy scores over 89%, 81%, and 76%, respectively for ICD-10 chapters, blocks, and full-codes. Through examples, we also show that our method can produce interpretable results, useful for public health surveillance.
引用
收藏
页码:64 / 77
页数:14
相关论文
共 54 条
  • [1] [Anonymous], 2017, P 5 INT C LEARN REPR
  • [2] [Anonymous], P INT C MACH LEARN
  • [3] [Anonymous], 2016, ARXIV160506065
  • [4] [Anonymous], NATURE
  • [5] [Anonymous], 1994, PROC 20 INT C VERY L
  • [6] [Anonymous], 2016, IEEE T NEURAL NETWOR
  • [7] [Anonymous], 2015, 3 INT C LEARN REPR I
  • [8] [Anonymous], P C N AM CHAPT ASS C
  • [9] [Anonymous], 2015, CoRR
  • [10] [Anonymous], P IEEE C VIS AN SCI