Automated ICD-9-CM medical coding of diabetic patient's clinical reports

被引:1
作者
Pereira, Vitor [1 ]
Matos, Sergio [1 ]
Oliveira, Jose Luis [1 ]
机构
[1] Univ Aveiro, DETI IEETA, Aveiro, Portugal
来源
PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON DATA SCIENCE, E-LEARNING AND INFORMATION SYSTEMS 2018 (DATA'18) | 2018年
关键词
Electronic Health Records; Classification; Supervised Learning;
D O I
10.1145/3279996.3280019
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The assignment of ICD-9-CM codes to patient's clinical reports is a costly and wearing process manually done by medical personnel, estimated to cost about $25 billion per year in the United States. To develop a system that automates this process has been an ambition of researchers but is still an unsolved problem due to the inherent difficulties in processing unstructured clinical text. This problem is here formulated as a multi-label supervised learning one where the independent variable is the report's text and the dependent the several assigned ICD-9-CM labels. Different variations of two neural network based models, the Bag-of-Tricks and the Convolutional Neural Network (CNN) are investigated. The models are trained on the diabetic patient subset of the freely available MIMIC-III dataset. The results show that a CNN with three parallel convolutional layers achieves F-1 scores of 44.51% for five digit codes and 51.73% for three digit, rolled up, codes. Although fully automated coding is not achievable, these results suggest that automated classification could be used to aid clinical staff by selecting the most probable codes.
引用
收藏
页数:6
相关论文
共 14 条
[1]  
Ayyar Sandeep, 2017, TAGGING PATIENT NOTE
[2]  
Baumel T., 2017, MULTILABEL CLASSIFIC
[3]   Automatic construction of rule-based ICD-9-CM coding systems [J].
Farkas, Richard ;
Szarvas, Gyoergy .
BMC BIOINFORMATICS, 2008, 9 (Suppl 3)
[4]  
Goodfellow I, 2016, ADAPT COMPUT MACH LE, P1
[5]   MIMIC-III, a freely accessible critical care database [J].
Johnson, Alistair E. W. ;
Pollard, Tom J. ;
Shen, Lu ;
Lehman, Li-wei H. ;
Feng, Mengling ;
Ghassemi, Mohammad ;
Moody, Benjamin ;
Szolovits, Peter ;
Celi, Leo Anthony ;
Mark, Roger G. .
SCIENTIFIC DATA, 2016, 3
[6]  
Joulin A., 2016, CoRR
[7]  
Kingma D. P., P 3 INT C LEARN REPR
[8]  
Liu HF, 2001, J AM MED INFORM ASSN, P393
[9]  
Meystre S M, 2008, Yearb Med Inform, P128
[10]  
Mikolov T., 2013, ICLR, P3111