Unstructured Data in Predictive Process Monitoring: Lexicographic and Semantic Mapping to ICD-9-CM Codes for the Home Hospitalization Service

被引:3
作者
Ronzani, Massimiliano [3 ]
Ferrod, Roger [1 ]
Di Francescomarino, Chiara [3 ]
Sulis, Emilio [1 ]
Aringhieri, Roberto [1 ]
Boella, Guido [1 ]
Brunetti, Enrico [1 ,2 ]
Di Caro, Luigi [1 ]
Dragoni, Mauro [3 ]
Ghidini, Chiara [3 ]
Marinello, Renata [2 ]
机构
[1] Univ Turin, Turin, Italy
[2] City Hlth & Sci, Turin, Italy
[3] Fdn Bruno Kessler, Trento, Italy
来源
AIXIA 2021 - ADVANCES IN ARTIFICIAL INTELLIGENCE | 2022年 / 13196卷
关键词
Healthcare processes; Predictive process monitoring; Natural language processing; Home hospitalization service;
D O I
10.1007/978-3-031-08421-8_48
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The large availability of hospital administrative and clinical data has encouraged the application of Process Mining techniques to the healthcare domain. Predictive Process Monitoring techniques can be used in order to learn from these data related to past historical executions and predict the future of incomplete cases. However, some of these data, possibly the most informative ones, are often available in natural language text, while structured information-extracted from these data-would be more beneficial for training predictive models. In this paper we focus on the scenario of the Home Hospitalization Service, supporting the team in making decisions on the home hospitalization of a patient, by predicting whether it is likely that a new patient will successfully undergo home hospitalization. We aim at investigating whether, in this scenario, we can take advantage of mapping unstructured textual diagnoses, reported by the doctor in the Emergency Department, into structured information, as the standardized disease ICD-9-CM codes, to provide more accurate predictions. To this aim, we devise two different approaches involving respectively lexicographic and semantic distance for mapping textual diagnoses in ICD-9-CM codes and leverage the structured information for making predictions.
引用
收藏
页码:700 / 715
页数:16
相关论文
共 26 条
[1]   Integrating Structured and Unstructured Patient Data for ICD9 Disease Code Group Prediction [J].
Akshara, P. ;
Shidharth, S. ;
Krishnan, Gokul S. ;
Kamath, Sowmya S. .
CODS-COMAD 2021: PROCEEDINGS OF THE 3RD ACM INDIA JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE & MANAGEMENT OF DATA (8TH ACM IKDD CODS & 26TH COMAD), 2021, :436-436
[2]   A Process Mining Application for the Analysis of Hospital-at-Home Admissions [J].
Amantea, Ilaria Angela ;
Sulis, Emilio ;
Boella, Guido ;
Marinello, Renata ;
Bianca, Dario ;
Brunetti, Enrico ;
Bo, Mario ;
Fernandez-Llatas, Carlos .
DIGITAL PERSONALIZED HEALTH AND MEDICINE, 2020, 270 :522-526
[3]  
Aringhieri R., 2021, P WORKSHOP SMARTER H, V3060, P48
[4]   Automatic ICD-10 Classification of Diseases from Dutch Discharge Letters [J].
Bagheri, Ayoub ;
Sammani, Arjan ;
Van der Heijden, Peter G. M. ;
Asselbergs, Folkert W. ;
Oberski, Daniel L. .
PROCEEDINGS OF THE 13TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES, VOL 3: BIOINFORMATICS, 2020, :281-289
[5]   The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation [J].
Chicco, Davide ;
Jurman, Giuseppe .
BMC GENOMICS, 2020, 21 (01)
[6]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]   Predictive Process Monitoring Methods: Which One Suits Me Best? [J].
Di Francescomarino, Chiara ;
Ghidini, Chiara ;
Maggi, Fabrizio Maria ;
Milani, Fredrik .
BUSINESS PROCESS MANAGEMENT (BPM 2018), 2018, 11080 :462-479
[8]   Clustering-Based Predictive Process Monitoring [J].
Di Francescomarino, Chiara ;
Dumas, Marlon ;
Maggi, Fabrizio Maria ;
Teinemaa, Irene .
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2019, 12 (06) :896-909
[9]   A Deep Learning Method for ICD-10 Coding of Free-Text Death Certificates [J].
Duarte, Francisco ;
Martins, Bruno ;
Pinto, Catia Sousa ;
Silva, Mario J. .
PROGRESS IN ARTIFICIAL INTELLIGENCE (EPIA 2017), 2017, 10423 :137-149
[10]   FarSight: Long-Term Disease Prediction Using Unstructured Clinical Nursing Notes [J].
Gangavarapu, Tushaar ;
Krishnan, Gokul S. ;
Kamath, Sowmya S. ;
Jeganathan, Jayakumar .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2021, 9 (03) :1151-1169