Classification Models for Predicting Inflammatory Bowel Disease Healthcare Utilization

被引:0
作者
Babichenko, Dmitriy [1 ]
Rahdari, Behnam [1 ]
Stein, Ben [1 ]
Subramanian, Suraj [1 ]
Rivers, Claudia [2 ]
Tang, Gong [3 ]
Binion, David [2 ]
机构
[1] Univ Pittsburgh, Sch Comp & Informat, Pittsburgh, PA 15261 USA
[2] Univ Pittsburgh, Sch Publ Hlth, Pittsburgh, PA 15260 USA
[3] Univ Pittsburgh, Sch Med, Pittsburgh, PA USA
来源
HEALTHINF: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON BIOMEDICAL ENGINEERING SYSTEMS AND TECHNOLOGIES - VOL 5: HEALTHINF | 2021年
关键词
Inflammatory Bowel Disease; Healthcare Utilization; Machine Learning; Classification; Deep Learning; Clinical Decision Support Systems; CROHNS-DISEASE; ULCERATIVE-COLITIS; PREVALENCE; RISK; ASSOCIATION; COMORBIDITY; PATTERNS; ANEMIA; TIME;
D O I
10.5220/0010852100003123
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective. Inflammatory Bowel Disorders (IBD) is a group of gastric disorders that include well-known maladies such as Crohn's disease and Ulcerative Colitis (UC), as well as a number of other gastric disorders that are not well classified. Subgroups of patients contribute disproportionately to treatment costs. This work aims to create and evaluate machine learning models designed to use demographic and clinical predictors of IBD to predict which patients would fall into the "high healthcare utilization" category. Materials and Methods. A series of machine learning models were trained on a dataset extracted from a prospective natural history registry from a tertiary IBD center and associated healthcare charges. The models were trained to predict which patients are likely to have the highest healthcare utilization charges (top 15%). Results. A gradient-boosted trees classification model (accuracy 0.898, AUC 0.748) performed best out of the 12 evaluated modeling approaches. Conclusion. Classification models such as the ones evaluated in this work provide a reasonable basis for a clinical decision support system designed to predict which IBD patients are likely to become high expenditure.
引用
收藏
页码:154 / 161
页数:8
相关论文
共 39 条
[21]   A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms [J].
Lim, TS ;
Loh, WY ;
Shih, YS .
MACHINE LEARNING, 2000, 40 (03) :203-228
[22]   Update on the incidence and prevalence of Crohn's disease and ulcerative colitis in Olmsted county, Minnesota, 1940-2000 [J].
Loftus, Conor G. ;
Loftus, Edward V., Jr. ;
Harmsen, W. Scott ;
Zinsmeister, Alan R. ;
Tremaine, William J. ;
Melton, L. Joseph, III ;
Sandborn, William J. .
INFLAMMATORY BOWEL DISEASES, 2007, 13 (03) :254-261
[23]   Comparison of three data mining models for predicting diabetes or prediabetes by risk factors [J].
Meng, Xue-Hui ;
Huang, Yi-Xiang ;
Rao, Dong-Ping ;
Zhang, Qiu ;
Liu, Qing .
KAOHSIUNG JOURNAL OF MEDICAL SCIENCES, 2013, 29 (02) :93-99
[24]   The effect of functional gastrointestinal disorders on psychological comorbidity and quality of life in patients with inflammatory bowel disease [J].
Mikocka-Walus, A. A. ;
Turnbull, D. A. ;
Andrews, J. M. ;
Moulding, N. T. ;
Holtmann, G. J. .
ALIMENTARY PHARMACOLOGY & THERAPEUTICS, 2008, 28 (04) :475-483
[25]   Increasing Incidence and Prevalence of the Inflammatory Bowel Diseases With Time, Based on Systematic Review [J].
Molodecky, Natalie A. ;
Soon, Ing Shian ;
Rabi, Doreen M. ;
Ghali, William A. ;
Ferris, Mollie ;
Chernoff, Greg ;
Benchimol, Eric I. ;
Panaccione, Remo ;
Ghosh, Subrata ;
Barkema, Herman W. ;
Kaplan, Gilaad G. .
GASTROENTEROLOGY, 2012, 142 (01) :46-54
[26]   Learning hidden patterns from patient multivariate time series data using convolutional neural networks: A case study of healthcare cost prediction [J].
Morid, Mohammad Amin ;
Sheng, Olivia R. Liu ;
Kawamoto, Kensaku ;
Abdelrahman, Samir .
JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 111
[27]   Healthcare cost prediction: Leveraging fine-grain temporal patterns [J].
Morid, Mohammad Amin ;
Sheng, Olivia R. Liu ;
Kawamoto, Kensaku ;
Ault, Travis ;
Dorius, Josette ;
Abdelrahman, Samir .
JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 91
[28]  
Morid Mohammad Amin, 2017, AMIA Annu Symp Proc, V2017, P1312
[29]   Performance Analysis of Data Mining Classification Techniques to Predict Diabetes [J].
Perveen, Sajida ;
Shahbaz, Muhammad ;
Guergachi, Aziz ;
Keshavjee, Karim .
4TH SYMPOSIUM ON DATA MINING APPLICATIONS (SDMA2016), 2016, 82 :115-121
[30]  
Raghunathan TE., 2002, IVEWARE IMPUTATION V