Identifying inpatient mortality in MarketScan claims data using machine learning

被引:3
作者
Xie, Fenglong [1 ,2 ]
Beukelman, Timothy [2 ]
Sun, Dongmei [1 ]
Yun, Huifeng [1 ]
Curtis, Jeffrey R. [1 ,2 ]
机构
[1] Univ Alabama Birmingham, Dept Med, Div Clin Immunol & Rheumatol, Birmingham, AL USA
[2] Fdn Sci Technol Educ & Res FASTER, Birmingham, AL 35244 USA
基金
美国国家卫生研究院;
关键词
claims data; machine learning; mortality; IN-HOSPITAL MORTALITY; PREDICTION;
D O I
10.1002/pds.5658
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Purpose Inpatient mortality is an important variable in epidemiology studies using claims data. In 2016, MarketScan data began obscuring specific hospital discharge status types for patient privacy, including inpatient deaths, by setting the values to missing. We used a machine learning approach to correctly identify hospitalizations that resulted in inpatient death using data prior to 2016.Methods All hospitalizations from 2011 to 2015 with discharge status of missing, died, or one of the other subsequently obscured values were identified and divided into a training set and two test sets. Predictor variables included age, sex, elapsed time from hospital discharge until last observed claim and until healthcare plan disenrollment, and absence of any discharge diagnoses. Four machine learning methods were used to train statistical models and assess sensitivity and positive predictive value (PPV) for inpatient mortality.Results Overall 1 307 917 hospitalizations were included. All four machine learning approaches performed well in all datasets. Random forest performed best with 88% PPV and 93% sensitivity for the training set and both test sets. The two factors with the highest relative importance for identifying inpatient mortality were having no observed claims for the patient on days 2-91 following hospital discharge and patient disenrollment from the healthcare plan within 60 days following hospital discharge.Conclusion We successfully developed machine learning algorithms to identify inpatient mortality. This approach can be applied to obscured data to accurately identify inpatient mortality among hospitalizations with missing discharge status.
引用
收藏
页码:1299 / 1305
页数:7
相关论文
共 12 条
[1]  
[Anonymous], 2020, MMWR Morb Mortal Wkly Rep, V69, P611, DOI [10.15585/mmwr.mm6919a4, 10.15585/mmwr.mm6919a4]
[2]  
[Anonymous], IBM MARKETSCAN RES D
[3]   Prospective and External Evaluation of a Machine Learning Model to Predict In-Hospital Mortality of Adults at Time of Admission [J].
Brajer, Nathan ;
Cozzi, Brian ;
Gao, Michael ;
Nichols, Marshall ;
Revoir, Mike ;
Balu, Suresh ;
Futoma, Joseph ;
Bae, Jonathan ;
Setji, Noppon ;
Hernandez, Adrian ;
Sendak, Mark .
JAMA NETWORK OPEN, 2020, 3 (02)
[4]   Development and Evaluation of an Automated Machine Learning Algorithm for In-Hospital Mortality Risk Adjustment Among Critical Care Patients [J].
Delahanty, Ryan J. ;
Kaufman, David ;
Jones, Spencer S. .
CRITICAL CARE MEDICINE, 2018, 46 (06) :E481-E488
[5]   Predicting in-hospital mortality of patients with febrile neutropenia using machine learning models [J].
Du, Xinsong ;
Min, Jae ;
Shah, Chintan P. ;
Bishnoi, Rohit ;
Hogan, William R. ;
Lemas, Dominick J. .
INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2020, 139
[6]   SafeNET: Initial development and validation of a real-time tool for predicting mortality risk at the time of hospital transfer to a higher level of care [J].
Dunn, Stefanie C. Altieri ;
Bellon, Johanna E. ;
Bilderback, Andrew ;
Borrebach, Jeffrey D. ;
Hodges, Jacob C. ;
Wisniewski, Mary Kay ;
Harinstein, Matthew E. ;
Minnier, Tamra E. ;
Nelson, Joel B. ;
Hall, Daniel E. .
PLOS ONE, 2021, 16 (02)
[7]   Machine Learning Prediction Models for In-Hospital Mortality After Transcatheter Aortic Valve Replacement [J].
Hernandez-Suarez, Dagmar F. ;
Kim, Yeunjung ;
Villablanca, Pedro ;
Gupta, Tanush ;
Wiley, Jose ;
Nieves-Rodriguez, Brenda G. ;
Rodriguez-Maldonado, Jovaniel ;
Maldonado, Roberto Feliu ;
Sant'Ana, Istoni da Luz ;
Sanina, Cristina ;
Cox-Alomar, Pedro ;
Ramakrishna, Harish ;
Lopez-Candales, Angel ;
O'Neill, William W. ;
Pinto, Duane S. ;
Latib, Azeem ;
Roche-Lima, Abiel .
JACC-CARDIOVASCULAR INTERVENTIONS, 2019, 12 (14) :1328-1338
[8]   Imputation and characterization of uncoded self-harm in major mental illness using machine learning [J].
Kumar, Praveen ;
Nestsiarovich, Anastasiya ;
Nelson, Stuart J. ;
Kerner, Berit ;
Perkins, Douglas J. ;
Lambert, Christophe G. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2020, 27 (01) :136-146
[9]   Identifying the DEAD: Development and Validation of a Patient-Level Model to Predict Death Status in Population-Level Claims Data [J].
Reps, Jenna M. ;
Rijnbeek, Peter R. ;
Ryan, Patrick B. .
DRUG SAFETY, 2019, 42 (11) :1377-1386
[10]   Prediction of In-hospital Mortality in Emergency Department Patients With Sepsis: A Local Big Data-Driven, Machine Learning Approach [J].
Taylor, R. Andrew ;
Pare, Joseph R. ;
Venkatesh, Arjun K. ;
Mowafi, Hani ;
Melnick, Edward R. ;
Fleischman, William ;
Hall, M. Kennedy .
ACADEMIC EMERGENCY MEDICINE, 2016, 23 (03) :269-278