Fairness gaps in Machine learning models for hospitalization and emergency department visit risk prediction in home healthcare patients with heart failure

被引:0
作者
Davoudi, Anahita [1 ]
Chae, Sena [2 ]
Evans, Lauren [1 ]
Sridharan, Sridevi [1 ]
Song, Jiyoun [3 ]
Bowles, Kathryn H. [1 ,3 ]
McDonald, Margaret V. [1 ]
Topaz, Maxim [1 ,4 ,5 ]
机构
[1] VNS Hlth, Ctr Home Care Policy & Res, New York, NY 10017 USA
[2] Univ Iowa, Coll Nursing, Iowa City, IA USA
[3] Univ Penn, Sch Nursing, Dept Biobehav Hlth Sci, Philadelphia, PA USA
[4] Columbia Univ, Sch Nursing, New York, NY USA
[5] Columbia Univ, Data Sci Inst, New York, NY USA
基金
美国医疗保健研究与质量局;
关键词
Machine Learning; Socioeconomic Factors; Bias; Healthcare Disparities; Heart Failure; Home Care Services; OUTCOMES; BIAS;
D O I
10.1016/j.ijmedinf.2024.105534
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objectives: This study aims to evaluate the fairness performance metrics of Machine Learning (ML) models to predict hospitalization and emergency department (ED) visits in heart failure patients receiving home healthcare. We analyze biases, assess performance disparities, and propose solutions to improve model performance in diverse subpopulations. Methods: The study used a dataset of 12,189 episodes of home healthcare collected between 2015 and 2017, including structured (e.g., standard assessment tool) and unstructured data (i.e., clinical notes). ML risk prediction models, including Light Gradient-boosting model (LightGBM) and AutoGluon, were developed using demographic information, vital signs, comorbidities, service utilization data, and the area deprivation index (ADI) associated with the patient's home address. Fairness metrics, such as Equal Opportunity, Predictive Equality, Predictive Parity, and Statistical Parity, were calculated to evaluate model performance across subpopulations. Results: Our study revealed significant disparities in model performance across diverse demographic subgroups. For example, the Hispanic, Male, High-ADI subgroup excelled in terms of Equal Opportunity with a metric value of 0.825, which was 28% higher than the lowest-performing Other, Female, Low-ADI subgroup, which scored 0.644. In Predictive Parity, the gap between the highest and lowest-performing groups was 29%, and in Statistical Parity, the gap reached 69%. In Predictive Equality, the difference was 45%. Discussion and Conclusion: The findings highlight substantial differences in fairness metrics across diverse patient subpopulations in ML risk prediction models for heart failure patients receiving home healthcare services. Ongoing monitoring and improvement of fairness metrics are essential to mitigate biases.
引用
收藏
页数:7
相关论文
共 50 条
  • [31] Black Patients With Heart Failure Living in Distressed Communities Disproportionately Experience Excess Risk of Mortality After Emergency Department Visit for Dyspnea
    Kraevsky-Phillips, Karina
    Callaway, Clifton W.
    Henker, Richard A.
    Scott, Paul
    Al-Zaiti, Salah S.
    CIRCULATION, 2023, 148
  • [32] Unsupervised machine learning identifies symptoms of indigestion as a predictor of acute decompensation and adverse cardiac events in patients with heart failure presenting to the emergency department
    Kraevsky-Phillips, Karina
    Sereika, Susan M.
    Bouzid, Zeineb
    Hickey, Gavin
    Callaway, Clifton W.
    Saba, Samir
    Martin-Gill, Christian
    Al-Zaiti, Salah S.
    HEART & LUNG, 2023, 61 : 107 - 113
  • [33] Establishment and validation of a prediction nomogram for heart failure risk in patients with acute myocardial infarction during hospitalization
    Chen, Shengyue
    Pan, Xinling
    Mo, Jiahang
    Wang, Bin
    BMC CARDIOVASCULAR DISORDERS, 2023, 23 (01)
  • [34] Prediction of bacteremia at the emergency department during triage and disposition stages using machine learning models
    Choi, Dong Hyun
    Hong, Ki Jeong
    Park, Jeong Ho
    Shin, Sang Do
    Ro, Young Sun
    Song, Kyoung Jun
    Kim, Ki Hong
    Kim, Sungwan
    AMERICAN JOURNAL OF EMERGENCY MEDICINE, 2022, 53 : 86 - 93
  • [35] Emergency Heart Failure Mortality Risk Grade score performance for 7-day mortality prediction in patients with heart failure attended at the emergency department: validation in a Spanish cohort
    Gil, Victor
    Miro, Oscar
    Schull, Michael J.
    Llorens, Pere
    Herrero-Puente, Pablo
    Jacob, Javier
    Rios, Jose
    Lee, Douglas S.
    Martin-Sanchez, Francisco J.
    EUROPEAN JOURNAL OF EMERGENCY MEDICINE, 2018, 25 (03) : 169 - 177
  • [36] Barriers and Opportunities Regarding Implementation of a Machine Learning-Based Acute Heart Failure Risk Stratification Tool in the Emergency Department
    Sax, Dana R.
    Sturmer, Lillian R.
    Mark, Dustin G.
    Rana, Jamal S.
    Reed, Mary E.
    DIAGNOSTICS, 2022, 12 (10)
  • [37] Prediction Model Using Machine Learning for Mortality in Patients with Heart Failure
    Negassa, Abdissa
    Ahmed, Selim
    Zolty, Ronald
    Patel, Snehal R.
    AMERICAN JOURNAL OF CARDIOLOGY, 2021, 153 : 86 - 93
  • [38] The Price of Explainability in Machine Learning Models for 100-Day Readmission Prediction in Heart Failure: Retrospective, Comparative, Machine Learning Study
    Soliman, Amira
    Agvall, Bjorn
    Etminani, Kobra
    Hamed, Omar
    Lingman, Markus
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2023, 25
  • [39] Diuretic Resistance Prediction and Risk Factor Analysis of Patients with Heart Failure During Hospitalization
    Lu, Xiao
    Xin, Yi
    Zhu, Jiang
    Dong, Wei
    Guan, Tong-Peng
    Li, Jia-Yue
    Li, Qin
    GLOBAL HEART, 2022, 17 (01)
  • [40] Circulating microRNA-132 levels improve risk prediction for heart failure hospitalization in patients with chronic heart failure
    Masson, Serge
    Batkai, Sandor
    Beermann, Julia
    Baer, Christian
    Pfanne, Angelika
    Thum, Sabrina
    Magnoli, Michela
    Balconi, Giovanna
    Nicolosi, Gian Luigi
    Tavazzi, Luigi
    Latini, Roberto
    Thum, Thomas
    EUROPEAN JOURNAL OF HEART FAILURE, 2018, 20 (01) : 78 - 85