Evaluating Algorithmic Bias in 30-Day Hospital Readmission Models: Retrospective Analysis

Cited by: 2
Authors
Wang, H. Echo [1]
Weiner, Jonathan P. [1,2]
Saria, Suchi [3]
Kharrazi, Hadi [1,2]
Affiliations
[1] Johns Hopkins Univ, Bloomberg Sch Publ Hlth, 624 N Broadway, Hampton House, Baltimore, MD 21205 USA
[2] Johns Hopkins Ctr Populat Hlth Informat Technol, Baltimore, MD USA
[3] Johns Hopkins Univ, Whiting Sch Engn, Baltimore, MD USA
Funding
National Science Foundation (US); National Institutes of Health (US)
Keywords
algorithmic bias; model bias; predictive models; model fairness; health disparity; hospital readmission; retrospective analysis; ARTIFICIAL-INTELLIGENCE; MEDICARE BENEFICIARIES; HEALTH DISPARITIES; RISK; CARE; RACE; IMPLEMENTATION; VALIDATION; BLACKS; WHITES;
DOI
10.2196/47125
Chinese Library Classification number
R19 [Health care organizations and services (health services administration)]
Abstract
Background: The adoption of predictive algorithms in health care comes with the potential for algorithmic bias, which could exacerbate existing disparities. Fairness metrics have been proposed to measure algorithmic bias, but their application to real-world tasks is limited.
Objective: This study aims to evaluate the algorithmic bias associated with the application of common 30-day hospital readmission models and assess the usefulness and interpretability of selected fairness metrics.
Methods: We used 10.6 million adult inpatient discharges from Maryland and Florida from 2016 to 2019 in this retrospective study. Three models predicting 30-day hospital readmissions were evaluated: the LACE Index, the modified HOSPITAL score, and the modified Centers for Medicare & Medicaid Services (CMS) readmission measure. Each was applied as-is (using existing coefficients) and retrained (recalibrated with 50% of the data). Predictive performance and bias measures were evaluated for the overall population, between Black and White populations, and between low-income and other-income groups. Bias measures included the parity of false negative rate (FNR), false positive rate (FPR), 0-1 loss, and generalized entropy index. Racial bias represented by FNR and FPR differences was stratified to explore shifts in algorithmic bias in different populations.
Results: The retrained CMS model demonstrated the best predictive performance (area under the curve: 0.74 in Maryland and 0.68-0.70 in Florida), and the modified HOSPITAL score demonstrated the best calibration (Brier score: 0.16-0.19 in Maryland and 0.19-0.21 in Florida). Calibration was better in White (compared to Black) populations and other-income (compared to low-income) groups, and the area under the curve was higher or similar in the Black (compared to White) populations. The retrained CMS and modified HOSPITAL score had the lowest racial and income bias in Maryland. In Florida, both of these models overall had the lowest income bias and the modified HOSPITAL score showed the lowest racial bias. In both states, the White and higher-income populations showed a higher FNR, while the Black and low-income populations showed a higher FPR and a higher 0-1 loss. When stratified by hospital and population composition, these models demonstrated heterogeneous algorithmic bias in different contexts and populations.
Conclusions: Caution must be taken when interpreting fairness measures at face value. A higher FNR or FPR could potentially reflect missed opportunities or wasted resources, but these measures could also reflect health care use patterns and gaps in care. Simply relying on the statistical notions of bias could obscure or underplay the causes of health disparity. The imperfect health data, analytic frameworks, and the underlying health systems must be carefully considered. Fairness measures can serve as a useful routine assessment to detect disparate model performances but are insufficient to inform mechanisms or policy changes. However, such an assessment is an important first step toward data-driven improvement to address existing health disparities.
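The parity measures named in the Methods (FNR, FPR, and 0-1 loss compared between groups) can be illustrated with a minimal sketch. This is not the authors' code: the function names `group_error_rates` and `parity_gap`, the group labels "A" and "B", and the toy labels and predictions are all hypothetical, and the generalized entropy index is not covered here.

```python
from collections import defaultdict


def group_error_rates(y_true, y_pred, groups):
    """Compute FNR, FPR, and 0-1 loss separately for each group.

    y_true and y_pred hold binary labels (1 = readmitted within 30 days).
    """
    counts = defaultdict(lambda: {"tp": 0, "fn": 0, "fp": 0, "tn": 0})
    for truth, pred, group in zip(y_true, y_pred, groups):
        if truth == 1:
            key = "tp" if pred == 1 else "fn"
        else:
            key = "fp" if pred == 1 else "tn"
        counts[group][key] += 1

    rates = {}
    for group, c in counts.items():
        positives = c["tp"] + c["fn"]  # actual readmissions
        negatives = c["fp"] + c["tn"]  # actual non-readmissions
        total = positives + negatives
        rates[group] = {
            # FNR: share of actual readmissions the model missed
            "fnr": c["fn"] / positives if positives else float("nan"),
            # FPR: share of non-readmissions the model flagged
            "fpr": c["fp"] / negatives if negatives else float("nan"),
            # 0-1 loss: overall misclassification rate
            "zero_one_loss": (c["fn"] + c["fp"]) / total,
        }
    return rates


def parity_gap(rates, metric, group_a, group_b):
    """Difference in a metric between two groups; 0 means parity."""
    return rates[group_a][metric] - rates[group_b][metric]


# Hypothetical toy data, not drawn from the study.
y_true = [1, 1, 0, 0, 1, 1, 0, 0]
y_pred = [1, 0, 1, 0, 0, 0, 0, 0]
groups = ["A", "A", "A", "A", "B", "B", "B", "B"]

rates = group_error_rates(y_true, y_pred, groups)
fnr_gap = parity_gap(rates, "fnr", "A", "B")
fpr_gap = parity_gap(rates, "fpr", "A", "B")
loss_gap = parity_gap(rates, "zero_one_loss", "A", "B")
```

In this toy case the two groups have the same 0-1 loss even though their FNR and FPR differ in opposite directions, which is one reason the separate parity measures are reported side by side.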
Pages: 18