Evaluating Algorithmic Bias in 30-Day Hospital Readmission Models: Retrospective Analysis

Cited by: 2
Authors
Wang, H. Echo [1]
Weiner, Jonathan P. [1,2]
Saria, Suchi [3]
Kharrazi, Hadi [1,2]
Affiliations
[1] Johns Hopkins Univ, Bloomberg Sch Publ Hlth, 624 N Broadway,Hampton House, Baltimore, MD 21205 USA
[2] Johns Hopkins Ctr Populat Hlth Informat Technol, Baltimore, MD USA
[3] Johns Hopkins Univ, Whiting Sch Engn, Baltimore, MD USA
Funding
National Science Foundation (USA); National Institutes of Health (USA)
Keywords
algorithmic bias; model bias; predictive models; model fairness; health disparity; hospital readmission; retrospective analysis; ARTIFICIAL-INTELLIGENCE; MEDICARE BENEFICIARIES; HEALTH DISPARITIES; RISK; CARE; RACE; IMPLEMENTATION; VALIDATION; BLACKS; WHITES;
DOI
10.2196/47125
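Chinese Library Classification (CLC) number
R19 [Health care organizations and services (health services administration)]
Discipline classification code
Abstract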
Background: The adoption of predictive algorithms in health care comes with the potential for algorithmic bias, which could exacerbate existing disparities. Fairness metrics have been proposed to measure algorithmic bias, but their application to real-world tasks is limited.
Objective: This study aims to evaluate the algorithmic bias associated with the application of common 30-day hospital readmission models and assess the usefulness and interpretability of selected fairness metrics.
Methods: We used 10.6 million adult inpatient discharges from Maryland and Florida from 2016 to 2019 in this retrospective study. Models predicting 30-day hospital readmissions were evaluated: LACE Index, modified HOSPITAL score, and modified Centers for Medicare & Medicaid Services (CMS) readmission measure, which were applied as-is (using existing coefficients) and retrained (recalibrated with 50% of the data). Predictive performances and bias measures were evaluated for all, between Black and White populations, and between low- and other-income groups. Bias measures included the parity of false negative rate (FNR), false positive rate (FPR), 0-1 loss, and generalized entropy index. Racial bias represented by FNR and FPR differences was stratified to explore shifts in algorithmic bias in different populations.
Results: The retrained CMS model demonstrated the best predictive performance (area under the curve: 0.74 in Maryland and 0.68-0.70 in Florida), and the modified HOSPITAL score demonstrated the best calibration (Brier score: 0.16-0.19 in Maryland and 0.19-0.21 in Florida). Calibration was better in White (compared to Black) populations and other-income (compared to low-income) groups, and the area under the curve was higher or similar in the Black (compared to White) populations. The retrained CMS and modified HOSPITAL score had the lowest racial and income bias in Maryland. In Florida, both of these models overall had the lowest income bias and the modified HOSPITAL score showed the lowest racial bias. In both states, the White and higher-income populations showed a higher FNR, while the Black and low-income populations resulted in a higher FPR and a higher 0-1 loss. When stratified by hospital and population composition, these models demonstrated heterogeneous algorithmic bias in different contexts and populations.
Conclusions: Caution must be taken when interpreting fairness measures' face value. A higher FNR or FPR could potentially reflect missed opportunities or wasted resources, but these measures could also reflect health care use patterns and gaps in care. Simply relying on the statistical notions of bias could obscure or underplay the causes of health disparity. The imperfect health data, analytic frameworks, and the underlying health systems must be carefully considered. Fairness measures can serve as a useful routine assessment to detect disparate model performances but are insufficient to inform mechanisms or policy changes. However, such an assessment is an important first step toward data-driven improvement to address existing health disparities.
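The bias measures named in the abstract (parity of FNR, FPR, and 0-1 loss between two groups, the Brier score, and the generalized entropy index) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the toy data, and the group labels "A" and "B" are hypothetical, and the generalized entropy index uses an assumed benefit definition, b_i = prediction_i - label_i + 1 with alpha = 2, which the abstract does not specify.

```python
def error_rates(y_true, y_pred):
    """FNR, FPR, and 0-1 loss for binary labels and binary predictions."""
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    pos = sum(y_true)
    neg = len(y_true) - pos
    return {
        "fnr": fn / pos if pos else float("nan"),
        "fpr": fp / neg if neg else float("nan"),
        "zero_one_loss": (fn + fp) / len(y_true),
    }


def rate_gaps(y_true, y_pred, group, a, b):
    """Difference in each error rate between group a and group b (a minus b)."""
    def subset(g):
        rows = [(t, p) for t, p, s in zip(y_true, y_pred, group) if s == g]
        return [t for t, _ in rows], [p for _, p in rows]

    rates_a = error_rates(*subset(a))
    rates_b = error_rates(*subset(b))
    return {name: rates_a[name] - rates_b[name] for name in rates_a}


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome."""
    return sum((p - t) ** 2 for t, p in zip(y_true, y_prob)) / len(y_true)


def generalized_entropy_index(y_true, y_pred, alpha=2):
    """Inequality of per-person 'benefit' (assumed: b_i = pred - true + 1)."""
    benefits = [p - t + 1 for t, p in zip(y_true, y_pred)]
    mu = sum(benefits) / len(benefits)
    total = sum((b / mu) ** alpha - 1 for b in benefits)
    return total / (len(benefits) * alpha * (alpha - 1))


# Hypothetical toy data: 1 = readmitted within 30 days, 0 = not readmitted.
y_true = [1, 1, 0, 0, 1, 0, 0, 0]
y_pred = [1, 0, 0, 1, 1, 1, 1, 0]
group = ["A", "A", "A", "A", "B", "B", "B", "B"]

gaps = rate_gaps(y_true, y_pred, group, "A", "B")
print(gaps)
```

In this toy example a positive `fnr` gap means group A has more missed readmissions than group B, and a negative `fpr` gap means group B has more false alarms. As the conclusions note, such gaps flag disparate model performance but do not by themselves explain its cause.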
Pages: 18