Sex-Based Performance Disparities in Machine Learning Algorithms for Cardiac Disease Prediction: Exploratory Study

Cited by: 0
Authors
Straw, Isabel [1]
Affiliations
[1] UCL, 222 Euston Rd, London NW1 2DA, England
Funding
UK Research and Innovation;
Keywords
artificial intelligence; machine learning; cardiology; health care; health equity; medicine; cardiac; quantitative evaluation; inequality; cardiac disease; performance; sex; management; heart failure; WOMENS HEALTH; HEART-FAILURE; TRANSGENDER; GENDER; BIAS; CARE;
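DOI
10.2196/46936
Chinese Library Classification number
R19 [Health care organization and services (health services administration)];
Subject classification code
Abstract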
Background: The presence of bias in artificial intelligence has garnered increased attention, with inequities in algorithmic performance being exposed across the fields of criminal justice, education, and welfare services. In health care, the inequitable performance of algorithms across demographic groups may widen health inequalities.
Objective: Here, we identify and characterize bias in cardiology algorithms, looking specifically at algorithms used in the management of heart failure.
Methods: Stage 1 involved a literature search of PubMed and Web of Science for key terms relating to cardiac machine learning (ML) algorithms. Papers that built ML models to predict cardiac disease were evaluated for their focus on demographic bias in model performance, and open-source data sets were retained for our investigation. Two open-source data sets were identified: (1) the University of California Irvine Heart Failure data set and (2) the University of California Irvine Coronary Artery Disease data set. We reproduced existing algorithms that have been reported for these data sets, tested them for sex biases in algorithm performance, and assessed a range of remediation techniques for their efficacy in reducing inequities. Particular attention was paid to the false negative rate (FNR), due to the clinical significance of underdiagnosis and missed opportunities for treatment.
Results: In stage 1, our literature search returned 127 papers, with 60 meeting the criteria for a full review and only 3 papers highlighting sex differences in algorithm performance. In the papers that reported sex, there was a consistent underrepresentation of female patients in the data sets. No papers investigated racial or ethnic differences. In stage 2, we reproduced algorithms reported in the literature, achieving mean accuracies of 84.24% (SD 3.51%) for data set 1 and 85.72% (SD 1.75%) for data set 2 (random forest models). For data set 1, the FNR was higher for female patients in 13 out of 16 experiments, meeting the threshold of statistical significance (-17.81% to -3.37%; P<.05). A smaller disparity in the false positive rate was significant for male patients in 13 out of 16 experiments (-0.48% to +9.77%; P<.05). We observed an overprediction of disease for male patients (higher false positive rate) and an underprediction of disease for female patients (higher FNR). Sex differences in feature importance suggest that feature selection needs to be demographically tailored.
Conclusions: Our research exposes a significant gap in cardiac ML research, highlighting that the underperformance of algorithms for female patients has been overlooked in the published literature. Our study quantifies sex disparities in algorithmic performance and explores several sources of bias. We found an underrepresentation of female patients in the data sets used to train algorithms, identified sex biases in model error rates, and demonstrated that a series of remediation techniques were unable to address the inequities present.
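The abstract compares false negative and false positive rates between female and male patients. As a rough illustration of that kind of comparison, the following Python sketch computes both rates per group from binary labels. It is not the study's code: the function name `error_rates_by_group`, the toy labels, and the sign convention for the gap (male minus female) are all assumptions made for this example.

```python
from collections import defaultdict


def error_rates_by_group(y_true, y_pred, groups):
    """Return {group: {"fnr": ..., "fpr": ...}} for binary labels (1 = disease).

    Hypothetical helper for illustration; not taken from the study.
    """
    counts = defaultdict(lambda: {"tp": 0, "fn": 0, "fp": 0, "tn": 0})
    for truth, pred, group in zip(y_true, y_pred, groups):
        if truth == 1:
            key = "tp" if pred == 1 else "fn"
        else:
            key = "fp" if pred == 1 else "tn"
        counts[group][key] += 1

    rates = {}
    for group, c in counts.items():
        positives = c["tp"] + c["fn"]  # patients who truly have the disease
        negatives = c["fp"] + c["tn"]  # patients who truly do not
        rates[group] = {
            # FNR: share of true cases the model missed
            "fnr": c["fn"] / positives if positives else float("nan"),
            # FPR: share of healthy patients flagged as diseased
            "fpr": c["fp"] / negatives if negatives else float("nan"),
        }
    return rates


# Toy data (invented): 8 female and 8 male patients.
y_true = [1, 1, 1, 1, 0, 0, 0, 0, 1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 1, 0, 0, 0, 0, 0, 0, 1, 1, 1, 0, 1, 0, 0, 0]
sex = ["F"] * 8 + ["M"] * 8

rates = error_rates_by_group(y_true, y_pred, sex)
# Assumed convention: male rate minus female rate, so a negative FNR gap
# means the model misses more true cases among female patients.
fnr_gap = rates["M"]["fnr"] - rates["F"]["fnr"]
fpr_gap = rates["M"]["fpr"] - rates["F"]["fpr"]
print(rates, fnr_gap, fpr_gap)
```

In this toy example the model misses half of the female cases but a quarter of the male cases, while false positives occur only among male patients, mirroring the direction of the pattern the abstract describes. Testing whether such a gap is statistically significant across repeated experiments is a separate step not shown here.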
Pages: 18