Sex-Based Performance Disparities in Machine Learning Algorithms for Cardiac Disease Prediction: Exploratory Study

Cited by: 0

Authors:

Straw, Isabel [1]

Affiliations:

[1] UCL, 222 Euston Rd, London NW1 2DA, England

Source:

JOURNAL OF MEDICAL INTERNET RESEARCH | 2024 / Volume 26

Funding:

UK Research and Innovation;

Keywords:

artificial intelligence; machine learning; cardiology; health care; health equity; medicine; cardiac; quantitative evaluation; inequality; cardiac disease; performance; sex; management; heart failure; WOMENS HEALTH; HEART-FAILURE; TRANSGENDER; GENDER; BIAS; CARE;

DOI:

10.2196/46936

Chinese Library Classification number:

R19 [Health care organization and services (health services administration)];

Subject classification number:

Abstract:

Background: The presence of bias in artificial intelligence has garnered increased attention, with inequities in algorithmic performance being exposed across the fields of criminal justice, education, and welfare services. In health care, the inequitable performance of algorithms across demographic groups may widen health inequalities.

Objective: Here, we identify and characterize bias in cardiology algorithms, looking specifically at algorithms used in the management of heart failure.

Methods: Stage 1 involved a literature search of PubMed and Web of Science for key terms relating to cardiac machine learning (ML) algorithms. Papers that built ML models to predict cardiac disease were evaluated for their focus on demographic bias in model performance, and open-source data sets were retained for our investigation. Two open-source data sets were identified: (1) the University of California Irvine Heart Failure data set and (2) the University of California Irvine Coronary Artery Disease data set. We reproduced existing algorithms that have been reported for these data sets, tested them for sex biases in algorithm performance, and assessed a range of remediation techniques for their efficacy in reducing inequities. Particular attention was paid to the false negative rate (FNR), due to the clinical significance of underdiagnosis and missed opportunities for treatment.

Results: In stage 1, our literature search returned 127 papers, with 60 meeting the criteria for a full review and only 3 papers highlighting sex differences in algorithm performance. In the papers that reported sex, there was a consistent underrepresentation of female patients in the data sets. No papers investigated racial or ethnic differences. In stage 2, we reproduced algorithms reported in the literature, achieving mean accuracies of 84.24% (SD 3.51%) for data set 1 and 85.72% (SD 1.75%) for data set 2 (random forest models). For data set 1, the FNR was higher for female patients in 13 out of 16 experiments, meeting the threshold of statistical significance (-17.81% to -3.37%; P<.05). A smaller disparity in the false positive rate was significant for male patients in 13 out of 16 experiments (-0.48% to +9.77%; P<.05). We observed an overprediction of disease for male patients (higher false positive rate) and an underprediction of disease for female patients (higher FNR). Sex differences in feature importance suggest that feature selection needs to be demographically tailored.

Conclusions: Our research exposes a significant gap in cardiac ML research, highlighting that the underperformance of algorithms for female patients has been overlooked in the published literature. Our study quantifies sex disparities in algorithmic performance and explores several sources of bias. We found an underrepresentation of female patients in the data sets used to train algorithms, identified sex biases in model error rates, and demonstrated that a series of remediation techniques were unable to address the inequities present.
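
The per-sex error rates that the abstract reports (FNR and false positive rate) can be illustrated with a minimal Python sketch. This is not the author's code: the function name `error_rates_by_group`, the group labels, and the toy labels and predictions are all hypothetical and chosen only to show how the two rates are computed separately for each group.

```python
from collections import defaultdict


def error_rates_by_group(y_true, y_pred, groups):
    """Return {group: {"fnr": ..., "fpr": ...}} for binary labels (1 = disease).

    Hypothetical helper for illustration; not taken from the study.
    """
    counts = defaultdict(lambda: {"tp": 0, "fn": 0, "fp": 0, "tn": 0})
    for truth, pred, group in zip(y_true, y_pred, groups):
        if truth == 1:
            key = "tp" if pred == 1 else "fn"  # missed disease counts as a false negative
        else:
            key = "fp" if pred == 1 else "tn"  # disease predicted for a healthy patient
        counts[group][key] += 1

    rates = {}
    for group, c in counts.items():
        positives = c["tp"] + c["fn"]
        negatives = c["fp"] + c["tn"]
        rates[group] = {
            # FNR = FN / (FN + TP); FPR = FP / (FP + TN)
            "fnr": c["fn"] / positives if positives else float("nan"),
            "fpr": c["fp"] / negatives if negatives else float("nan"),
        }
    return rates


# Toy data only (assumed for illustration, not from either data set).
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
y_pred = [1, 0, 1, 1, 0, 0, 1, 0]
sex = ["F", "F", "M", "M", "F", "F", "M", "M"]

rates = error_rates_by_group(y_true, y_pred, sex)
fnr_gap = rates["F"]["fnr"] - rates["M"]["fnr"]  # positive gap: more missed cases for "F"
print(rates, fnr_gap)
```

In this toy example the model misses more true cases in group "F" (higher FNR) and raises more false alarms in group "M" (higher false positive rate), which is the pattern the abstract describes. Testing whether such a gap is statistically significant across repeated experiments would be a separate step.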

Pages: 18
