Investigating for bias in healthcare algorithms: a sex-stratified analysis of supervised machine learning models in liver disease prediction

Cited by: 29
Authors
Straw, Isabel [1]
Wu, Honghan [1]
Affiliations
[1] UCL, Inst Hlth Informat, London, England
Funding
UK Engineering and Physical Sciences Research Council (EPSRC); UK Research and Innovation (UKRI);
Keywords
Artificial intelligence; BMJ Health Informatics; Health Equity; Machine Learning; Public health informatics; ASSOCIATION; GENDER;
DOI
10.1136/bmjhci-2021-100457
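Chinese Library Classification number
R19 [Health care organisations and services (health services administration)];
Discipline classification code
Abstract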
Objectives: The Indian Liver Patient Dataset (ILPD) is used extensively to create algorithms that predict liver disease. Given the existing research describing demographic inequities in liver disease diagnosis and management, these algorithms require scrutiny for potential biases. We address this overlooked issue by investigating ILPD models for sex bias.
Methods: Following our literature review of ILPD papers, the models reported in existing studies are recreated and then interrogated for bias. We define four experiments, training on sex-unbalanced/balanced data, with and without feature selection. We build random forest (RF), support vector machine (SVM), Gaussian Naive Bayes and logistic regression (LR) classifiers, running each experiment 100 times and reporting average results with SD.
Results: We reproduce published models achieving accuracies of >70%, ranging from 71.31% (SD 2.37) for LR to 79.40% (SD 2.50) for SVM, and demonstrate a previously unobserved performance disparity. Across all classifiers, females suffer from a higher false negative rate (FNR). Presently, RF and LR classifiers are reported as the most effective models, yet in our experiments they demonstrate the greatest FNR disparity (RF: -21.02%; LR: -24.07%).
Discussion: We demonstrate a sex disparity that exists in published ILPD classifiers. In practice, the higher FNR for females would manifest as increased rates of missed diagnosis for female patients and a consequent lack of appropriate care. Our study demonstrates that evaluating biases in the initial stages of machine learning can provide insights into inequalities in current clinical practice, reveal pathophysiological differences between males and females, and mitigate the digitisation of inequalities into algorithmic systems.
Conclusion: Our findings are important to medical data scientists, clinicians and policy-makers involved in the implementation of medical artificial intelligence systems. An awareness of the potential biases of these systems is essential in preventing the digital exacerbation of healthcare inequalities.
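The sex-stratified false negative rate comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the helper names, the toy labels and the sign convention (male FNR minus female FNR, so a negative value means females are missed more often) are all assumptions made for illustration.

```python
from collections import defaultdict


def false_negative_rate(y_true, y_pred):
    """FNR = FN / (FN + TP), computed over the truly positive cases only."""
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    if fn + tp == 0:
        return float("nan")  # no positive cases in this group
    return fn / (fn + tp)


def fnr_by_group(y_true, y_pred, groups):
    """Split labels and predictions by group, then compute FNR per group."""
    buckets = defaultdict(lambda: ([], []))
    for t, p, g in zip(y_true, y_pred, groups):
        buckets[g][0].append(t)
        buckets[g][1].append(p)
    return {g: false_negative_rate(ts, ps) for g, (ts, ps) in buckets.items()}


# Toy data (hypothetical, not from the ILPD): 1 = liver disease, 0 = no disease.
y_true = [1, 1, 1, 1, 0, 1, 1, 1, 1, 0]
y_pred = [1, 1, 1, 0, 0, 1, 0, 0, 1, 0]
sex = ["M"] * 5 + ["F"] * 5

rates = fnr_by_group(y_true, y_pred, sex)
# Assumed sign convention: male FNR minus female FNR.
disparity = rates["M"] - rates["F"]
print(rates, disparity)
```

In a study like the one summarised here, the predictions would come from a trained classifier evaluated on held-out data, and the per-group rates would be averaged over repeated runs rather than computed once.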
Pages: 8
Related papers
30 items in total
  • [1] Adil SH, 2018, 2018 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION SCIENCES (ICCOINS)
  • [2] [Anonymous], 2019, VIKALPA
  • [3] Aswathy C, 2018, LIVER PATIENT DATASE
  • [4] Auxilia LA, 2018, 2018 2 INT C TRENDS, P45, DOI 10.1109/ICOEI.2018.8553682
  • [5] The burden of liver disease in Europe: A review of available epidemiological data
    Blachier, Martin
    Leleu, Henri
    Peck-Radosavljevic, Markus
    Valla, Dominique-Charles
    Roudot-Thoraval, Francoise
    [J]. JOURNAL OF HEPATOLOGY, 2013, 58 (03) : 593 - 608
  • [6] Sex and gender differences and biases in artificial intelligence for biomedicine and healthcare
    Cirillo, Davide
    Catuara-Solarz, Silvina
    Morey, Czuee
    Guney, Emre
    Subirats, Laia
    Mellino, Simona
    Gigante, Annalisa
    Valencia, Alfonso
    Rementeria, Maria Jose
    Chadha, Antonella Santuccione
    Mavridis, Nikolaos
    [J]. NPJ DIGITAL MEDICINE, 2020, 3 (01)
  • [7] Cleghorn Elinor, 2021, UNWELL WOMEN MISDIAG
  • [8] Dua D, 2019, UCI MACHINE LEARNING
  • [9] Health inequities and the inappropriate use of race in nephrology
    Eneanya, Nwamaka D.
    Boulware, L. Ebony
    Tsai, Jennifer
    Bruce, Marino A.
    Ford, Chandra L.
    Harris, Christina
    Morales, Leo S.
    Ryan, Michael J.
    Reese, Peter P.
    Thorpe, Roland J., Jr.
    Morse, Michelle
    Walker, Valencia
    Arogundade, Fatiu A.
    Lopes, Antonio A.
    Norris, Keith C.
    [J]. NATURE REVIEWS NEPHROLOGY, 2022, 18 (02) : 84 - 94
  • [10] Sex differences in the association between albumin and all-cause and vascular mortality
    Grimm, G.
    Haslacher, H.
    Kampitsch, T.
    Endler, G.
    Marsik, C.
    Schickbauer, T.
    Wagner, O.
    Jilma, B.
    [J]. EUROPEAN JOURNAL OF CLINICAL INVESTIGATION, 2009, 39 (10) : 860 - 865