A critical moment in machine learning in medicine: on reproducible and interpretable learning

Cited by: 20

Authors:

Ciobanu-Caraus, Olga ^{[1]}

Aicher, Anatol ^{[1]}

Kernbach, Julius M. ^{[2]}

Regli, Luca ^{[1]}

Serra, Carlo ^{[1]}

Staartjes, Victor E. ^{[1]}

Affiliations:

[1] Univ Zurich, Univ Hosp Zurich, Clin Neurosci Ctr, Dept Neurosurg, Machine Intelligence Clin Neurosci, Zurich, Switzerland

[2] Univ Hosp Heidelberg, Dept Neuroradiol, Heidelberg, Germany

Source:

ACTA NEUROCHIRURGICA | 2024 / Vol. 166 / Issue 01

Keywords:

Machine learning; Reproducibility; Interpretability; Methodology; PREDICTION; RISK;

DOI:

10.1007/s00701-024-05892-8

Chinese Library Classification (CLC) number:

R74 [Neurology and Psychiatry];

Subject classification code:

Abstract:

Over the past two decades, advances in computational power and data availability combined with increased accessibility to pre-trained models have led to an exponential rise in machine learning (ML) publications. While ML may have the potential to transform healthcare, this sharp increase in ML research output without focus on methodological rigor and standard reporting guidelines has fueled a reproducibility crisis. In addition, the rapidly growing complexity of these models compromises their interpretability, which currently impedes their successful and widespread clinical adoption. In medicine, where failure of such models may have severe implications for patients' health, the high requirements for accuracy, robustness, and interpretability confront ML researchers with a unique set of challenges. In this review, we discuss the semantics of reproducibility and interpretability, as well as related issues and challenges, and outline possible solutions for counteracting the "black box". To foster reproducibility, standard reporting guidelines need to be further developed and data or code sharing encouraged. Editors and reviewers may equally play a critical role by establishing high methodological standards and thus preventing the dissemination of low-quality ML publications. To foster interpretable learning, the use of simpler models more suitable for medical data can show the clinician how results are generated from input data. Model-agnostic explanation tools, sensitivity analysis, and hidden layer representations constitute further promising approaches to increase interpretability. Balancing model performance and interpretability is important to ensure clinical applicability. We have now reached a critical moment for ML in medicine, where addressing these issues and implementing appropriate solutions will be vital for the future evolution of the field.


Pages: 7

