Shifting machine learning for healthcare from development to deployment and from models to data

被引：203

作者：

Zhang, Angela ^{[1
,2
,3
,4
]}

Xing, Lei ^{[5
]}

Zou, James ^{[4
,6
]}

Wu, Joseph C. ^{[1
,3
,7
,8
]}

机构：

[1] Stanford Univ, Sch Med, Stanford Cardiovasc Inst, Stanford, CA 94305 USA

[2] Stanford Univ, Sch Med, Dept Genet, Stanford, CA 94305 USA

[3] Greenstone Biosci, Palo Alto, CA 94304 USA

[4] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

[5] Stanford Univ, Sch Med, Dept Radiat Oncol, Stanford, CA USA

[6] Stanford Univ, Sch Med, Dept Biomed Informat, Stanford, CA USA

[7] Stanford Univ, Dept Med, Div Cardiovasc Med, Stanford, CA 94305 USA

[8] Stanford Univ, Sch Med, Dept Radiol, Stanford, CA 94305 USA

来源：

NATURE BIOMEDICAL ENGINEERING | 2022年 / 6卷 / 12期

基金：

美国国家科学基金会; 美国国家卫生研究院;

关键词：

ARTIFICIAL-INTELLIGENCE; DIABETIC-RETINOPATHY; DEEP; PERFORMANCE; AI; VALIDATION; ALGORITHM; FRAMEWORK; NETWORKS; MEDICINE;

D O I：

10.1038/s41551-022-00898-y

中图分类号：

R318 [生物医学工程];

学科分类号：

0831 ;

摘要：

This Review discusses the use of deep generative models, federated learning and transformer models to address challenges in the deployment of machine learning for healthcare. In the past decade, the application of machine learning (ML) to healthcare has helped drive the automation of physician tasks as well as enhancements in clinical capabilities and access to care. This progress has emphasized that, from model development to model deployment, data play central roles. In this Review, we provide a data-centric view of the innovations and challenges that are defining ML for healthcare. We discuss deep generative models and federated learning as strategies to augment datasets for improved model performance, as well as the use of the more recent transformer models for handling larger datasets and enhancing the modelling of clinical text. We also discuss data-focused problems in the deployment of ML, emphasizing the need to efficiently deliver data to ML models for timely clinical predictions and to account for natural data shifts that can deteriorate model performance.

引用

页码：1330 / 1345

页数：16

共 185 条

[1] Deep Learning with Differential Privacy [J].

Abadi, Martin ;

Chu, Andy ;

Goodfellow, Ian ;

McMahan, H. Brendan ;

Mironov, Ilya ;

Talwar, Kunal ;

Zhang, Li .

CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, :308-318

[2] Large language models associate Muslims with violence [J].

Abid, Abubakar ;

Farooqi, Maheen ;

Zou, James .

NATURE MACHINE INTELLIGENCE, 2021, 3 (06) :461-463

[3] Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices [J].

Abramoff, Michael D. ;

Lavin, Philip T. ;

Birch, Michele ;

Shah, Nilay ;

Folk, James C. .

NPJ DIGITAL MEDICINE, 2018, 1

[4] Machine Learning and Health Care Disparities in Dermatology [J].

Adamson, Adewole S. ;

Smith, Avery .

JAMA DERMATOLOGY, 2018, 154 (11) :1247-1248

[5]

Alsentzer E., 2019, 2 CLIN NATURAL LANG, DOI [DOI 10.18653/V1/W19-1909, 10.18653/v1/W19-1909]

[6]

[Anonymous], 2017, DAT DRIV HEALTHC ORG

[7]

Arjovsky M, 2017, PR MACH LEARN RES, V70

[8] MedGAN: Medical image translation using GANs [J].

Armanious, Karim ;

Jiang, Chenming ;

Fischer, Marc ;

Kuestner, Thomas ;

Nikolaou, Konstantin ;

Gatidis, Sergios ;

Yang, Bin .

COMPUTERIZED MEDICAL IMAGING AND GRAPHICS, 2020, 79

[9]

Augenstein S., 2020, ICLR 2020 8 INT C LE

[10]

Bahdanau D, 2016, Arxiv, DOI arXiv:1409.0473

← 1 2 3 4 5 6 7 8 9 10 →