Randomized Clinical Trials of Machine Learning Interventions in Health Care: A Systematic Review

Cited by: 117
Authors
Plana, Deborah [2]
Shung, Dennis L. [3]
Grimshaw, Alyssa A. [4]
Saraf, Anurag [5]
Sung, Joseph J. Y. [6]
Kann, Benjamin H. [1]
Affiliations
[1] Harvard Med Sch, Brigham & Womens Hosp, Artificial Intelligence Med Program, 221 Longwood Ave,Suite 442, Boston, MA 02115 USA
[2] Harvard Med Sch, Boston, MA 02115 USA
[3] Yale Univ, Dept Med, New Haven, CT 06520 USA
[4] Yale Univ, Harvey Cushing John Hay Whitney Med Lib, New Haven, CT USA
[5] Massachusetts Gen Hosp, Dept Radiat Oncol, Boston, MA 02114 USA
[6] Nanyang Technol Univ, Lee Kong Chian Sch Med, Singapore, Singapore
Funding
National Institutes of Health (United States)
Keywords
COMPUTER-AIDED DETECTION; ARTIFICIAL-INTELLIGENCE; MISS RATE; MULTICENTER; COLONOSCOPY; NEOPLASIA;
DOI
10.1001/jamanetworkopen.2022.33946
Chinese Library Classification number
R5 [Internal Medicine]
Discipline classification codes
1002; 100201
Abstract
IMPORTANCE Despite the potential of machine learning to improve multiple aspects of patient care, barriers to clinical adoption remain. Randomized clinical trials (RCTs) are often a prerequisite to large-scale clinical adoption of an intervention, and important questions remain regarding how machine learning interventions are being incorporated into clinical trials in health care.
OBJECTIVE To systematically examine the design, reporting standards, risk of bias, and inclusivity of RCTs for medical machine learning interventions.
EVIDENCE REVIEW In this systematic review, the Cochrane Library, Google Scholar, Ovid Embase, Ovid MEDLINE, PubMed, Scopus, and Web of Science Core Collection online databases were searched and citation chasing was done to find relevant articles published from the inception of each database to October 15, 2021. Search terms for machine learning, clinical decision-making, and RCTs were used. Exclusion criteria included implementation of a non-RCT design, absence of original data, and evaluation of nonclinical interventions. Data were extracted from published articles. Trial characteristics, including primary intervention, demographics, adherence to the CONSORT-AI reporting guideline, and Cochrane risk of bias were analyzed.
FINDINGS Literature search yielded 19 737 articles, of which 41 RCTs involved a median of 294 participants (range, 17-2488 participants). A total of 16 RCTs (39%) were published in 2021, 21 (51%) were conducted at single sites, and 15 (37%) involved endoscopy. No trials adhered to all CONSORT-AI standards. Common reasons for nonadherence were not assessing poor-quality or unavailable input data (38 trials [93%]), not analyzing performance errors (38 [93%]), and not including a statement regarding code or algorithm availability (37 [90%]). Overall risk of bias was high in 7 trials (17%). Of 11 trials (27%) that reported race and ethnicity data, the median proportion of participants from underrepresented minority groups was 21% (range, 0%-51%).
CONCLUSIONS AND RELEVANCE This systematic review found that despite the large number of medical machine learning-based algorithms in development, few RCTs for these technologies have been conducted. Among published RCTs, there was high variability in adherence to reporting standards and risk of bias and a lack of participants from underrepresented minority groups. These findings merit attention and should be considered in future RCT design and reporting.
Pages: 14