Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans

Cited by: 558
Authors
Roberts, Michael [1 ,2 ]
Driggs, Derek [1 ]
Thorpe, Matthew [3 ]
Gilbey, Julian [1 ]
Yeung, Michael [4 ]
Ursprung, Stephan [4 ,5 ]
Aviles-Rivero, Angelica I. [1 ]
Etmann, Christian [1 ]
McCague, Cathal [4 ,5 ]
Beer, Lucian [4 ]
Weir-McCall, Jonathan R. [4 ,6 ]
Teng, Zhongzhao [4 ]
Gkrania-Klotsas, Effrossyni [7 ]
Rudd, James H. F. [8 ]
Sala, Evis [4 ,5 ]
Schonlieb, Carola-Bibiane [1 ]
Affiliations
[1] Univ Cambridge, Dept Appl Math & Theoret Phys, Cambridge, England
[2] AstraZeneca, Oncol R&D, Cambridge, England
[3] Univ Manchester, Dept Math, Manchester, Lancs, England
[4] Univ Cambridge, Dept Radiol, Cambridge, England
[5] Univ Cambridge, Canc Res UK Cambridge Ctr, Cambridge, England
[6] Royal Papworth Hosp, Royal Papworth Hosp NHS Fdn Trust, Cambridge, England
[7] Cambridge Univ Hosp NHS Trust, Dept Infect Dis, Cambridge, England
[8] Univ Cambridge, Dept Med, Cambridge, England
Funding
Engineering and Physical Sciences Research Council (UK); Wellcome Trust (UK); European Research Council;
Keywords
PREDICTION; IMAGES; RISK; TOOL;
DOI
10.1038/s42256-021-00307-0
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Many machine learning-based approaches have been developed for the prognosis and diagnosis of COVID-19 from medical images and this Analysis identifies over 2,200 relevant published papers and preprints in this area. After initial screening, 62 studies are analysed and the authors find they all have methodological flaws standing in the way of clinical utility. The authors have several recommendations to address these issues.
Machine learning methods offer great promise for fast and accurate detection and prognostication of coronavirus disease 2019 (COVID-19) from standard-of-care chest radiographs (CXR) and chest computed tomography (CT) images. Many articles have been published in 2020 describing new machine learning-based models for both of these tasks, but it is unclear which are of potential clinical utility. In this systematic review, we consider all published papers and preprints, for the period from 1 January 2020 to 3 October 2020, which describe new machine learning models for the diagnosis or prognosis of COVID-19 from CXR or CT images. All manuscripts uploaded to bioRxiv, medRxiv and arXiv along with all entries in EMBASE and MEDLINE in this timeframe are considered. Our search identified 2,212 studies, of which 415 were included after initial screening and, after quality screening, 62 studies were included in this systematic review. Our review finds that none of the models identified are of potential clinical use due to methodological flaws and/or underlying biases. This is a major weakness, given the urgency with which validated COVID-19 models are needed. To address this, we give many recommendations which, if followed, will solve these issues and lead to higher-quality model development and well-documented manuscripts.
Pages: 199-217
Page count: 19