Multi-center validation of an artificial intelligence system for detection of COVID-19 on chest radiographs in symptomatic patients

被引:8
作者
Kuo, Michael D. [1 ,2 ]
Chiu, Keith W. H. [1 ]
Wang, David S. [3 ]
Larici, Anna Rita [4 ,5 ]
Poplavskiy, Dmytro [2 ]
Valentini, Adele [6 ]
Napoli, Alessandro [7 ]
Borghesi, Andrea [8 ]
Ligabue, Guido [9 ,10 ]
Fang, Xin Hao B. [11 ]
Wong, Hing Ki C. [12 ]
Zhang, Sailong [1 ]
Hunter, John R. [3 ]
Mousa, Abeer [13 ]
Infante, Amato [5 ,14 ]
Elia, Lorenzo [4 ,5 ]
Golemi, Salvatore [8 ]
Yu, Leung Ho P. [15 ]
Hui, Christopher K. M. [16 ,17 ]
Erickson, Bradley J. [13 ]
机构
[1] Univ Hong Kong, LKS Fac Med, Dept Diagnost Radiol, Med Artificial Intelligence Lab Program, Hong Kong, Peoples R China
[2] Ensemblehealth Ai, Ensemble Grp Holdings, Scottsdale, AZ 85251 USA
[3] Stanford Hlth Care, Dept Radiol, Stanford, CA USA
[4] Univ Cattolica Sacro Cuore, Dept Radiol & Hematol Sci, Sect Radiol, Rome, Italy
[5] Fdn Policlin Univ A Gemelli IRCCS, Dept Diagnost Imaging Oncol Radiotherapy & Hemato, Rome, Italy
[6] Fdn IRCCS Policlin San Matteo, Dept Radiol, Pavia, Italy
[7] Sapienza Univ Rome, Dept Radiol Oncol & Pathol Sci, Rome, Italy
[8] Univ Brescia, ASST Spedali Civili Brescia, Dept Med & Surg Specialties Radiol Sci & Publ Hlt, Brescia, Italy
[9] Modena & Reggio Emilia Univ, Dept Med & Surg Sci Children & Adults, Modena, Italy
[10] Azienda Osped Univ Policlin Modena, Div Radiol, Modena, Italy
[11] Queen Mary Hosp, Radiol Dept, Hong Kong, Peoples R China
[12] United Christian Hosp, Radiol Dept, Hong Kong, Peoples R China
[13] Mayo Clin, Radiol Dept, Rochester, MN USA
[14] Columbus Covid 2 Hosp, Rome, Italy
[15] Educ Univ Hong Kong, Dept Math & Informat Technol, Hong Kong, Peoples R China
[16] Univ Hong Kong, LKS Fac Med, Dept Med, Hong Kong, Peoples R China
[17] Matilda & War Mem Hosp, Dept Resp & Crit Care Med, Hong Kong, Peoples R China
关键词
Artificial intelligence; COVID-19; Radiology; Thoracic; Public health; SARS-COV-2;
D O I
10.1007/s00330-022-08969-z
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objectives While chest radiograph (CXR) is the first-line imaging investigation in patients with respiratory symptoms, differentiating COVID-19 from other respiratory infections on CXR remains challenging. We developed and validated an AI system for COVID-19 detection on presenting CXR. Methods A deep learning model (RadGenX), trained on 168,850 CXRs, was validated on a large international test set of presenting CXRs of symptomatic patients from 9 study sites (US, Italy, and Hong Kong SAR) and 2 public datasets from the US and Europe. Performance was measured by area under the receiver operator characteristic curve (AUC). Bootstrapped simulations were performed to assess performance across a range of potential COVID-19 disease prevalence values (3.33 to 33.3%). Comparison against international radiologists was performed on an independent test set of 852 cases. Results RadGenX achieved an AUC of 0.89 on 4-fold cross-validation and an AUC of 0.79 (95%CI 0.78-0.80) on an independent test cohort of 5,894 patients. Delong's test showed statistical differences in model performance across patients from different regions (p < 0.01), disease severity (p < 0.001), gender (p < 0.001), and age (p = 0.03). Prevalence simulations showed the negative predictive value increases from 86.1% at 33.3% prevalence, to greater than 98.5% at any prevalence below 4.5%. Compared with radiologists, McNemar's test showed the model has higher sensitivity (p < 0.001) but lower specificity (p < 0.001). Conclusion An AI model that predicts COVID-19 infection on CXR in symptomatic patients was validated on a large international cohort providing valuable context on testing and performance expectations for AI systems that perform COVID-19 prediction on CXR.
引用
收藏
页码:23 / 33
页数:11
相关论文
共 37 条
  • [1] COVID-CAPS: A capsule network-based framework for identification of COVID-19 cases from X-ray images
    Afshar, Parnian
    Heidarian, Shahin
    Naderkhani, Farnoosh
    Oikonomou, Anastasia
    Plataniotis, Konstantinos N.
    Mohammadi, Arash
    [J]. PATTERN RECOGNITION LETTERS, 2020, 138 : 638 - 643
  • [2] Blueprint WJG Switzerland, 2020, NOVEL CORONAVIRUS CO
  • [3] PadChest: A large chest x-ray image dataset with multi-label annotated reports
    Bustos, Aurelia
    Pertusa, Antonio
    Salinas, Jose-Maria
    de la Iglesia-Vaya, Maria
    [J]. MEDICAL IMAGE ANALYSIS, 2020, 66
  • [4] Post-lockdown SARS-CoV-2 nucleic acid screening in nearly ten million residents of Wuhan, China
    Cao, Shiyi
    Gan, Yong
    Wang, Chao
    Bachmann, Max
    Wei, Shanbo
    Gong, Jie
    Huang, Yuchai
    Wang, Tiantian
    Li, Liqing
    Lu, Kai
    Jiang, Heng
    Gong, Yanhong
    Xu, Hongbin
    Shen, Xin
    Tian, Qingfeng
    Lv, Chuanzhu
    Song, Fujian
    Yin, Xiaoxv
    Lu, Zuxun
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [5] Detection of COVID-19 Using Deep Learning Algorithms on Chest Radiographs
    Chiu, Wan Hang Keith
    Vardhanabhuti, Varut
    Poplavskiy, Dmytro
    Yu, Philip Leung Ho
    Du, Richard
    Yap, Alistair Yun Hee
    Zhang, Sailong
    Fong, Ambrose Ho-Tung
    Chin, Thomas Wing-Yan
    Lee, Jonan Chun Yin
    Leung, Siu Ting
    Lo, Christine Shing Yen
    Lui, Macy Mei-Sze
    Fang, Benjamin Xin Hao
    Ng, Ming-Yen
    Kuo, Michael D.
    [J]. JOURNAL OF THORACIC IMAGING, 2020, 35 (06) : 369 - 376
  • [6] Chiu WHK, 2021, MEDRXIV, DOI [10.1101/2021.01.10.21249370, DOI 10.1101/2021.01.10.21249370]
  • [7] Public Covid-19 X-ray datasets and their impact on model bias-A systematic review of a significant problem
    Cruz, Beatriz Garcia Santa
    Bossa, Matias Nicolas
    Solter, Jan
    Husch, Andreas Dominik
    [J]. MEDICAL IMAGE ANALYSIS, 2021, 74
  • [8] Chest imaging representing a COVID-19 positive rural US population
    Desai, Shivang
    Baghal, Ahmad
    Wongsurawat, Thidathip
    Jenjaroenpun, Piroon
    Powell, Thomas
    Al-Shukri, Shaymaa
    Gates, Kim
    Farmer, Phillip
    Rutherford, Michael
    Blake, Geri
    Nolan, Tracy
    Sexton, Kevin
    Bennett, William
    Smith, Kirk
    Syed, Shorabuddin
    Prior, Fred
    [J]. SCIENTIFIC DATA, 2020, 7 (01)
  • [9] Automatic coronavirus disease 2019 diagnosis based on chest radiography and deep learning - Success story or dataset bias?
    Dhont, Jennifer
    Wolfs, Cecile
    Verhaegen, Frank
    [J]. MEDICAL PHYSICS, 2022, 49 (02) : 978 - 987
  • [10] Disparities in SARS-CoV-2 Testing in Massachusetts During the COVID-19 Pandemic
    Dryden-Peterson, Scott
    Velasquez, Gustavo E.
    Stopka, Thomas J.
    Davey, Sonya
    Lockman, Shahin
    Ojikutu, Bisola O.
    [J]. JAMA NETWORK OPEN, 2021, 4 (02) : E2037067