Validating the accuracy of deep learning for the diagnosis of pneumonia on chest x-ray against a robust multimodal reference diagnosis: a post hoc analysis of two prospective studies

被引:4
|
作者
Hofmeister, Jeremy [1 ]
Garin, Nicolas [2 ,3 ]
Montet, Xavier [1 ]
Scheffler, Max [1 ]
Platon, Alexandra [1 ]
Poletti, Pierre-Alexandre [1 ]
Stirnemann, Jerome [3 ]
Debray, Marie-Pierre [4 ]
Claessens, Yann-Erick [5 ]
Duval, Xavier [6 ]
Prendki, Virginie [7 ,8 ]
机构
[1] Geneva Univ Hosp, Dept Diagnost, Geneva, Switzerland
[2] Riviera Chablais Hosp, Div Internal Med, Rennaz, Switzerland
[3] Geneva Univ Hosp, Dept Med, Geneva, Switzerland
[4] Univ Paris Cite, Hop Bichat, APHP, Dept Radiol,Inserm UMR1152, Paris, France
[5] Ctr Hosp Princesse Grace, Dept Emergency Med, La Colle, Principality of, Monaco
[6] Univ Paris Cite, Hop Bichat, APHP, Dept Epidemiol & Clin Res,Inserm C 1425UMR 1138, Paris, France
[7] Geneva Univ Hosp, Dept Rehabil & Geriatr, Geneva, Switzerland
[8] Geneva Univ Hosp, Div Infect Dis, 4 Rue Gabrielle Perret Gentil, CH-1211 Geneva 14, Switzerland
关键词
Artificial intelligence; Chest x-ray; Deep learning; Diagnosis; Pneumonia; COMMUNITY-ACQUIRED PNEUMONIA; COMPUTED-TOMOGRAPHY; INTEROBSERVER RELIABILITY; IMPLEMENTATION; RADIOLOGISTS; VARIABILITY; RADIOGRAPHS; MODEL;
D O I
10.1186/s41747-023-00416-y
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background Artificial intelligence (AI) seems promising in diagnosing pneumonia on chest x-rays (CXR), but deep learning (DL) algorithms have primarily been compared with radiologists, whose diagnosis can be not completely accurate. Therefore, we evaluated the accuracy of DL in diagnosing pneumonia on CXR using a more robust reference diagnosis. Methods We trained a DL convolutional neural network model to diagnose pneumonia and evaluated its accuracy in two prospective pneumonia cohorts including 430 patients, for whom the reference diagnosis was determined a posteriori by a multidisciplinary expert panel using multimodal data. The performance of the DL model was compared with that of senior radiologists and emergency physicians reviewing CXRs and that of radiologists reviewing computed tomography (CT) performed concomitantly. Results Radiologists and DL showed a similar accuracy on CXR for both cohorts (p >= 0.269): cohort 1, radiologist 1 75.5% (95% confidence interval 69.1-80.9), radiologist 2 71.0% (64.4-76.8), DL 71.0% (64.4-76.8); cohort 2, radiologist 70.9% (64.7-76.4), DL 72.6% (66.5-78.0). The accuracy of radiologists and DL was significantly higher (p <= 0.022) than that of emergency physicians (cohort 1 64.0% [57.1-70.3], cohort 2 63.0% [55.6-69.0]). Accuracy was significantly higher for CT (cohort 1 79.0% [72.8-84.1], cohort 2 89.6% [84.9-92.9]) than for CXR readers including radiologists, clinicians, and DL (all p-values < 0.001). Conclusions When compared with a robust reference diagnosis, the performance of AI models to identify pneumonia on CXRs was inferior than previously reported but similar to that of radiologists and better than that of emergency physicians. Relevance statement: The clinical relevance of AI models for pneumonia diagnosis may have been overestimated. AI models should be benchmarked against robust reference multimodal diagnosis to avoid overestimating its performance. Key point center dot We evaluated an openly-access convolutional neural network (CNN) model to diagnose pneumonia on CXRs. center dot CNN was validated against a strong multimodal reference diagnosis. center dot In our study, the CNN performance (area under the receiver operating characteristics curve 0.74) was lower than that previously reported when validated against radiologists' diagnosis (0.99 in a recent meta-analysis). center dot The CNN performance was significantly higher than emergency physicians' (p <= 0.022) and comparable to that of board-certified radiologists (p >= 0.269).
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Validating the accuracy of deep learning for the diagnosis of pneumonia on chest x-ray against a robust multimodal reference diagnosis: a post hoc analysis of two prospective studies
    Jeremy Hofmeister
    Nicolas Garin
    Xavier Montet
    Max Scheffler
    Alexandra Platon
    Pierre-Alexandre Poletti
    Jérôme Stirnemann
    Marie-Pierre Debray
    Yann-Erick Claessens
    Xavier Duval
    Virginie Prendki
    European Radiology Experimental, 8
  • [2] Diagnosis of Pneumonia from Chest X-Ray Images using Deep Learning
    Ayan, Enes
    Unver, Halil Murat
    2019 SCIENTIFIC MEETING ON ELECTRICAL-ELECTRONICS & BIOMEDICAL ENGINEERING AND COMPUTER SCIENCE (EBBT), 2019,
  • [3] Progressive and Combined Deep Transfer Learning for pneumonia diagnosis in chest X-ray images
    Khaled, Mamar
    Gaceb, Djamel
    Touazi, Faycal
    Otsmane, Ahmed
    Boutoutaou, Farouk
    5TH INTERNATIONAL CONFERENCE ON INFORMATICS & DATA-DRIVEN MEDICINE, IDDM 2022, 2022, 3302
  • [4] COVID-19 Pneumonia Diagnosis Using Chest X-ray Radiography and Deep Learning
    Griner, Dalton
    Zhang, Ran
    Tie, Xin
    Zhang, Chengzhu
    Garrett, John
    Li, Ke
    Chen, Guang-Hong
    MEDICAL IMAGING 2021: COMPUTER-AIDED DIAGNOSIS, 2021, 11597
  • [5] Joint Diagnosis of Pneumonia, COVID-19, and Tuberculosis from Chest X-ray Images: A Deep Learning Approach
    Ahmed, Mohammed Salih
    Rahman, Atta
    AlGhamdi, Faris
    AlDakheel, Saleh
    Hakami, Hammam
    AlJumah, Ali
    AlIbrahim, Zuhair
    Youldash, Mustafa
    Alam Khan, Mohammad Aftab
    Basheer Ahmed, Mohammed Imran
    DIAGNOSTICS, 2023, 13 (15)
  • [6] Enhancing Chest X-ray Diagnosis with a Multimodal Deep Learning Network by Integrating Clinical History to Refine Attention
    Yang, Lian
    Wan, Yiliang
    Pan, Feng
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2025,
  • [7] A comprehensive segmentation of chest X-ray improves deep learning-based WHO radiologically confirmed pneumonia diagnosis in children
    Li, Yuemei
    Zhang, Lin
    Yu, Hu
    Wang, Jian
    Wang, Shuo
    Liu, Jungang
    Zheng, Qiang
    EUROPEAN RADIOLOGY, 2024, 34 (05) : 3471 - 3482
  • [8] Accuracy of deep learning for automated detection of pneumonia using chest X-Ray images: A systematic review and meta-analysis
    Li, Yuanyuan
    Zhang, Zhenyan
    Dai, Cong
    Dong, Qiang
    Badrigilan, Samireh
    COMPUTERS IN BIOLOGY AND MEDICINE, 2020, 123
  • [9] Diagnosis of Pediatric Pneumonia with Ensemble of Deep Convolutional Neural Networks in Chest X-Ray Images
    Ayan, Enes
    Karabulut, Bergen
    Unver, Halil Murat
    ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING, 2022, 47 (02) : 2123 - 2139
  • [10] Diagnosis of Pediatric Pneumonia with Ensemble of Deep Convolutional Neural Networks in Chest X-Ray Images
    Enes Ayan
    Bergen Karabulut
    Halil Murat Ünver
    Arabian Journal for Science and Engineering, 2022, 47 : 2123 - 2139