Validating the accuracy of deep learning for the diagnosis of pneumonia on chest x-ray against a robust multimodal reference diagnosis: a post hoc analysis of two prospective studies

被引:4
作者
Hofmeister, Jeremy [1 ]
Garin, Nicolas [2 ,3 ]
Montet, Xavier [1 ]
Scheffler, Max [1 ]
Platon, Alexandra [1 ]
Poletti, Pierre-Alexandre [1 ]
Stirnemann, Jerome [3 ]
Debray, Marie-Pierre [4 ]
Claessens, Yann-Erick [5 ]
Duval, Xavier [6 ]
Prendki, Virginie [7 ,8 ]
机构
[1] Geneva Univ Hosp, Dept Diagnost, Geneva, Switzerland
[2] Riviera Chablais Hosp, Div Internal Med, Rennaz, Switzerland
[3] Geneva Univ Hosp, Dept Med, Geneva, Switzerland
[4] Univ Paris Cite, Hop Bichat, APHP, Dept Radiol,Inserm UMR1152, Paris, France
[5] Ctr Hosp Princesse Grace, Dept Emergency Med, La Colle, Principality of, Monaco
[6] Univ Paris Cite, Hop Bichat, APHP, Dept Epidemiol & Clin Res,Inserm C 1425UMR 1138, Paris, France
[7] Geneva Univ Hosp, Dept Rehabil & Geriatr, Geneva, Switzerland
[8] Geneva Univ Hosp, Div Infect Dis, 4 Rue Gabrielle Perret Gentil, CH-1211 Geneva 14, Switzerland
关键词
Artificial intelligence; Chest x-ray; Deep learning; Diagnosis; Pneumonia; COMMUNITY-ACQUIRED PNEUMONIA; COMPUTED-TOMOGRAPHY; INTEROBSERVER RELIABILITY; IMPLEMENTATION; RADIOLOGISTS; VARIABILITY; RADIOGRAPHS; MODEL;
D O I
10.1186/s41747-023-00416-y
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background Artificial intelligence (AI) seems promising in diagnosing pneumonia on chest x-rays (CXR), but deep learning (DL) algorithms have primarily been compared with radiologists, whose diagnosis can be not completely accurate. Therefore, we evaluated the accuracy of DL in diagnosing pneumonia on CXR using a more robust reference diagnosis. Methods We trained a DL convolutional neural network model to diagnose pneumonia and evaluated its accuracy in two prospective pneumonia cohorts including 430 patients, for whom the reference diagnosis was determined a posteriori by a multidisciplinary expert panel using multimodal data. The performance of the DL model was compared with that of senior radiologists and emergency physicians reviewing CXRs and that of radiologists reviewing computed tomography (CT) performed concomitantly. Results Radiologists and DL showed a similar accuracy on CXR for both cohorts (p >= 0.269): cohort 1, radiologist 1 75.5% (95% confidence interval 69.1-80.9), radiologist 2 71.0% (64.4-76.8), DL 71.0% (64.4-76.8); cohort 2, radiologist 70.9% (64.7-76.4), DL 72.6% (66.5-78.0). The accuracy of radiologists and DL was significantly higher (p <= 0.022) than that of emergency physicians (cohort 1 64.0% [57.1-70.3], cohort 2 63.0% [55.6-69.0]). Accuracy was significantly higher for CT (cohort 1 79.0% [72.8-84.1], cohort 2 89.6% [84.9-92.9]) than for CXR readers including radiologists, clinicians, and DL (all p-values < 0.001). Conclusions When compared with a robust reference diagnosis, the performance of AI models to identify pneumonia on CXRs was inferior than previously reported but similar to that of radiologists and better than that of emergency physicians. Relevance statement: The clinical relevance of AI models for pneumonia diagnosis may have been overestimated. AI models should be benchmarked against robust reference multimodal diagnosis to avoid overestimating its performance. Key point center dot We evaluated an openly-access convolutional neural network (CNN) model to diagnose pneumonia on CXRs. center dot CNN was validated against a strong multimodal reference diagnosis. center dot In our study, the CNN performance (area under the receiver operating characteristics curve 0.74) was lower than that previously reported when validated against radiologists' diagnosis (0.99 in a recent meta-analysis). center dot The CNN performance was significantly higher than emergency physicians' (p <= 0.022) and comparable to that of board-certified radiologists (p >= 0.269).
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Osteo-Net: A Robust Deep Learning-Based Diagnosis of Osteoporosis Using X-ray images
    Kumar, Arnav
    Joshi, Rakesh Chandra
    Dutta, Malay Kishore
    Burget, Radim
    Myska, Vojtech
    2022 45TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING, TSP, 2022, : 91 - 95
  • [32] Lung Disease Diagnosis Using Various Deep Learning Algorithms from the chest X-ray images
    Benkrama, Soumia
    Hemdani, Nour Elhouda
    PROGRAM OF THE 2ND INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND AUTOMATIC CONTROL, ICEEAC 2024, 2024,
  • [33] Comparison of deep learning architectures for COVID-19 diagnosis using chest X-ray images
    Sampen, Denilson
    Lavarello, Roberto
    MEDICAL IMAGING 2022: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2022, 12035
  • [34] Hybrid deep learning assisted chest X-ray image segmentation and classification for tuberculosis disease diagnosis
    Tiwari, Ajay
    Katiyar, Alok
    INTELLIGENT DECISION TECHNOLOGIES-NETHERLANDS, 2024, 18 (01): : 561 - 569
  • [35] Deep-Learning-Based Diagnosis of Bedside Chest X-ray in Intensive Care and Emergency Medicine
    Niehues, Stefan M.
    Adams, Lisa C.
    Gaudin, Robert A.
    Erxleben, Christoph
    Keller, Sarah
    Makowski, Marcus R.
    Vahldiek, Janis L.
    Bressem, Keno K.
    INVESTIGATIVE RADIOLOGY, 2021, 56 (08) : 525 - 534
  • [36] Covid-19 Diagnosis Using a Deep Learning Ensemble Model with Chest X-Ray Images
    Türk F.
    Computer Systems Science and Engineering, 2023, 45 (02): : 1357 - 1373
  • [37] Disease Area Detection for Chest X-Ray Image Diagnosis Using Deep Learning with Pseudo Labeling and Ensemble Learning
    Gerdprasert, Thanawit
    Mabu, Shingo
    Kido, Shoji
    IEEJ TRANSACTIONS ON ELECTRICAL AND ELECTRONIC ENGINEERING, 2023, 18 (11) : 1772 - 1780
  • [38] Transfer Learning for the Detection and Diagnosis of Types of Pneumonia including Pneumonia Induced by COVID-19 from Chest X-ray Images
    Brima, Yusuf
    Atemkeng, Marcellin
    Djiokap, Stive Tankio
    Ebiele, Jaures
    Tchakounte, Franklin
    DIAGNOSTICS, 2021, 11 (08)
  • [39] Fast deep learning computer-aided diagnosis of COVID-19 based on digital chest x-ray images
    Al-antari, Mugahed A.
    Hua, Cam-Hao
    Bang, Jaehun
    Lee, Sungyoung
    APPLIED INTELLIGENCE, 2021, 51 (05) : 2890 - 2907
  • [40] Improving Respiratory Infection Diagnosis with Deep Learning and Combinatorial Fusion: A Two-Stage Approach Using Chest X-ray Imaging
    Pan, Cheng-Tang
    Kumar, Rahul
    Wen, Zhi-Hong
    Wang, Chih-Hsuan
    Chang, Chun-Yung
    Shiue, Yow-Ling
    DIAGNOSTICS, 2024, 14 (05)