Validating the accuracy of deep learning for the diagnosis of pneumonia on chest x-ray against a robust multimodal reference diagnosis: a post hoc analysis of two prospective studies

被引:4
作者
Hofmeister, Jeremy [1 ]
Garin, Nicolas [2 ,3 ]
Montet, Xavier [1 ]
Scheffler, Max [1 ]
Platon, Alexandra [1 ]
Poletti, Pierre-Alexandre [1 ]
Stirnemann, Jerome [3 ]
Debray, Marie-Pierre [4 ]
Claessens, Yann-Erick [5 ]
Duval, Xavier [6 ]
Prendki, Virginie [7 ,8 ]
机构
[1] Geneva Univ Hosp, Dept Diagnost, Geneva, Switzerland
[2] Riviera Chablais Hosp, Div Internal Med, Rennaz, Switzerland
[3] Geneva Univ Hosp, Dept Med, Geneva, Switzerland
[4] Univ Paris Cite, Hop Bichat, APHP, Dept Radiol,Inserm UMR1152, Paris, France
[5] Ctr Hosp Princesse Grace, Dept Emergency Med, La Colle, Principality of, Monaco
[6] Univ Paris Cite, Hop Bichat, APHP, Dept Epidemiol & Clin Res,Inserm C 1425UMR 1138, Paris, France
[7] Geneva Univ Hosp, Dept Rehabil & Geriatr, Geneva, Switzerland
[8] Geneva Univ Hosp, Div Infect Dis, 4 Rue Gabrielle Perret Gentil, CH-1211 Geneva 14, Switzerland
关键词
Artificial intelligence; Chest x-ray; Deep learning; Diagnosis; Pneumonia; COMMUNITY-ACQUIRED PNEUMONIA; COMPUTED-TOMOGRAPHY; INTEROBSERVER RELIABILITY; IMPLEMENTATION; RADIOLOGISTS; VARIABILITY; RADIOGRAPHS; MODEL;
D O I
10.1186/s41747-023-00416-y
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background Artificial intelligence (AI) seems promising in diagnosing pneumonia on chest x-rays (CXR), but deep learning (DL) algorithms have primarily been compared with radiologists, whose diagnosis can be not completely accurate. Therefore, we evaluated the accuracy of DL in diagnosing pneumonia on CXR using a more robust reference diagnosis. Methods We trained a DL convolutional neural network model to diagnose pneumonia and evaluated its accuracy in two prospective pneumonia cohorts including 430 patients, for whom the reference diagnosis was determined a posteriori by a multidisciplinary expert panel using multimodal data. The performance of the DL model was compared with that of senior radiologists and emergency physicians reviewing CXRs and that of radiologists reviewing computed tomography (CT) performed concomitantly. Results Radiologists and DL showed a similar accuracy on CXR for both cohorts (p >= 0.269): cohort 1, radiologist 1 75.5% (95% confidence interval 69.1-80.9), radiologist 2 71.0% (64.4-76.8), DL 71.0% (64.4-76.8); cohort 2, radiologist 70.9% (64.7-76.4), DL 72.6% (66.5-78.0). The accuracy of radiologists and DL was significantly higher (p <= 0.022) than that of emergency physicians (cohort 1 64.0% [57.1-70.3], cohort 2 63.0% [55.6-69.0]). Accuracy was significantly higher for CT (cohort 1 79.0% [72.8-84.1], cohort 2 89.6% [84.9-92.9]) than for CXR readers including radiologists, clinicians, and DL (all p-values < 0.001). Conclusions When compared with a robust reference diagnosis, the performance of AI models to identify pneumonia on CXRs was inferior than previously reported but similar to that of radiologists and better than that of emergency physicians. Relevance statement: The clinical relevance of AI models for pneumonia diagnosis may have been overestimated. AI models should be benchmarked against robust reference multimodal diagnosis to avoid overestimating its performance. Key point center dot We evaluated an openly-access convolutional neural network (CNN) model to diagnose pneumonia on CXRs. center dot CNN was validated against a strong multimodal reference diagnosis. center dot In our study, the CNN performance (area under the receiver operating characteristics curve 0.74) was lower than that previously reported when validated against radiologists' diagnosis (0.99 in a recent meta-analysis). center dot The CNN performance was significantly higher than emergency physicians' (p <= 0.022) and comparable to that of board-certified radiologists (p >= 0.269).
引用
收藏
页数:10
相关论文
共 50 条
  • [21] Automated Pneumonia Diagnosis using a 2D Deep Convolutional Neural Network with Chest X-Ray Images
    Kassylkassova, Kamila
    Omarov, Batyrkhan
    Kazbekova, Gulnur
    Kozhamkulova, Zhadra
    Maikotov, Mukhit
    Bidakhmet, Zhanar
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (02) : 699 - 708
  • [22] Diagnosis of COVID-19 Using Chest X-ray Images and Disease Symptoms Based on Stacking Ensemble Deep Learning
    AlMohimeed, Abdulaziz
    Saleh, Hager
    El-Rashidy, Nora
    Saad, Redhwan M. A.
    El-Sappagh, Shaker
    Mostafa, Sherif
    DIAGNOSTICS, 2023, 13 (11)
  • [23] Deep Learning-Based System Combining Chest X-Ray and Computerized Tomography Images for COVID-19 Diagnosis
    Ding, Hui
    Fan, Lingyan
    Zhang, Jingfeng
    Gao, Guosheng
    BRITISH JOURNAL OF HOSPITAL MEDICINE, 2024, 85 (08)
  • [24] Improving pneumonia diagnosis with high-accuracy CNN-Based chest X-ray image classification and integrated gradient
    Rabbah, Jalal
    Ridouani, Mohammed
    Hassouni, Larbi
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2025, 101
  • [25] A Deep Learning Model with Self-Supervised Learning and Attention Mechanism for COVID-19 Diagnosis Using Chest X-ray Images
    Park, Junghoon
    Kwak, Il-Youp
    Lim, Changwon
    ELECTRONICS, 2021, 10 (16)
  • [26] Deep Learning-based Diagnosis of Pulmonary Tuberculosis on Chest X-ray in the Emergency Department: A Retrospective Study
    Wang, Chih-Hung
    Chang, Weishan
    Lee, Meng-Rui
    Tay, Joyce
    Wu, Cheng-Yi
    Wu, Meng-Che
    Roth, Holger R.
    Yang, Dong
    Zhao, Can
    Wang, Weichung
    Huang, Chien-Hua
    JOURNAL OF IMAGING INFORMATICS IN MEDICINE, 2024, 37 (02): : 589 - 600
  • [27] COVID-19 Diagnosis Through Deep Learning Techniques and Chest X-Ray Images
    Negreiros R.R.B.
    Silva I.H.S.
    Alves A.L.F.
    Valadares D.C.G.
    Perkusich A.
    Baptista C.S.
    SN Computer Science, 4 (5)
  • [28] Accuracy of Clinical Evaluation and Chest X-ray for HFpEF Diagnosis: Comparison Against Right Heart Catheterization
    Rahi, Wissam
    Hussain, Imad
    Nguyen, Duc T.
    Graviss, Edward A.
    Quinones, Miguel A.
    Nagueh, Sherif F.
    AMERICAN JOURNAL OF CARDIOLOGY, 2023, 206 : 219 - 220
  • [29] Diagnosis of asthmatic pneumonia in children by lung ultrasound vs. chest X-ray: an updated systematic review and meta-analysis
    Ru, Qi
    Liu, LanLan
    Dong, Xiaoyun
    POSTEPY DERMATOLOGII I ALERGOLOGII, 2023, 40 (01): : 28 - 34
  • [30] A deep-learning based multimodal system for Covid-19 diagnosis using breathing sounds and chest X-ray images
    Sait, Unais
    Lal, K. V. Gokul
    Shivakumar, Sanjana
    Kumar, Tarun
    Bhaumik, Rahul
    Prajapati, Sunny
    Bhalla, Kriti
    Chakrapani, Anaghaa
    APPLIED SOFT COMPUTING, 2021, 109