Validating the accuracy of deep learning for the diagnosis of pneumonia on chest x-ray against a robust multimodal reference diagnosis: a post hoc analysis of two prospective studies

Cited by: 4
Authors
Hofmeister, Jeremy [1]
Garin, Nicolas [2, 3]
Montet, Xavier [1]
Scheffler, Max [1]
Platon, Alexandra [1]
Poletti, Pierre-Alexandre [1]
Stirnemann, Jerome [3]
Debray, Marie-Pierre [4]
Claessens, Yann-Erick [5]
Duval, Xavier [6]
Prendki, Virginie [7, 8]
Affiliations
[1] Geneva Univ Hosp, Dept Diagnost, Geneva, Switzerland
[2] Riviera Chablais Hosp, Div Internal Med, Rennaz, Switzerland
[3] Geneva Univ Hosp, Dept Med, Geneva, Switzerland
[4] Univ Paris Cite, Hop Bichat, APHP, Dept Radiol,Inserm UMR1152, Paris, France
[5] Ctr Hosp Princesse Grace, Dept Emergency Med, La Colle, Principality of Monaco
[6] Univ Paris Cite, Hop Bichat, APHP, Dept Epidemiol & Clin Res,Inserm C 1425UMR 1138, Paris, France
[7] Geneva Univ Hosp, Dept Rehabil & Geriatr, Geneva, Switzerland
[8] Geneva Univ Hosp, Div Infect Dis, 4 Rue Gabrielle Perret Gentil, CH-1211 Geneva 14, Switzerland
Keywords
Artificial intelligence; Chest x-ray; Deep learning; Diagnosis; Pneumonia; COMMUNITY-ACQUIRED PNEUMONIA; COMPUTED-TOMOGRAPHY; INTEROBSERVER RELIABILITY; IMPLEMENTATION; RADIOLOGISTS; VARIABILITY; RADIOGRAPHS; MODEL;
DOI
10.1186/s41747-023-00416-y
Chinese Library Classification
R8 [Special Medicine]; R445 [Diagnostic Imaging];
Discipline classification code
1002 ; 100207 ; 1009 ;
Abstract
Background: Artificial intelligence (AI) seems promising in diagnosing pneumonia on chest x-rays (CXR), but deep learning (DL) algorithms have primarily been compared with radiologists, whose diagnoses may not be completely accurate. Therefore, we evaluated the accuracy of DL in diagnosing pneumonia on CXR using a more robust reference diagnosis.

Methods: We trained a DL convolutional neural network model to diagnose pneumonia and evaluated its accuracy in two prospective pneumonia cohorts including 430 patients, for whom the reference diagnosis was determined a posteriori by a multidisciplinary expert panel using multimodal data. The performance of the DL model was compared with that of senior radiologists and emergency physicians reviewing CXRs and that of radiologists reviewing computed tomography (CT) performed concomitantly.

Results: Radiologists and DL showed similar accuracy on CXR in both cohorts (p ≥ 0.269): cohort 1, radiologist 1 75.5% (95% confidence interval 69.1-80.9), radiologist 2 71.0% (64.4-76.8), DL 71.0% (64.4-76.8); cohort 2, radiologist 70.9% (64.7-76.4), DL 72.6% (66.5-78.0). The accuracy of radiologists and DL was significantly higher (p ≤ 0.022) than that of emergency physicians (cohort 1 64.0% [57.1-70.3], cohort 2 63.0% [55.6-69.0]). Accuracy was significantly higher for CT (cohort 1 79.0% [72.8-84.1], cohort 2 89.6% [84.9-92.9]) than for CXR readers including radiologists, clinicians, and DL (all p-values < 0.001).

Conclusions: When compared with a robust reference diagnosis, the performance of AI models in identifying pneumonia on CXRs was inferior to that previously reported, but similar to that of radiologists and better than that of emergency physicians.

Relevance statement: The clinical relevance of AI models for pneumonia diagnosis may have been overestimated. AI models should be benchmarked against a robust multimodal reference diagnosis to avoid overestimating their performance.
Key points
• We evaluated an open-access convolutional neural network (CNN) model to diagnose pneumonia on CXRs.
• The CNN was validated against a strong multimodal reference diagnosis.
• In our study, the CNN performance (area under the receiver operating characteristic curve 0.74) was lower than that previously reported when validated against radiologists' diagnosis (0.99 in a recent meta-analysis).
• The CNN performance was significantly higher than that of emergency physicians (p ≤ 0.022) and comparable to that of board-certified radiologists (p ≥ 0.269).
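The accuracies in the abstract are reported with 95% confidence intervals. As a minimal sketch of how such an interval can be obtained for a proportion, the Python snippet below implements the Wilson score interval, one common choice. The record does not state which interval method the authors used, nor the per-cohort sample sizes, so the function name `wilson_interval` and the counts (151 correct reads out of 200) are hypothetical and purely illustrative.

```python
import math


def wilson_interval(successes: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion (z = 1.96 gives about 95%)."""
    if total <= 0:
        raise ValueError("total must be positive")
    p_hat = successes / total
    denom = 1 + z**2 / total
    # Centre of the interval is pulled slightly towards 0.5
    centre = (p_hat + z**2 / (2 * total)) / denom
    half_width = (
        z * math.sqrt(p_hat * (1 - p_hat) / total + z**2 / (4 * total**2)) / denom
    )
    return max(0.0, centre - half_width), min(1.0, centre + half_width)


# Hypothetical counts, chosen only for illustration (not taken from the study)
correct, n = 151, 200
low, high = wilson_interval(correct, n)
print(f"accuracy {correct / n:.1%}, 95% CI {low:.1%} to {high:.1%}")
```

With these illustrative counts the accuracy is 75.5% and the interval runs from roughly 69% to 81%. Unlike the simple normal approximation, the Wilson interval stays within 0 and 1 and behaves reasonably for moderate sample sizes.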
Pages: 10
Related papers
50 in total
  • [41] A cross-modal deep metric learning model for disease diagnosis based on chest x-ray images
    Yufei Jin
    Huijuan Lu
    Zhao Li
    Yanbin Wang
    [J]. Multimedia Tools and Applications, 2023, 82 : 33421 - 33442
  • [42] A cross-modal deep metric learning model for disease diagnosis based on chest x-ray images
    Jin, Yufei
    Lu, Huijuan
    Li, Zhao
    Wang, Yanbin
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (21) : 33421 - 33442
  • [43] CXR-Net: A Multitask Deep Learning Network for Explainable and Accurate Diagnosis of COVID-19 Pneumonia From Chest X-Ray Images
    Zhang, Xin
    Han, Liangxiu
    Sobeih, Tam
    Han, Lianghao
    Dempsey, Nina
    Lechareas, Symeon
    Tridente, Ascanio
    Chen, Haoming
    White, Stephen
    Zhang, Daoqiang
    [J]. IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2023, 27 (02) : 980 - 991
  • [44] A deep-learning pipeline for the diagnosis and discrimination of viral, non-viral and COVID-19 pneumonia from chest X-ray images
    Wang, Guangyu
    Liu, Xiaohong
    Shen, Jun
    Wang, Chengdi
    Li, Zhihuan
    Ye, Linsen
    Wu, Xingwang
    Chen, Ting
    Wang, Kai
    Zhang, Xuan
    Zhou, Zhongguo
    Yang, Jian
    Sang, Ye
    Deng, Ruiyun
    Liang, Wenhua
    Yu, Tao
    Gao, Ming
    Wang, Jin
    Yang, Zehong
    Cai, Huimin
    Lu, Guangming
    Zhang, Lingyan
    Yang, Lei
    Xu, Wenqin
    Wang, Winston
    Olevera, Andrea
    Ziyar, Ian
    Zhang, Charlotte
    Li, Oulan
    Liao, Weihua
    Liu, Jun
    Chen, Wen
    Chen, Wei
    Shi, Jichan
    Zheng, Lianghong
    Zhang, Longjiang
    Yan, Zhihan
    Zou, Xiaoguang
    Lin, Guiping
    Cao, Guiqun
    Lau, Laurance L.
    Mo, Long
    Liang, Yong
    Roberts, Michael
    Sala, Evis
    Schonlieb, Carola-Bibiane
    Fok, Manson
    Lau, Johnson Yiu-Nam
    Xu, Tao
    He, Jianxing
    [J]. NATURE BIOMEDICAL ENGINEERING, 2021, 5 (06) : 509 - +
  • [45] Improving lung region segmentation accuracy in chest X-ray images using a two-model deep learning ensemble approach
    Rahman, Md Fashiar
    Zhuang, Yan
    Tseng, Tzu-Liang
    Pokojovy, Michael
    McCaffrey, Peter
    Walser, Eric
    Moen, Scott
    Vo, Alex
    [J]. JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2022, 85
  • [46] Deep Learning-Based Decision-Tree Classifier for COVID-19 Diagnosis From Chest X-ray Imaging
    Yoo, Seung Hoon
    Geng, Hui
    Chiu, Tin Lok
    Yu, Siu Ki
    Cho, Dae Chul
    Heo, Jin
    Choi, Min Sung
    Choi, Il Hyun
    Cong Cung Van
    Nguen Viet Nhung
    Min, Byung Jun
    Lee, Ho
    [J]. FRONTIERS IN MEDICINE, 2020, 7
  • [47] The effectiveness of deep learning vs. traditional methods for lung disease diagnosis using chest X-ray images: A systematic review
    Sajed, Samira
    Sanati, Amir
    Garcia, Jorge Esparteiro
    Rostami, Habib
    Keshavarz, Ahmad
    Teixeira, Andreia
    [J]. APPLIED SOFT COMPUTING, 2023, 147
  • [48] Chest X-ray Analysis With Deep Learning-Based Software as a Triage Test for Pulmonary Tuberculosis: An Individual Patient Data Meta-Analysis of Diagnostic Accuracy
    Tavaziva, Gamuchirai
    Harris, Miriam
    Abidi, Syed K.
    Geric, Coralie
    Breuninger, Marianne
    Dheda, Keertan
    Esmail, Aliasgar
    Muyoyeta, Monde
    Reither, Klaus
    Majidulla, Arman
    Khan, Aamir J.
    Campbell, Jonathon R.
    David, Pierre-Marie
    Denkinger, Claudia
    Miller, Cecily
    Nathavitharana, Ruvandhi
    Pai, Madhukar
    Benedetti, Andrea
    Khan, Faiz Ahmad
    [J]. CLINICAL INFECTIOUS DISEASES, 2022, 74 (08) : 1390 - 1400
  • [49] COVID-DeepNet: Hybrid Multimodal Deep Learning System for Improving COVID-19 Pneumonia Detection in Chest X-ray Images
    Al-Waisy, A. S.
    Mohammed, Mazin Abed
    Al-Fandawi, Shumoos
    Maashi, M. S.
    Garcia-Zapirain, Begonya
    Abdulkareem, Karrar Hameed
    Mostafa, S. A.
    Kumar, Nallapaneni Manoj
    Dac-Nhuong Le
    [J]. CMC-COMPUTERS MATERIALS & CONTINUA, 2021, 67 (02) : 2409 - 2429
  • [50] Enhanced Pneumonia Diagnosis Using Chest X-Ray Image Features and Multilayer Perceptron and k-NN Machine Learning Algorithms
    Celik, Ahmet
    Demirel, Semih
    [J]. TRAITEMENT DU SIGNAL, 2023, 40 (03) : 1015 - 1023