Confidence-Aware Severity Assessment of Lung Disease from Chest X-Rays Using Deep Neural Network on a Multi-Reader Dataset

被引:0
作者
Zandehshahvar, Mohammadreza [1 ]
van Assen, Marly [2 ]
Kim, Eun [2 ]
Kiarashi, Yashar [3 ]
Keerthipati, Vikranth [1 ]
Tessarin, Giovanni [2 ]
Muscogiuri, Emanuele [2 ]
Stillman, Arthur E. [2 ]
Filev, Peter [2 ]
Davarpanah, Amir H. [2 ]
Berkowitz, Eugene A. [2 ]
Tigges, Stefan [2 ]
Lee, Scott J. [2 ]
Vey, Brianna L. [2 ]
De Cecco, Carlo [2 ]
Adibi, Ali [1 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Emory Univ, Sch Med, Dept Radiol & Imaging Sci, Atlanta, GA USA
[3] Emory Univ, Emory Sch Med, Dept Biomed Informat, Atlanta, GA USA
来源
JOURNAL OF IMAGING INFORMATICS IN MEDICINE | 2024年
基金
美国国家科学基金会;
关键词
Lung disease; Severity; Deep learning; Confidence-aware prediction; Uncertainty; VARIABILITY;
D O I
10.1007/s10278-024-01151-5
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
In this study, we present a method based on Monte Carlo Dropout (MCD) as Bayesian neural network (BNN) approximation for confidence-aware severity classification of lung diseases in COVID-19 patients using chest X-rays (CXRs). Trained and tested on 1208 CXRs from Hospital 1 in the USA, the model categorizes severity into four levels (i.e., normal, mild, moderate, and severe) based on lung consolidation and opacity. Severity labels, determined by the median consensus of five radiologists, serve as the reference standard. The model's performance is internally validated against evaluations from an additional radiologist and two residents that were excluded from the median. The performance of the model is further evaluated on additional internal and external datasets comprising 2200 CXRs from the same hospital and 1300 CXRs from Hospital 2 in South Korea. The model achieves an average area under the curve (AUC) of 0.94 +/- 0.01 across all classes in the primary dataset, surpassing human readers in each severity class and achieves a higher Kendall correlation coefficient (KCC) of 0.80 +/- 0.03. The performance of the model is consistent across varied datasets, highlighting its generalization. A key aspect of the model is its predictive uncertainty (PU), which is inversely related to the level of agreement among radiologists, particularly in mild and moderate cases. The study concludes that the model outperforms human readers in severity assessment and maintains consistent accuracy across diverse datasets. Its ability to provide confidence measures in predictions is pivotal for potential clinical use, underscoring the BNN's role in enhancing diagnostic precision in lung disease analysis through CXR.
引用
收藏
页码:793 / 803
页数:11
相关论文
共 35 条
  • [1] Random forest method for the recognition of susceptibility and resistance patterns in antibiograms
    Ayala-Aldana, Nicolas
    Gonzalez-Valdes, Leticia
    [J]. REVISTA CHILENA DE INFECTOLOGIA, 2023, 40 (01): : 76 - 77
  • [2] Application of Artificial Intelligence in COVID-19 Diagnosis and Therapeutics
    Asada, Ken
    Komatsu, Masaaki
    Shimoyama, Ryo
    Takasawa, Ken
    Shinkai, Norio
    Sakai, Akira
    Bolatkan, Amina
    Yamada, Masayoshi
    Takahashi, Satoshi
    Machino, Hidenori
    Kobayashi, Kazuma
    Kaneko, Syuzo
    Hamamoto, Ryuji
    [J]. JOURNAL OF PERSONALIZED MEDICINE, 2021, 11 (09):
  • [3] Inter-observer variability in mammography screening and effect of type and number of readers on screening outcome
    Duijm, L. E. M.
    Louwman, M. W. J.
    Groenewoud, J. H.
    van de Poll-Franse, L. V.
    Fracheboud, J.
    Coebergh, J. W.
    [J]. BRITISH JOURNAL OF CANCER, 2009, 100 (06) : 901 - 907
  • [4] Deep learning-enabled medical computer vision
    Esteva, Andre
    Chou, Katherine
    Yeung, Serena
    Naik, Nikhil
    Madani, Ali
    Mottaghi, Ali
    Liu, Yun
    Topol, Eric
    Dean, Jeff
    Socher, Richard
    [J]. NPJ DIGITAL MEDICINE, 2021, 4 (01)
  • [5] Gal Y, 2017, PR MACH LEARN RES, V70
  • [6] Gal Y, 2016, PR MACH LEARN RES, V48
  • [7] Artificial intelligence for the detection of COVID-19 pneumonia on chest CT using multinational datasets
    Harmon, Stephanie A.
    Sanford, Thomas H.
    Xu, Sheng
    Turkbey, Evrim B.
    Roth, Holger
    Xu, Ziyue
    Yang, Dong
    Myronenko, Andriy
    Anderson, Victoria
    Amalou, Amel
    Blain, Maxime
    Kassin, Michael
    Long, Dilara
    Varble, Nicole
    Walker, Stephanie M.
    Bagci, Ulas
    Ierardi, Anna Maria
    Stellato, Elvira
    Plensich, Guido Giovanni
    Franceschelli, Giuseppe
    Girlando, Cristiano
    Irmici, Giovanni
    Labella, Dominic
    Hammoud, Dima
    Malayeri, Ashkan
    Jones, Elizabeth
    Summers, Ronald M.
    Choyke, Peter L.
    Xu, Daguang
    Flores, Mona
    Tamura, Kaku
    Obinata, Hirofumi
    Mori, Hitoshi
    Patella, Francesca
    Cariati, Maurizio
    Carrafiello, Gianpaolo
    An, Peng
    Wood, Bradford J.
    Turkbey, Baris
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [8] Development and evaluation of an artificial intelligence system for COVID-19 diagnosis
    Jin, Cheng
    Chen, Weixiang
    Cao, Yukun
    Xu, Zhanwei
    Tan, Zimeng
    Zhang, Xin
    Deng, Lei
    Zheng, Chuansheng
    Zhou, Jie
    Shi, Heshui
    Feng, Jianjiang
    [J]. NATURE COMMUNICATIONS, 2020, 11 (01)
  • [9] Assessing Reliability and Challenges of Uncertainty Estimations for Medical Image Segmentation
    Jungo, Alain
    Reyes, Mauricio
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION - MICCAI 2019, PT II, 2019, 11765 : 48 - 56
  • [10] Deep learning with noisy labels: Exploring techniques and remedies in medical image analysis
    Karimi, Davood
    Dou, Haoran
    Warfield, Simon K.
    Gholipour, Ali
    [J]. MEDICAL IMAGE ANALYSIS, 2020, 65 (65)