Multi-scale, domain knowledge-guided attention plus random forest: a two-stage deep learning-based multi-scale guided attention models to diagnose idiopathic pulmonary fibrosis from computed tomography images

被引:9
作者
Yu, Wenxi [1 ]
Zhou, Hua [1 ]
Choi, Youngwon [1 ]
Goldin, Jonathan G. [1 ]
Teng, Pangyu [1 ]
Wong, Weng Kee [1 ]
McNitt-Gray, Michael F. [1 ]
Brown, Matthew S. [1 ]
Kim, Grace Hyun J. [1 ]
机构
[1] Univ Calif Los Angeles, Dept Biostat, 924 Westwood Blvd,Suite 650, Los Angeles, CA 90024 USA
基金
美国国家科学基金会;
关键词
attention models; computed tomography; deep learning; domain knowledge; idiopathic pulmonary fibrosis; machine learning; medical imaging;
D O I
10.1002/mp.16053
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Background Idiopathic pulmonary fibrosis (IPF) is a progressive, irreversible, and usually fatal lung disease of unknown reasons, generally affecting the elderly population. Early diagnosis of IPF is crucial for triaging patients' treatment planning into anti-fibrotic treatment or treatments for other causes of pulmonary fibrosis. However, current IPF diagnosis workflow is complicated and time-consuming, which involves collaborative efforts from radiologists, pathologists, and clinicians and it is largely subject to inter-observer variability. Purpose The purpose of this work is to develop a deep learning-based automated system that can diagnose subjects with IPF among subjects with interstitial lung disease (ILD) using an axial chest computed tomography (CT) scan. This work can potentially enable timely diagnosis decisions and reduce inter-observer variability. Methods Our dataset contains CT scans from 349 IPF patients and 529 non-IPF ILD patients. We used 80% of the dataset for training and validation purposes and 20% as the holdout test set. We proposed a two-stage model: at stage one, we built a multi-scale, domain knowledge-guided attention model (MSGA) that encouraged the model to focus on specific areas of interest to enhance model explainability, including both high- and medium-resolution attentions; at stage two, we collected the output from MSGA and constructed a random forest (RF) classifier for patient-level diagnosis, to further boost model accuracy. RF classifier is utilized as a final decision stage since it is interpretable, computationally fast, and can handle correlated variables. Model utility was examined by (1) accuracy, represented by the area under the receiver operating characteristic curve (AUC) with standard deviation (SD), and (2) explainability, illustrated by the visual examination of the estimated attention maps which showed the important areas for model diagnostics. Results During the training and validation stage, we observe that when we provide no guidance from domain knowledge, the IPF diagnosis model reaches acceptable performance (AUC +/- SD = 0.93 +/- 0.07), but lacks explainability; when including only guided high- or medium-resolution attention, the learned attention maps are not satisfactory; when including both high- and medium-resolution attention, under certain hyperparameter settings, the model reaches the highest AUC among all experiments (AUC +/- SD = 0.99 +/- 0.01) and the estimated attention maps concentrate on the regions of interests for this task. Three best-performing hyperparameter selections according to MSGA were applied to the holdout test set and reached comparable model performance to that of the validation set. Conclusions Our results suggest that, for a task with only scan-level labels available, MSGA+RF can utilize the population-level domain knowledge to guide the training of the network, which increases both model accuracy and explainability.
引用
收藏
页码:894 / 905
页数:12
相关论文
共 36 条
  • [1] [Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.319
  • [2] [Anonymous], 2014, Deep inside convolutional networks: Visualising image classification models and saliency maps, DOI DOI 10.48550/ARXIV.1312.6034
  • [3] [Anonymous], 2019, IEEE T MED IMAGING, DOI DOI 10.1109/TMI.2018.2867261
  • [4] Idiopathic pulmonary fibrosis: Physiologic tests, quantitative CT indexes, and CT visual scores as predictors of mortality
    Best, Alan C.
    Meng, Jiangfeng
    Lynch, Anne M.
    Bozic, Carmen M.
    Miller, David
    Grunwald, Gary K.
    Lynch, David A.
    [J]. RADIOLOGY, 2008, 246 (03) : 935 - 940
  • [5] Random forests
    Breiman, L
    [J]. MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [6] Chollet F., 2015, Keras, DOI DOI 10.1097/WAD.0B013E3182163B62
  • [7] Computer-Aided Diagnosis of Pulmonary Fibrosis Using Deep Learning and CT Images
    Christe, Andreas
    Peters, Alan A.
    Drakopoulos, Dionysios
    Heverhagen, Johannes T.
    Geiser, Thomas
    Stathopoulou, Thomai
    Christodoulidis, Stergios
    Anthimopoulos, Marios
    Mougiakakou, Stavroula G.
    Ebner, Lukas
    [J]. INVESTIGATIVE RADIOLOGY, 2019, 54 (10) : 627 - 632
  • [8] Multiscale attention guided U-Net architecture for cardiac segmentation in short-axis MRI images
    Cui, Hengfei
    Yuwen, Chang
    Jiang, Lei
    Xia, Yong
    Zhang, Yanning
    [J]. COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 206
  • [9] Imatinib Treatment for Idiopathic Pulmonary Fibrosis Randomized Placebo-controlled Trial Results
    Daniels, Craig E.
    Lasky, Joseph A.
    Limper, Andrew H.
    Mieras, Kathleen
    Gabor, Edith
    Schroeder, Darrell R.
    [J]. AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2010, 181 (06) : 604 - 610
  • [10] Hastie T, 2009, The Elements of Statistical Learning: Data Mining, Inference, and Prediction, VI, DOI DOI 10.1007/978-0-387-84858-7