A probabilistic approach for interpretable deep learning in liver cancer diagnosis

被引:5
作者
Wang, Clinton J. [1 ]
Hamm, Charlie A. [1 ,2 ,3 ,4 ,5 ]
Letzen, Brian S. [1 ]
Duncan, James S. [1 ,6 ]
机构
[1] Yale Sch Med, Dept Radiol & Biomed Imaging, 333 Cedar St, New Haven, CT 06520 USA
[2] Charite Univ Med Berlin, Inst Radiol, D-10117 Berlin, Germany
[3] Free Univ Berlin, D-10117 Berlin, Germany
[4] Humboldt Univ, D-10117 Berlin, Germany
[5] Berlin Inst Hlth, D-10117 Berlin, Germany
[6] Yale Sch Engn & Appl Sci, Dept Biomed Engn, New Haven, CT 06520 USA
来源
MEDICAL IMAGING 2019: COMPUTER-AIDED DIAGNOSIS | 2019年 / 10950卷
基金
美国国家卫生研究院;
关键词
interpretable; liver cancer; deep learning; convolutional neural network; diagnostic radiology;
D O I
10.1117/12.2512473
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Despite rapid advances in deep learning applications for radiological diagnosis and prognosis, the clinical adoption of such models is limited by their inability to explain or justify their predictions. This work developed a probabilistic approach for interpreting the predictions of a convolutional neural network (CNN) trained to classify liver lesions from multiphase magnetic resonance imaging (MRI). It determined the presence of 14 radiological features, where each lesion image contained one to four features and only ten examples of each feature were provided. Using stochastic forward passes of these example images through a trained CNN, samples were obtained from each feature's conditional probability distribution over the network's intermediate outputs. The marginal distribution was sampled with stochastic forward passes of images from the entire training dataset, and sparse kernel density estimation (KDE) was used to infer which features were present in a test set of 60 lesion images. This approach was tested on a CNN that reached 89.7% accuracy in classifying six types of liver lesions. It identified radiological features with 72.2 +/- 2.2% precision and 82.6 +/- 2.0% recall. In contrast with previous interpretability approaches, this method used sparsely labeled data, did not change the CNN architecture, and directly outputted radiological descriptors of each image. This approach can identify and explain potential failure modes in a CNN, as well as make a CNN's predictions more transparent to radiologists. Such contributions could facilitate the clinical translation of deep learning in a wide range of diagnostic and prognostic applications.
引用
收藏
页数:9
相关论文
共 17 条
[1]  
[Anonymous], ABS160303925 CORR
[2]   MRI of hepatocellular carcinoma: an update of current practices [J].
Arif-Tiwari, Hina ;
Kalb, Bobby ;
Chundru, Surya ;
Sharma, Puneet ;
Costello, James ;
Guessner, Rainner W. ;
Martin, Diego R. .
DIAGNOSTIC AND INTERVENTIONAL RADIOLOGY, 2014, 20 (03) :209-221
[3]  
Chollet F., 2015, Keras
[4]  
Erhan D., 2009, U MONTREAL, V1341, P1, DOI DOI 10.2464/JILM.23.425
[5]  
Grewal M., 2017, ABS171004934 CORR
[6]  
Holzinger A., 2017, ARXIV171209923
[7]  
Ioffe S, 2015, 32 INT C MACH LEARN
[8]  
Kim S. T., 2018, ABS180508960 CORR
[9]  
Kingma DP, 2014, ARXIV
[10]  
Liu Han, 2007, AISTATS, P283, DOI DOI 10.1214/009053607000000811.508