Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks

被引:399
作者
Wang, Guotai [1 ,2 ,3 ]
Li, Wenqi [1 ,2 ]
Aertsen, Michael [4 ]
Deprest, Jan [1 ,4 ,5 ,6 ]
Ourselin, Sebastien [2 ]
Vercauteren, Tom [1 ,2 ,6 ]
机构
[1] UCL, Wellcome EPSRC Ctr Intervent & Surg Sci, London, England
[2] Kings Coll London, Sch Biomed Engn & Imaging Sci, London, England
[3] Univ Elect Sci & Technol China, Sch Mech & Elect Engn, Chengdu, Sichuan, Peoples R China
[4] Univ Hosp Leuven, Dept Radiol, Leuven, Belgium
[5] UCL, Inst Womens Hlth, London, England
[6] Univ Hosp Leuven, Dept Obstet & Gynaecol, Leuven, Belgium
基金
英国工程与自然科学研究理事会; 英国惠康基金;
关键词
Uncertainty estimation; Convolutional neural networks; Medical image segmentation; Data augmentation;
D O I
10.1016/j.neucom.2019.01.103
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite the state-of-the-art performance for medical image segmentation, deep convolutional neural networks (CNNs) have rarely provided uncertainty estimations regarding their segmentation outputs, e.g., model (epistemic) and image-based (aleatoric) uncertainties. In this work, we analyze these different types of uncertainties for CNN-based 2D and 3D medical image segmentation tasks at both pixel level and structure level. We additionally propose a test-time augmentation-based aleatoric uncertainty to analyze the effect of different transformations of the input image on the segmentation output. Test-time augmentation has been previously used to improve segmentation accuracy, yet not been formulated in a consistent mathematical framework. Hence, we also propose a theoretical formulation of test-time augmentation, where a distribution of the prediction is estimated by Monte Carlo simulation with prior distributions of parameters in an image acquisition model that involves image transformations and noise. We compare and combine our proposed aleatoric uncertainty with model uncertainty. Experiments with segmentation of fetal brains and brain tumors from 2D and 3D Magnetic Resonance Images (MRI) showed that 1) the test-time augmentation-based aleatoric uncertainty provides a better uncertainty estimation than calculating the test-time dropout-based model uncertainty alone and helps to reduce overconfident incorrect predictions, and 2) our test-time augmentation outperforms a single-prediction baseline and dropout-based multiple predictions. (C) 2019 The Authors. Published by Elsevier B.V.
引用
收藏
页码:34 / 45
页数:12
相关论文
共 44 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2016, ARXIV160601100
[3]  
[Anonymous], ARXIV180106879
[4]  
[Anonymous], 2015, PROC CVPR IEEE
[5]  
[Anonymous], 2017, MURA LARGE DATASET A
[6]  
[Anonymous], 2011, P ADV NEUR INF PROC
[7]  
[Anonymous], 2018, ARXIV180206455
[8]  
[Anonymous], 2017, ARXIV171204440
[9]  
Ayhan M.S., 2018, Medical Imaging with Deep Learning
[10]   Data Descriptor: Advancing The Cancer Genome Atlas glioma MRI collections with expert segmentation labels and radiomic features [J].
Bakas, Spyridon ;
Akbari, Hamed ;
Sotiras, Aristeidis ;
Bilello, Michel ;
Rozycki, Martin ;
Kirby, Justin S. ;
Freymann, John B. ;
Farahani, Keyvan ;
Davatzikos, Christos .
SCIENTIFIC DATA, 2017, 4