HatchEnsemble: an efficient and practical uncertainty quantification method for deep neural networks

被引:0
作者
Yufeng Xia
Jun Zhang
Tingsong Jiang
Zhiqiang Gong
Wen Yao
Ling Feng
机构
[1] National University of Defense Technology,College of Aerospace Science and Engineering
[2] Tsinghua University,Department of Computer Science and Technology
[3] Chinese Academy of Military Science,National Innovation Institute of Defense Technology
来源
Complex & Intelligent Systems | 2021年 / 7卷
关键词
Ensemble learning; Deep neural networks; Non-Bayesian method; Uncertainty quantification;
D O I
暂无
中图分类号
学科分类号
摘要
Quantifying predictive uncertainty in deep neural networks is a challenging and yet unsolved problem. Existing quantification approaches can be categorized into two lines. Bayesian methods provide a complete uncertainty quantification theory but are often not scalable to large-scale models. Along another line, non-Bayesian methods have good scalability and can quantify uncertainty with high quality. The most remarkable idea in this line is Deep Ensemble, but it is limited in practice due to its expensive computational cost. Thus, we propose HatchEnsemble to improve the efficiency and practicality of Deep Ensemble. The main idea is to use function-preserving transformations, ensuring HatchNets to inherit the knowledge learned by a single model called SeedNet. This process is called hatching, and HatchNet can be obtained by continuously widening the SeedNet. Based on our method, two different hatches are proposed, respectively, for ensembling the same and different architecture networks. To ensure the diversity of models, we also add random noises to parameters during hatching. Experiments on both clean and corrupted datasets show that HatchEnsemble can give a competitive prediction performance and better-calibrated uncertainty quantification in a shorter time compared with baselines.
引用
收藏
页码:2855 / 2869
页数:14
相关论文
共 40 条
[11]  
Ar Mohamed(1950)Verification of forecasts expressed in terms of probability Mon Weather Rev 78 1-3
[12]  
Jaitly N(2020)On power laws in deep ensembles Adv Neural Inf Process Syst 33 2375-2385
[13]  
Senior A(undefined)undefined undefined undefined undefined-undefined
[14]  
Vanhoucke V(undefined)undefined undefined undefined undefined-undefined
[15]  
Nguyen P(undefined)undefined undefined undefined undefined-undefined
[16]  
Sainath TN(undefined)undefined undefined undefined undefined-undefined
[17]  
Alipanahi B(undefined)undefined undefined undefined undefined-undefined
[18]  
Delong A(undefined)undefined undefined undefined undefined-undefined
[19]  
Weirauch MT(undefined)undefined undefined undefined undefined-undefined
[20]  
Frey BJ(undefined)undefined undefined undefined undefined-undefined