Dropout injection at test time for post hoc uncertainty quantification in neural networks

被引:10
作者
Ledda, Emanuele [1 ,2 ]
Fumera, Giorgio [3 ]
Roli, Fabio [2 ]
机构
[1] Sapienza Univ Rome, Dept Comp Control & Management Engn, Via Ariosto 25, I-00185 Rome, Italy
[2] Univ Genoa, Dept Informat Bioengn Robot & Syst Engn, Via Dodecaneso 35, I-16146 Genoa, Italy
[3] Univ Cagliari, Dept Elect & Elect Engn, Via Marengo 3, I-09100 Cagliari, Italy
基金
欧盟地平线“2020”;
关键词
Crowd counting; Epistemic uncertainty; Monte Carlo dropout; Trustworthy AI; Uncertainty quantification; FRAMEWORK;
D O I
10.1016/j.ins.2023.119356
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Among Bayesian methods, Monte Carlo dropout provides principled tools for evaluating the epistemic uncertainty of neural networks. Its popularity recently led to seminal works that proposed activating the dropout layers only during inference for evaluating epistemic uncertainty. This approach, which we call dropout injection, provides clear benefits over its traditional counterpart (which we call embedded dropout) since it allows one to obtain a post hoc uncertainty measure for any existing network previously trained without dropout, avoiding an additional, time-consuming training process. Unfortunately, no previous work thoroughly analyzed injected dropout and compared it with embedded dropout; therefore, we provide a first comprehensive investigation, focusing on regression problems. We show that the effectiveness of dropout injection strongly relies on a suitable scaling of the corresponding uncertainty measure, and propose an alternative method to implement it. We also considered the trade-off between negative log-likelihood and calibration error as a function of the scale factor. Experimental results on benchmark data sets from several regression tasks, including crowd counting, support our claim that dropout injection can effectively behave as a competitive post hoc alternative to embedded dropout.
引用
收藏
页数:15
相关论文
共 30 条
[1]   A review of uncertainty quantification in deep learning: Techniques, applications and challenges [J].
Abdar, Moloud ;
Pourpanah, Farhad ;
Hussain, Sadiq ;
Rezazadegan, Dana ;
Liu, Li ;
Ghavamzadeh, Mohammad ;
Fieguth, Paul ;
Cao, Xiaochun ;
Khosravi, Abbas ;
Acharya, U. Rajendra ;
Makarenkov, Vladimir ;
Nahavandi, Saeid .
INFORMATION FUSION, 2021, 76 :243-297
[2]  
Blundell C, 2015, PR MACH LEARN RES, V37, P1613
[3]  
Boyen X., 1998, Uncertainty in Artificial Intelligence. Proceedings of the Fourteenth Conference (1998), P33
[4]   Privacy preserving crowd monitoring: Counting people without people models or tracking [J].
Chan, Antoni B. ;
Liang, Zhang-Sheng John ;
Vasconcelos, Nuno .
2008 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-12, 2008, :1766-1772
[5]   Feature Mining for Localised Crowd Counting [J].
Chen, Ke ;
Loy, Chen Change ;
Gong, Shaogang ;
Xiang, Tao .
PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2012, 2012,
[6]   Investigating Synthetic Data Sets for Crowd Counting in Cross-scene Scenarios [J].
Delussu, Rita ;
Putzu, Lorenzo ;
Fumera, Giorgio .
VISAPP: PROCEEDINGS OF THE 15TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS, VOL 4: VISAPP, 2020, :365-372
[7]  
Ferris J, 2009, PALG STUD THEAT PERF, P1
[8]  
Gal Y., 2016, UNCERTAINTY DEEP LEA
[9]  
Gal Y, 2017, ADV NEUR IN, V30
[10]  
Gal Y, 2016, PR MACH LEARN RES, V48