Revisiting the Calibration of Modern Neural Networks

被引:0
|
作者
Minderer, Matthias [1 ]
Djolonga, Josip [1 ]
Romijnders, Rob [1 ]
Hubis, Frances [1 ]
Zhai, Xiaohua [1 ]
Houlsby, Neil [1 ]
Tran, Dustin [1 ]
Lucic, Mario [1 ]
机构
[1] Google Res, Brain Team, Mountain View, CA 94043 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021) | 2021年 / 34卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurate estimation of predictive uncertainty (model calibration) is essential for the safe application of neural networks. Many instances of miscalibration in modern neural networks have been reported, suggesting a trend that newer, more accurate models produce poorly calibrated predictions. Here, we revisit this question for recent state-of-the-art image classification models. We systematically relate model calibration and accuracy, and find that the most recent models, notably those not using convolutions, are among the best calibrated. Trends observed in prior model generations, such as decay of calibration with distribution shift or model size, are less pronounced in recent architectures. We also show that model size and amount of pretraining do not fully explain these differences, suggesting that architecture is a major determinant of calibration properties.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] On Calibration of Modern Neural Networks
    Guo, Chuan
    Pleiss, Geoff
    Sun, Yu
    Weinberger, Kilian Q.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
  • [2] Modern practical convolutional neural networks for multivariate regression: Applications to NIR calibration
    Cui, Chenhao
    Fearn, Tom
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 182 : 9 - 20
  • [3] Revisiting Heterophily For Graph Neural Networks
    Luan, Sitao
    Hua, Chenqing
    Lu, Qincheng
    Zhu, Jiaqi
    Zhao, Mingde
    Zhang, Shuyuan
    Chang, Xiao-Wen
    Precup, Doina
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
  • [4] Modern Neural Networks
    Najmi, Amir-Homayoon
    Emmanuel, Patrick R.
    Moon, Todd K.
    JOHNS HOPKINS APL TECHNICAL DIGEST, 2022, 36 (01): : 14 - 34
  • [5] Modern Neural Networks
    Najmi, Amir-Homayoon
    Emmanuel, Patrick R.
    Moon, Todd K.
    Johns Hopkins APL Technical Digest (Applied Physics Laboratory), 2022, 36 (01): : 14 - 34
  • [6] Revisiting multiple instance neural networks
    Wang, Xinggang
    Yan, Yongluan
    Tang, Peng
    Bai, Xiang
    Liu, Wenyu
    PATTERN RECOGNITION, 2018, 74 : 15 - 24
  • [7] Neural networks in multivariate calibration
    Despagne, F
    Massart, DL
    ANALYST, 1998, 123 (11) : 157R - 178R
  • [8] Camera Calibration with Neural Networks
    Tian, Zhen
    Xiong, Jiulong
    Zhang, Qi
    APPLIED MECHANICS AND MECHANICAL ENGINEERING, PTS 1-3, 2010, 29-32 : 2762 - 2767
  • [9] Revisiting N-Gram Models: Their Impact in Modern Neural Networks for Handwritten Text Recognition
    Tarride, Solene
    Kermorvant, Christopher
    DOCUMENT ANALYSIS AND RECOGNITION-ICDAR 2024, PT VI, 2024, 14809 : 167 - 182
  • [10] Revisiting Edge Detection in Convolutional Neural Networks
    Minh Le
    Kayal, Subhradeep
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,