Fast and Scalable Score-Based Kernel Calibration Tests

被引:0
|
作者
Glaser, Pierre [1 ]
Widmann, David [2 ]
Lindsten, Fredrik [3 ]
Gretton, Arthur [1 ]
机构
[1] UCL, Gatsby Computat Neurosci Unit, London, England
[2] Uppsala Univ, Dept Informat Technol, Uppsala, Sweden
[3] Linkoping Univ, Div Stat & Machine Learning, Linkoping, Sweden
来源
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE | 2023年 / 216卷
基金
瑞典研究理事会;
关键词
RELIABILITY; MODEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce the Kernel Calibration Conditional Stein Discrepancy test (KCCSD test), a non-parametric, kernel-based test for assessing the calibration of probabilistic models with well-defined scores. In contrast to previous methods, our test avoids the need for possibly expensive expectation approximations while providing control over its type-I error. We achieve these improvements by using a new family of kernels for score-based probabilities that can be estimated without probability density samples, and by using a conditional goodness-of-fit criterion for the KCCSD test's U-statistic. We demonstrate the properties of our test on various synthetic settings.
引用
收藏
页码:691 / 700
页数:10
相关论文
共 50 条
  • [1] Score-based Tests for Item Factor Analysis
    Wang, Ting
    MULTIVARIATE BEHAVIORAL RESEARCH, 2015, 50 (06) : 728 - 729
  • [2] Fast FILTERSIM simulation with score-based distance
    Wu, Jianbing
    Zhang, Tuanfeng
    Journel, Andre
    MATHEMATICAL GEOSCIENCES, 2008, 40 (07) : 773 - 788
  • [3] Fast FILTERSIM Simulation with Score-based Distance
    Jianbing Wu
    Tuanfeng Zhang
    André Journel
    Mathematical Geosciences, 2008, 40 : 773 - 788
  • [4] Score-based tests of measurement invariance: use in practice
    Wang, Ting
    Merkle, Edgar C.
    Zeileis, Achim
    FRONTIERS IN PSYCHOLOGY, 2014, 5
  • [5] Score-based tests for detecting heterogeneity in linear mixed models
    Ting Wang
    Edgar C. Merkle
    Joaquin A. Anguera
    Brandon M. Turner
    Behavior Research Methods, 2021, 53 : 216 - 231
  • [6] Score-based tests for detecting heterogeneity in linear mixed models
    Wang, Ting
    Merkle, Edgar C.
    Anguera, Joaquin A.
    Turner, Brandon M.
    BEHAVIOR RESEARCH METHODS, 2021, 53 (01) : 216 - 231
  • [7] Fast and Reliable Score-Based Generative Model for Parallel MRI
    Hou, Ruizhi
    Li, Fang
    Zeng, Tieyong
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 953 - 966
  • [8] Fast Sampling of Score-Based Models With Cyclical Diffusion Sampling
    Makhtidi, Karimul
    Bustamam, Alhadi
    Adnan, Risman
    Robbani, Hanif Amal
    Mangunwardoyo, Wibowo
    Khan, Mohammad Asif
    IEEE ACCESS, 2024, 12 : 49578 - 49589
  • [9] An R toolbox for score-based measurement invariance tests in IRT models
    Lennart Schneider
    Carolin Strobl
    Achim Zeileis
    Rudolf Debelak
    Behavior Research Methods, 2022, 54 : 2101 - 2113
  • [10] An R toolbox for score-based measurement invariance tests in IRT models
    Schneider, Lennart
    Strobl, Carolin
    Zeileis, Achim
    Debelak, Rudolf
    BEHAVIOR RESEARCH METHODS, 2022, 54 (05) : 2101 - 2113