Robust PACm: Training Ensemble Models Under Misspecification and Outliers

被引:3
作者
Zecchin, Matteo [1 ]
Park, Sangwoo [1 ]
Simeone, Osvaldo [1 ]
Kountouris, Marios [2 ]
Gesbert, David [2 ]
机构
[1] Kings Coll London, Dept Engn, Kings Commun Learning & Informat Proc KCLIP Lab, London WC2R 2LS, England
[2] EURECOM, Commun Syst Dept, F-06410 Sophia Antipolis, France
基金
英国工程与自然科学研究理事会;
关键词
Bayes methods; Pollution measurement; Standards; Europe; Training; Robustness; Predictive models; Bayesian learning; ensemble models; machine learning; misspecification; outliers; robustness; BAYESIAN-INFERENCE;
D O I
10.1109/TNNLS.2023.3295168
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Standard Bayesian learning is known to have suboptimal generalization capabilities under misspecification and in the presence of outliers. Probably approximately correct (PAC)-Bayes theory demonstrates that the free energy criterion minimized by Bayesian learning is a bound on the generalization error for Gibbs predictors (i.e., for single models drawn at random from the posterior) under the assumption of sampling distributions uncontaminated by outliers. This viewpoint provides a justification for the limitations of Bayesian learning when the model is misspecified, requiring ensembling, and when data are affected by outliers. In recent work, PAC-Bayes bounds-referred to as PACm-were derived to introduce free energy metrics that account for the performance of ensemble predictors, obtaining enhanced performance under misspecification. This work presents a novel robust free energy criterion that combines the generalized logarithm score function with PACm ensemble bounds. The proposed free energy training criterion produces predictive distributions that are able to concurrently counteract the detrimental effects of misspecification-with respect to both likelihood and prior distribution-and outliers.
引用
收藏
页码:16518 / 16532
页数:15
相关论文
共 50 条
[41]   Robust Bayesian Recursive Ensemble Kalman Filter Under the Nonstationary Heavy-Tailed Noise [J].
Wang, Li ;
Chen, Hui ;
Lian, Feng ;
Zhang, Wenxu ;
Liu, Jiabin .
IEEE SENSORS JOURNAL, 2025, 25 (01) :749-762
[42]   Comparison of robustness to outliers between robust poisson models and log-binomial models when estimating relative risks for common binary outcomes: a simulation study [J].
Chen, Wansu ;
Shi, Jiaxiao ;
Qian, Lei ;
Azen, Stanley P. .
BMC MEDICAL RESEARCH METHODOLOGY, 2014, 14
[43]   Comparison of robustness to outliers between robust poisson models and log-binomial models when estimating relative risks for common binary outcomes: a simulation study [J].
Wansu Chen ;
Jiaxiao Shi ;
Lei Qian ;
Stanley P Azen .
BMC Medical Research Methodology, 14
[44]   REDIBAGG: Reducing the training set size in ensemble machine learning-based prediction models [J].
Silva-Ramirez, Esther-Lydia ;
Cabrera-Sanchez, Juan-Francisco ;
Lopez-Coello, Manuel .
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2025, 149
[45]   Latent variable models under misspecification - Two-stage least squares (2SLS) and maximum likelihood (ML) estimators [J].
Bollen, Kenneth A. ;
Kirby, James B. ;
Curran, Patrick J. ;
Paxton, Pamela M. ;
Chen, Feinian .
SOCIOLOGICAL METHODS & RESEARCH, 2007, 36 (01) :48-86
[46]   Robust Architecture-Agnostic and Noise Resilient Training of Photonic Deep Learning Models [J].
Kirtas, Manos ;
Passalis, Nikolaos ;
Mourgias-Alexandris, George ;
Dabos, George ;
Pleros, Nikos ;
Tefas, Anastasios .
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 7 (01) :140-149
[47]   A fast identification algorithm with outliers under Box-Cox transformation-based annealing robust radial basis function networks [J].
Chen P.-Y. ;
Wu C.-J. ;
Ko C.-N. ;
Jeng J.-T. .
Artificial Life and Robotics, 2009, 14 (1) :62-66
[48]   Robust parameter estimation for one-inflated positive Poisson Lindley distribution under the presence and absence of outliers with applications to crime data [J].
Tajuddin, Razik Ridzuan Mohd ;
Safari, Muhammad Aslam Mohd ;
Ismail, Noriszura .
PAKISTAN JOURNAL OF STATISTICS AND OPERATION RESEARCH, 2024, 20 (03) :369-381
[49]   Robust diabetic prediction using ensemble machine learning models with synthetic minority over-sampling technique [J].
Sampath, Pradeepa ;
Elangovan, Gurupriya ;
Ravichandran, Kaaveya ;
Shanmuganathan, Vimal ;
Pasupathi, Subbulakshmi ;
Chakrabarti, Tulika ;
Chakrabarti, Prasun ;
Margala, Martin .
SCIENTIFIC REPORTS, 2024, 14 (01)
[50]   ROBUST KALMAN FILTER AND SMOOTHER FOR ERRORS-IN-VARIABLES STATE SPACE MODELS WITH OBSERVATION OUTLIERS BASED ON THE MINIMUM-COVARIANCE DETERMINANT ESTIMATOR [J].
Almutawa, Jaafar .
ASIAN JOURNAL OF CONTROL, 2011, 13 (04) :513-521