Statistical Robustness of Empirical Risks in Machine Learning

被引：0

作者：

Guo, Shaoyan ^{[1
]}

Xu, Huifu ^{[2
]}

Zhang, Liwei ^{[1
]}

机构：

[1] Dalian Univ Technol, Sch Math Sci, Dalian 116024, Peoples R China

[2] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Shatin, Hong Kong, Peoples R China

来源：

JOURNAL OF MACHINE LEARNING RESEARCH | 2023年 / 24卷

基金：

国家重点研发计划;

关键词：

Empirical risks; stability analysis; asymptotic qualitative statistical robust-ness; non-asymptotic quantitative statistical robustness; uniform consistency; UNIFORM-CONVERGENCE; OPTIMAL RATES; REGULARIZATION; LEARNABILITY; STABILITY; SELECTION; WEAK;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper studies convergence of empirical risks in reproducing kernel Hilbert spaces (RKHS). A conventional assumption in the existing research is that empirical training data are generated by the unknown true probability distribution but this may not be satisfied in some practical circumstances. Consequently the existing convergence results may not provide a guarantee as to whether the empirical risks are reliable or not when the data are potentially corrupted (generated by a distribution perturbed from the true). In this paper, we fill out the gap from robust statistics perspective (Kra center dot tschmer, Schied and Za center dot hle (2012); Kra center dot tschmer, Schied and Za center dot hle (2014); Guo and Xu (2020)). First, we derive moderate sufficient conditions under which the expected risk changes stably (continuously) against small perturbation of the probability distributions of the underlying random variables and demonstrate how the cost function and kernel affect the stability. Second, we examine the difference between laws of the statistical estimators of the expected optimal loss based on pure data and contaminated data using Prokhorov metric and Kantorovich metric, and derive some asymptotic qualitative and non-asymptotic quantitative statistical robustness results. Third, we identify appropriate metrics under which the statistical estimators are uniformly asymptotically consistent. These results provide theoretical grounding for analysing asymptotic convergence and examining reliability of the statistical estimators in a number of regression models.

引用

页数：38

共 50 条

[1] A review on statistical and machine learning competing risks methods
Monterrubio-Gomez, Karla
Constantine-Cooke, Nathan
Vallejos, Catalina A.
BIOMETRICAL JOURNAL, 2024, 66 (02)
[2] Statistical and Machine Learning Approach in Forex Prediction Based on Empirical Data
Sidehabi, Sitti Wetenriajeng
Indrabayu
Tandungan, Sofyan
2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND CYBERNETICS, 2016, : 63 - 68
[3] Empirical Validation of Website Quality Using Statistical and Machine Learning Methods
Dhiman, Poonam
Anjali
2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 286 - 291
[4] Statistical models versus machine learning approach for competing risks in proctological surgery
Romano, Lucia
Manno, Andrea
Rossi, Fabrizio
Masedu, Francesco
Attanasio, Margherita
Vistoli, Fabio
Giuliani, Antonio
UPDATES IN SURGERY, 2025, : 333 - 341
[5] An empirical analysis of homicides in Mexico through Machine Learning and statistical design of experiments
Silva Urrutia, Jose Eliud
Villalobos, Miguel A.
POBLACION Y SALUD EN MESOAMERICA, 2022, 20 (01):
[6] Software Estimation in the Design Stage with Statistical Models and Machine Learning: An Empirical Study
Sanchez-Garcia, Angel J.
Gonzalez-Hernandez, Maria Saarayim
Cortes-Verdin, Karen
Perez-Arriaga, Juan Carlos
MATHEMATICS, 2024, 12 (07)
[7] Empirical Study on Robustness of Machine Learning Approaches for Fault Diagnosis under Railway Operational Conditions
Shi, Dachuan
Ye, Yunguang
Gillwald, Marco
Hecht, Markus
2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
[8] Local-contrastive-learning machine with both generalization and adversarial robustness: A statistical physics analysis
Xie, Mingshan
Wang, Yuchen
Huang, Haiping
SCIENCE CHINA-PHYSICS MECHANICS & ASTRONOMY, 2025, 68 (01)
[9] Local-contrastive-learning machine with both generalization and adversarial robustness: A statistical physics analysis
Mingshan Xie
Yuchen Wang
Haiping Huang
Science China(Physics,Mechanics & Astronomy), 2025, (01) : 109 - 123
[10] Machine Learning Robustness, Fairness, and their Convergence
Lee, Jae-Gil
Roh, Yuji
Song, Hwanjun
Whang, Steven Euijong
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4046 - 4047

← 1 2 3 4 5 →