Statistical Robustness of Empirical Risks in Machine Learning

被引:0
|
作者
Guo, Shaoyan [1 ]
Xu, Huifu [2 ]
Zhang, Liwei [1 ]
机构
[1] Dalian Univ Technol, Sch Math Sci, Dalian 116024, Peoples R China
[2] Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Shatin, Hong Kong, Peoples R China
基金
国家重点研发计划;
关键词
Empirical risks; stability analysis; asymptotic qualitative statistical robust-ness; non-asymptotic quantitative statistical robustness; uniform consistency; UNIFORM-CONVERGENCE; OPTIMAL RATES; REGULARIZATION; LEARNABILITY; STABILITY; SELECTION; WEAK;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies convergence of empirical risks in reproducing kernel Hilbert spaces (RKHS). A conventional assumption in the existing research is that empirical training data are generated by the unknown true probability distribution but this may not be satisfied in some practical circumstances. Consequently the existing convergence results may not provide a guarantee as to whether the empirical risks are reliable or not when the data are potentially corrupted (generated by a distribution perturbed from the true). In this paper, we fill out the gap from robust statistics perspective (Kra center dot tschmer, Schied and Za center dot hle (2012); Kra center dot tschmer, Schied and Za center dot hle (2014); Guo and Xu (2020)). First, we derive moderate sufficient conditions under which the expected risk changes stably (continuously) against small perturbation of the probability distributions of the underlying random variables and demonstrate how the cost function and kernel affect the stability. Second, we examine the difference between laws of the statistical estimators of the expected optimal loss based on pure data and contaminated data using Prokhorov metric and Kantorovich metric, and derive some asymptotic qualitative and non-asymptotic quantitative statistical robustness results. Third, we identify appropriate metrics under which the statistical estimators are uniformly asymptotically consistent. These results provide theoretical grounding for analysing asymptotic convergence and examining reliability of the statistical estimators in a number of regression models.
引用
收藏
页数:38
相关论文
共 50 条
  • [1] A review on statistical and machine learning competing risks methods
    Monterrubio-Gomez, Karla
    Constantine-Cooke, Nathan
    Vallejos, Catalina A.
    BIOMETRICAL JOURNAL, 2024, 66 (02)
  • [2] Statistical and Machine Learning Approach in Forex Prediction Based on Empirical Data
    Sidehabi, Sitti Wetenriajeng
    Indrabayu
    Tandungan, Sofyan
    2016 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND CYBERNETICS, 2016, : 63 - 68
  • [3] Empirical Validation of Website Quality Using Statistical and Machine Learning Methods
    Dhiman, Poonam
    Anjali
    2014 5TH INTERNATIONAL CONFERENCE CONFLUENCE THE NEXT GENERATION INFORMATION TECHNOLOGY SUMMIT (CONFLUENCE), 2014, : 286 - 291
  • [4] Statistical models versus machine learning approach for competing risks in proctological surgery
    Romano, Lucia
    Manno, Andrea
    Rossi, Fabrizio
    Masedu, Francesco
    Attanasio, Margherita
    Vistoli, Fabio
    Giuliani, Antonio
    UPDATES IN SURGERY, 2025, : 333 - 341
  • [5] An empirical analysis of homicides in Mexico through Machine Learning and statistical design of experiments
    Silva Urrutia, Jose Eliud
    Villalobos, Miguel A.
    POBLACION Y SALUD EN MESOAMERICA, 2022, 20 (01):
  • [6] Software Estimation in the Design Stage with Statistical Models and Machine Learning: An Empirical Study
    Sanchez-Garcia, Angel J.
    Gonzalez-Hernandez, Maria Saarayim
    Cortes-Verdin, Karen
    Perez-Arriaga, Juan Carlos
    MATHEMATICS, 2024, 12 (07)
  • [7] Empirical Study on Robustness of Machine Learning Approaches for Fault Diagnosis under Railway Operational Conditions
    Shi, Dachuan
    Ye, Yunguang
    Gillwald, Marco
    Hecht, Markus
    2020 IEEE 23RD INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC), 2020,
  • [8] Local-contrastive-learning machine with both generalization and adversarial robustness: A statistical physics analysis
    Xie, Mingshan
    Wang, Yuchen
    Huang, Haiping
    SCIENCE CHINA-PHYSICS MECHANICS & ASTRONOMY, 2025, 68 (01)
  • [9] Local-contrastive-learning machine with both generalization and adversarial robustness: A statistical physics analysis
    Mingshan Xie
    Yuchen Wang
    Haiping Huang
    Science China(Physics,Mechanics & Astronomy), 2025, (01) : 109 - 123
  • [10] Machine Learning Robustness, Fairness, and their Convergence
    Lee, Jae-Gil
    Roh, Yuji
    Song, Hwanjun
    Whang, Steven Euijong
    KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2021, : 4046 - 4047