MONK - Outlier-Robust Mean Embedding Estimation by Median-of-Means

Cited by: 0
Authors
Lerasle, Matthieu [1, 2]
Szabo, Zoltan [3]
Mathieu, Timothee [1]
Lecue, Guillaume [4]
Affiliations
[1] Univ Paris Sud, Lab Math Orsay, Paris, France
[2] Univ Paris Saclay, CNRS, Paris, France
[3] Ecole Polytech, CMAP, Palaiseau, France
[4] CREST ENSAE ParisTech, Paris, France
Keywords
KERNELS; METRICS
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Mean embeddings provide an extremely flexible and powerful tool in machine learning and statistics to represent probability distributions and define a semi-metric (MMD, maximum mean discrepancy; also called N-distance or energy distance), with numerous successful applications. The representation is constructed as the expectation of the feature map defined by a kernel. Because it is a mean, however, its classical empirical estimator can be arbitrarily severely affected by even a single outlier in the case of unbounded features. To the best of our knowledge, even the consistency of the few existing techniques that try to alleviate this serious sensitivity bottleneck is unknown. In this paper, we show how the recently emerged principle of median-of-means can be used to design estimators for kernel mean embedding and MMD with strong resistance to outliers and optimal sub-Gaussian deviation bounds under mild assumptions.
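The median-of-means principle named in the abstract can be illustrated on the simplest case. The sketch below is not the paper's MONK estimator: it assumes one-dimensional data and the linear kernel k(x, y) = xy, for which the feature map is the identity (an unbounded feature) and the mean embedding reduces to the ordinary mean. The function name `median_of_means` and the parameters `n_blocks` and `seed` are hypothetical, chosen only for this illustration.

```python
import numpy as np


def median_of_means(x, n_blocks, seed=None):
    """Median of block-wise means of a 1-D sample (illustrative sketch).

    The sample is shuffled, split into n_blocks nearly equal blocks,
    averaged within each block, and the median of those averages is returned.
    """
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    shuffled = x[rng.permutation(x.shape[0])]
    blocks = np.array_split(shuffled, n_blocks)
    block_means = [block.mean() for block in blocks]
    return float(np.median(block_means))


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.normal(loc=0.0, scale=1.0, size=1000)
    # A single gross outlier is enough to ruin the empirical mean.
    contaminated = np.append(clean, 1e6)
    print("empirical mean :", contaminated.mean())
    print("median-of-means:", median_of_means(contaminated, n_blocks=11, seed=0))
```

With a single outlier, at most one of the 11 blocks is contaminated, so the median of the block means stays close to the true mean of 0, while the empirical mean is pulled far away. The number of blocks bounds how many outliers this simple scheme can absorb: it needs more than half of the blocks to remain clean.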
Pages: 12
Related papers
41 in total
  • [1] Graph Embedding with Outlier-Robust Ratio Estimation
    Satta, Kaito
    Sasaki, Hiroaki
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2022, E105D (10) : 1812 - 1816
  • [2] Robust Kernel Density Estimation with Median-of-Means principle
    Humbert, Pierre
    Le Bars, Batiste
    Minvielle, Ludovic
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
  • [3] Outlier-Robust Sparse Mean Estimation for Heavy-Tailed Distributions
    Diakonikolas, Ilias
    Kane, Daniel M.
    Lee, Jasper C. H.
    Pensia, Ankit
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [4] DeepMoM: Robust Deep Learning With Median-of-Means
    Huang, Shih-Ting
    Lederer, Johannes
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (01) : 181 - 195
  • [5] ROBUST MACHINE LEARNING BY MEDIAN-OF-MEANS: THEORY AND PRACTICE
    Lecue, Guillaume
    Lerasle, Matthieu
    ANNALS OF STATISTICS, 2020, 48 (02) : 906 - 931
  • [6] Outlier-Robust State Estimation for Humanoid Robots
    Piperakis, Stylianos
    Kanoulas, Dimitrios
    Tsagarakis, Nikos G.
    Trahanias, Panos
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 706 - 713
  • [7] Robust Clustered Federated Learning with Bootstrap Median-of-Means
    Xie, Ming
    Ma, Jie
    Long, Guodong
    Zhang, Chengqi
    WEB AND BIG DATA, PT I, APWEB-WAIM 2022, 2023, 13421 : 237 - 250
  • [8] Efficient and Robust Median-of-Means Algorithms for Location and Regression
    Kogler, Alexander
    Traxler, Patrick
    PROCEEDINGS OF 2016 18TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC), 2016, : 206 - 213
  • [9] Outlier-robust spectral estimation for spatial lattice processes
    Nirel, R
    Mugglestone, MA
    Barnett, V
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1998, 27 (12) : 3095 - 3111
  • [10] On the Convergence of IRLS and Its Variants in Outlier-Robust Estimation
    Peng, Liangzu
    Kummerle, Christian
    Vidal, Rene
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 17808 - 17818