Maximum Likelihood Estimation in Gaussian Process Regression is Ill-Posed

被引:0
作者
Karvonen, Toni [1 ]
Oates, Chris J. [2 ]
机构
[1] Univ Helsinki, Dept Math & Stat, PL 56 Pietari Kalmin katu 5, Helsinki 00014, Finland
[2] Newcastle Univ, Sch Math Stat & Phys, Newcastle Upon Tyne NE1 7RU, England
基金
芬兰科学院;
关键词
Gaussian processes; maximum likelihood estimation; ill-posedness; stationary kernels; KERNEL HILBERT-SPACES; ASYMPTOTIC PROPERTIES; CROSS-VALIDATION; HYPER-PARAMETERS; INTERPOLATION; CONVERGENCE; COVARIANCE; PREDICTION; APPROXIMATION; DESIGN;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Gaussian process regression underpins countless academic and industrial applications of machine learning and statistics, with maximum likelihood estimation routinely used to select appropriate parameters for the covariance kernel. However, it remains an open problem to establish the circumstances in which maximum likelihood estimation is well-posed, that is, when the predictions of the regression model are insensitive to small perturbations of the data. This article identifies scenarios where the maximum likelihood estimator fails to be well-posed, in that the predictive distributions are not Lipschitz in the data with respect to the Hellinger distance. These failure cases occur in the noiseless data setting, for any Gaussian process with a stationary covariance function whose lengthscale parameter is estimated using maximum likelihood. Although the failure of maximum likelihood estimation is part of Gaussian process folklore, these rigorous theoretical results appear to be the first of their kind. The implication of these negative results is that well-posedness may need to be assessed post-hoc, on a case-by-case basis, when maximum likelihood estimation is used to train a Gaussian process model.
引用
收藏
页数:47
相关论文
共 83 条
  • [31] Karvonen T, 2019, IEEE INT WORKS MACH
  • [32] Karvonen T, 2023, Arxiv, DOI arXiv:2203.05400
  • [33] Maximum Likelihood Estimation and Uncertainty Quantification for Gaussian Process Approximation of Deterministic Functions
    Karvonen, Toni
    Wynne, George
    Tronarp, Filip
    Oates, Chris
    Sarkka, Simo
    [J]. SIAM-ASA JOURNAL ON UNCERTAINTY QUANTIFICATION, 2020, 8 (03): : 926 - 958
  • [34] The role of the range parameter for estimation and prediction in geostatistics
    Kaufman, C. G.
    Shaby, B. A.
    [J]. BIOMETRIKA, 2013, 100 (02) : 473 - 484
  • [35] Bayesian calibration of computer models
    Kennedy, MC
    O'Hagan, A
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2001, 63 : 425 - 450
  • [36] A CORRESPONDENCE BETWEEN BAYESIAN ESTIMATION ON STOCHASTIC PROCESSES AND SMOOTHING BY SPLINES
    KIMELDOR.GS
    WAHBA, G
    [J]. ANNALS OF MATHEMATICAL STATISTICS, 1970, 41 (02): : 495 - &
  • [37] Kowalska K., 2012, 2012 15th International Conference on Information Fusion (FUSION 2012), P1164
  • [38] Design and analysis of computer experiments
    Kuhnt, Sonja
    Steinberg, David M.
    [J]. ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2010, 94 (04) : 307 - 309
  • [39] On the Well-posedness of Bayesian Inverse Problems
    Latz, Jonas
    [J]. SIAM-ASA JOURNAL ON UNCERTAINTY QUANTIFICATION, 2020, 8 (01): : 451 - 482
  • [40] Convergence of increasingly flat radial basis interpolants to polynomial interpolants
    Lee, Yeon Ju
    Yoon, Gang Joon
    Yoon, Jungho
    [J]. SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 2007, 39 (02) : 537 - 553