Fitting Matern smoothness parameters using automatic differentiation

被引:6
作者
Geoga, Christopher J. [1 ,2 ]
Marin, Oana [2 ]
Schanen, Michel [2 ]
Stein, Michael L. [1 ]
机构
[1] Rutgers State Univ, Dept Stat, New Brunswick, NJ 08901 USA
[2] Argonne Natl Lab, Math & Comp Sci Div, Lemont, IL 60439 USA
关键词
Matern covariance; Bessel functions; Gaussian processes; Maximum likelihood; Automatic differentiation; BESSEL-FUNCTIONS; DERIVATIVES; ARGUMENT; RESPECT;
D O I
10.1007/s11222-022-10127-w
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The Matern covariance function is ubiquitous in the application of Gaussian processes to spatial statistics and beyond. Perhaps the most important reason for this is that the smoothness parameter nu gives complete control over the mean-square differentiability of the process, which has significant implications for the behavior of estimated quantities such as interpolants and forecasts. Unfortunately, derivatives of the Matern covariance function with respect to nu require derivatives of the modified second-kind Bessel function K-nu with respect to nu. While closed form expressions of these derivatives do exist, they are prohibitively difficult and expensive to compute. For this reason, many software packages require fixing nu as opposed to estimating it, and all existing software packages that attempt to offer the functionality of estimating nu use finite difference estimates for partial derivative K-nu(nu). In this work, we introduce a new implementation of K-nu that has been designed to provide derivatives via automatic differentiation (AD), and whose resulting derivatives are significantly faster and more accurate than those computed using finite differences. We provide comprehensive testing for both speed and accuracy and show that our AD solution can be used to build accurate Hessian matrices for second-order maximum likelihood estimation in settings where Hessians built with finite difference approximations completely fail.
引用
收藏
页数:16
相关论文
共 36 条
[1]  
Abramowitz M., 1966, Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables, V5th
[2]   A PORTABLE PACKAGE FOR BESSEL-FUNCTIONS OF A COMPLEX ARGUMENT AND NONNEGATIVE ORDER [J].
AMOS, DE .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1986, 12 (03) :265-273
[3]   THE IF-PROBLEM IN AUTOMATIC DIFFERENTIATION [J].
BECK, T ;
FISCHER, H .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 1994, 50 (1-3) :119-131
[4]   AUTOMATIC DIFFERENTIATION OF ITERATIVE PROCESSES [J].
BECK, T .
JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 1994, 50 (1-3) :109-118
[5]   Julia: A Fresh Approach to Numerical Computing [J].
Bezanson, Jeff ;
Edelman, Alan ;
Karpinski, Stefan ;
Shah, Viral B. .
SIAM REVIEW, 2017, 59 (01) :65-98
[6]   On the derivatives of the Bessel and Struve functions with respect to the order [J].
Brychkov, YA ;
Geddes, KO .
INTEGRAL TRANSFORMS AND SPECIAL FUNCTIONS, 2005, 16 (03) :187-198
[7]  
EFRON B, 1978, BIOMETRIKA, V65, P457, DOI 10.2307/2335893
[8]  
Fike J.A., 2012, Recent Advances in Algorithmic Differentiation, P163, DOI DOI 10.1007/978-3-642-30023-3_15
[9]  
Gardner J., Advances in Neural Information Processing Systems, V31
[10]   Scalable Gaussian Process Computations Using Hierarchical Matrices [J].
Geoga, Christopher J. ;
Anitescu, Mihai ;
Stein, Michael L. .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2020, 29 (02) :227-237