Exact Gaussian Process Regression with Distributed Computations

被引:20
作者
Duc-Trung Nguyen [1 ]
Filippone, Maurizio [1 ]
Michiardi, Pietro [1 ]
机构
[1] EURECOM, Campus Sophia Tech, Biot, France
来源
SAC '19: PROCEEDINGS OF THE 34TH ACM/SIGAPP SYMPOSIUM ON APPLIED COMPUTING | 2019年
关键词
Regression; Matrix Factorization; Distributed computing; PARALLEL CHOLESKY FACTORIZATION; ALGORITHMS; SET;
D O I
10.1145/3297280.3297409
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Gaussian Processes (GPs) are powerful non-parametric Bayesian models for function estimation, but suffer from high complexity in terms of both computation and storage. To address such issues, approximation methods have flourished in the literature, including model approximations and approximate inference. However, these methods often sacrifice accuracy for scalability. In this work, we present the design and evaluation of a distributed method for exact GP inference, that achieves true model parallelism using simple, high-level distributed computing frameworks. Our experiments show that exact inference at scale is not only feasible, but it also brings substantial benefits in terms of low error rates and accurate quantification of uncertainty.
引用
收藏
页码:1286 / 1295
页数:10
相关论文
共 55 条
[1]  
ADELSONVELSKII GM, 1962, DOKL AKAD NAUK SSSR+, V146, P263
[2]  
Angerson E., 1990, Proceedings of Supercomputing '90 (Cat. No.90CH2916-5), P2, DOI 10.1109/SUPERC.1990.129995
[3]  
[Anonymous], 1999, LAPACK users' guide third
[4]  
[Anonymous], 2017, J MACHINE LEARNING R
[5]  
[Anonymous], 2017, J MACH LEARN RES
[6]  
[Anonymous], 1997, Scalapack Users Guide
[7]   Near optimal Cholesky factorization on orthogonal multiprocessors [J].
Bansal, SS ;
Vishal, B ;
Gupta, P .
INFORMATION PROCESSING LETTERS, 2002, 84 (01) :23-30
[8]  
Choi Jaeyoung., 1996, SCI PROGRAMMING-NETH, V5, P173
[9]  
Choudhury A, 2002, SIAM PROC S, P95
[10]  
Cutajar K., 2017, PROCEEDINGS OF THE 3, P884