Scalable Gaussian Process Regression Networks

被引:0
作者
Li, Shibo [1 ]
Xing, Wei [2 ]
Kirby, Robert M. [1 ,2 ]
Zhe, Shandian [1 ]
机构
[1] Univ Utah, Sch Comp, Salt Lake City, UT 84112 USA
[2] Univ Utah, Sci Comp & Imaging Inst, Salt Lake City, UT USA
来源
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE | 2020年
关键词
MODEL;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gaussian process regression networks (GPRN) are powerful Bayesian models for multi-output regression, but their inference is intractable. To address this issue, existing methods use a fully factorized structure (or a mixture of such structures) over all the outputs and latent functions for posterior approximation, which, however, can miss the strong posterior dependencies among the latent variables and hurt the inference quality. In addition, the updates of the variational parameters are inefficient and can be prohibitively expensive for a large number of outputs. To overcome these limitations, we propose a scalable variational inference algorithm for GPRN, which not only captures the abundant posterior dependencies but also is much more efficient for massive outputs. We tensorize the output space and introduce tensor/matrix-normal variational posteriors to capture the posterior correlations and to reduce the parameters. We jointly optimize all the parameters and exploit the inherent Kronecker product structure in the variational model evidence lower bound to accelerate the computation. We demonstrate the advantages of our method in several real-world applications.
引用
收藏
页码:2456 / 2462
页数:7
相关论文
共 25 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
Alvarez M., 2009, Advances in neural information processing systems, P57
[3]  
Alvarez MA, 2019, PR MACH LEARN RES, V89
[4]   Kernels for Vector-Valued Functions: A Review [J].
Alvarez, Mauricio A. ;
Rosasco, Lorenzo ;
Lawrence, Neil D. .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 4 (03) :195-266
[5]   Efficient topology optimization in MATLAB using 88 lines of code [J].
Andreassen, Erik ;
Clausen, Anders ;
Schevenels, Mattias ;
Lazarov, Boyan S. ;
Sigmund, Ole .
STRUCTURAL AND MULTIDISCIPLINARY OPTIMIZATION, 2011, 43 (01) :1-16
[6]  
[Anonymous], 2013, Advances in neural information processing systems
[7]  
[Anonymous], 2010, P 13 INT C ARTIFICIA
[8]  
[Anonymous], 2011, P 25 INT C NEURAL IN
[9]  
Balasubramanian M, 2002, SCIENCE, V295
[10]  
Bonilla Edwin V, 2007, Advances in Neural Information Processing Systems, V20