Task-Agnostic Amortized Inference of Gaussian Process Hyperparameters

Citations: 0
Authors
Liu, Sulin [1]
Sun, Xingyuan [1]
Ramadge, Peter J. [1]
Adams, Ryan P. [1]
Affiliations
[1] Princeton Univ, Princeton, NJ 08544 USA
Keywords
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory];
Subject classification codes
081104; 0812; 0835; 1405;
Abstract
Gaussian processes (GPs) are flexible priors for modeling functions. However, their success depends on the kernel accurately reflecting the properties of the data. One of the appeals of the GP framework is that the marginal likelihood of the kernel hyperparameters is often available in closed form, enabling optimization and sampling procedures to fit these hyperparameters to data. Unfortunately, point-wise evaluation of the marginal likelihood is expensive due to the need to solve a linear system; searching or sampling the space of hyperparameters thus often dominates the practical cost of using GPs. We introduce an approach to the identification of kernel hyperparameters in GP regression and related problems that sidesteps the need for costly marginal likelihoods. Our strategy is to "amortize" inference over hyperparameters by training a single neural network, which consumes a set of regression data and produces an estimate of the kernel function, useful across different tasks. To accommodate the varying dimension and cardinality of different regression problems, we use a hierarchical self-attention-based neural network that produces estimates of the hyperparameters which are invariant to the order of the input data points and data dimensions. We show that a single neural model trained on synthetic data is able to generalize directly to several different unseen real-world GP use cases. Our experiments demonstrate that the estimated hyperparameters are comparable in quality to those from the conventional model selection procedures, while being much faster to obtain, significantly accelerating GP regression and its related applications such as Bayesian optimization and Bayesian quadrature. The code and pre-trained model are available at https://github.com/PrincetonLIPS/AHGP.
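The cost the abstract refers to can be illustrated with a minimal sketch of the closed-form GP log marginal likelihood. The squared-exponential kernel, the function names `rbf_kernel` and `log_marginal_likelihood`, and the parameter names are assumptions chosen for illustration; they are not taken from the paper or from the AHGP repository. The Cholesky factorization is the step that solves the linear system and scales cubically in the number of data points.

```python
import numpy as np


def rbf_kernel(X1, X2, lengthscale, variance):
    """Squared-exponential kernel (an assumed kernel choice for this sketch)."""
    # Pairwise squared Euclidean distances between rows of X1 and X2.
    sq = (
        np.sum(X1**2, axis=1)[:, None]
        + np.sum(X2**2, axis=1)[None, :]
        - 2.0 * X1 @ X2.T
    )
    # Clip tiny negative values caused by floating-point round-off.
    return variance * np.exp(-0.5 * np.maximum(sq, 0.0) / lengthscale**2)


def log_marginal_likelihood(X, y, lengthscale, variance, noise):
    """Log p(y | X, hyperparameters) for a zero-mean GP with Gaussian noise.

    `noise` is the observation noise variance added to the kernel diagonal.
    """
    n = X.shape[0]
    K = rbf_kernel(X, X, lengthscale, variance) + noise * np.eye(n)
    # Cholesky factorization: the O(n^3) step repeated for every
    # candidate hyperparameter setting during search or sampling.
    L = np.linalg.cholesky(K)
    # alpha = K^{-1} y via two solves against the Cholesky factor.
    alpha = np.linalg.solve(L.T, np.linalg.solve(L, y))
    # 0.5 * log det K equals the sum of the log-diagonal of L.
    return (
        -0.5 * y @ alpha
        - np.sum(np.log(np.diag(L)))
        - 0.5 * n * np.log(2.0 * np.pi)
    )
```

A conventional model selection loop would call a function like `log_marginal_likelihood` once per candidate hyperparameter setting, so each step pays for a fresh factorization. The amortized approach described in the abstract instead predicts the hyperparameters from the data with a single pass through a trained network.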
Pages: 13
Related papers
50 in total
  • [21] Latent Plans for Task-Agnostic Offline Reinforcement Learning
    Rosete-Beas, Erick
    Mees, Oier
    Kalweit, Gabriel
    Boedecker, Joschka
    Burgard, Wolfram
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 1838 - 1849
  • [22] TAPE: Task-Agnostic Prior Embedding for Image Restoration
    Liu, Lin
    Xie, Lingxi
    Zhang, Xiaopeng
    Yuan, Shanxin
    Chen, Xiangyu
    Zhou, Wengang
    Li, Houqiang
    Tian, Qi
    COMPUTER VISION - ECCV 2022, PT XVIII, 2022, 13678 : 447 - 464
  • [23] EViLBERT: Learning Task-Agnostic Multimodal Sense Embeddings
    Calabrese, Agostina
    Bevilacqua, Michele
    Navigli, Roberto
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 481 - 487
  • [24] Task-agnostic feature extractors for incremental learning at the edge
    Loomis, Lisa
    Wise, David
    Inkawhich, Nathan
    Thiem, Clare
    McDonald, Nathan
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS VI, 2024, 13051
  • [25] Task-Agnostic Dynamics Priors for Deep Reinforcement Learning
    Du, Yilun
    Narasimhan, Karthik
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [26] TADA: Efficient Task-Agnostic Domain Adaptation for Transformers
    Hung, Chia-Chien
    Lange, Lukas
    Stroetgen, Jannik
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 487 - 503
  • [27] COSMIC: Mutual Information for Task-Agnostic Summarization Evaluation
    Darrin, Maxime
    Formont, Philippe
    Cheung, Jackie Chi Kit
    Piantanida, Pablo
    PROCEEDINGS OF THE 62ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1: LONG PAPERS, 2024, : 12696 - 12717
  • [28] Task-Agnostic Privacy-Preserving Representation Learning for Federated Learning against Attribute Inference Attacks
    Arevalo, Caridad Arroyo
    Noorbakhsh, Sayedeh Leila
    Dong, Yun
    Hong, Yuan
    Wang, Binghui
    THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 10909 - 10917
  • [29] CEM: Constrained Entropy Maximization for Task-Agnostic Safe Exploration
    Yang, Qisong
    Spaan, Matthijs T. J.
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 9, 2023, : 10798 - 10806
  • [30] Task-Agnostic Continual Hippocampus Segmentation for Smooth Population Shifts
    Gonzalez, Camila
    Ranem, Amin
    Othman, Ahmed
    Mukhopadhyay, Anirban
    DOMAIN ADAPTATION AND REPRESENTATION TRANSFER (DART 2022), 2022, 13542 : 108 - 118