Conditional Matrix Flows for Gaussian Graphical Models

被引:0
作者
Negri, Marcello Massimo [1 ]
Torres, Fabricio Arend [1 ]
Roth, Volker [1 ]
机构
[1] Univ Basel, Basel, Switzerland
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年
关键词
VARIABLE SELECTION; SCALE MIXTURES; LASSO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Studying conditional independence among many variables with few observations is a challenging task. Gaussian Graphical Models (GGMs) tackle this problem by encouraging sparsity in the precision matrix through l(q) regularization with q <= 1. However, most GMMs rely on the l(1) norm because the objective is highly non-convex for sub-l(1) pseudo-norms. In the frequentist formulation, the l(1) norm relaxation provides the solution path as a function of the shrinkage parameter lambda. In the Bayesian formulation, sparsity is instead encouraged through a Laplace prior, but posterior inference for different lambda requires repeated runs of expensive Gibbs samplers. Here we propose a general framework for variational inference with matrix-variate Normalizing Flow in GGMs, which unifies the benefits of frequentist and Bayesian frameworks. As a key improvement on previous work, we train with one flow a continuum of sparse regression models jointly for all regularization parameters lambda and all l(q) norms, including non-convex sub-l(1) pseudo-norms. Within one model we thus have access to (i) the evolution of the posterior for any lambda and any l(q) (pseudo-) norm, (ii) the marginal log-likelihood for model selection, and (iii) the frequentist solution paths through simulated annealing in the MAP limit.
引用
收藏
页数:17
相关论文
共 45 条
  • [1] Abramson David, 1999, ASIA PACIFIC J OPERA
  • [2] Alves Larissa, 2021, VARIATIONAL FULL BAY
  • [3] ANDREWS DF, 1974, J ROY STAT SOC B MET, V36, P99
  • [4] Simulated annealing for maximum A Posteriori parameter estimation of hidden Markov models
    Andrieu, C
    Doucet, A
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2000, 46 (03) : 994 - 1004
  • [5] [Anonymous], 1999, Matrix Variate Distributions
  • [6] Atanov Andrei, 2020, Semiconditional normalizing flows for semi-supervised learning
  • [7] Bayesian Group-Sparse Modeling and Variational Inference
    Babacan, S. Derin
    Nakajima, Shinichi
    Do, Minh N.
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (11) : 2906 - 2921
  • [8] Banerjee Onureena, 2007, MODEL SELECTION SPAR
  • [9] Variational Inference: A Review for Statisticians
    Blei, David M.
    Kucukelbir, Alp
    McAuliffe, Jon D.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 859 - 877
  • [10] Castelo R, 2006, J MACH LEARN RES, V7, P2621