Conditional Matrix Flows for Gaussian Graphical Models

被引：0

作者：

Negri, Marcello Massimo ^{[1
]}

Torres, Fabricio Arend ^{[1
]}

Roth, Volker ^{[1
]}

机构：

[1] Univ Basel, Basel, Switzerland

来源：

ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023) | 2023年

关键词：

VARIABLE SELECTION; SCALE MIXTURES; LASSO;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Studying conditional independence among many variables with few observations is a challenging task. Gaussian Graphical Models (GGMs) tackle this problem by encouraging sparsity in the precision matrix through l(q) regularization with q <= 1. However, most GMMs rely on the l(1) norm because the objective is highly non-convex for sub-l(1) pseudo-norms. In the frequentist formulation, the l(1) norm relaxation provides the solution path as a function of the shrinkage parameter lambda. In the Bayesian formulation, sparsity is instead encouraged through a Laplace prior, but posterior inference for different lambda requires repeated runs of expensive Gibbs samplers. Here we propose a general framework for variational inference with matrix-variate Normalizing Flow in GGMs, which unifies the benefits of frequentist and Bayesian frameworks. As a key improvement on previous work, we train with one flow a continuum of sparse regression models jointly for all regularization parameters lambda and all l(q) norms, including non-convex sub-l(1) pseudo-norms. Within one model we thus have access to (i) the evolution of the posterior for any lambda and any l(q) (pseudo-) norm, (ii) the marginal log-likelihood for model selection, and (iii) the frequentist solution paths through simulated annealing in the MAP limit.

引用

页数：17

共 45 条

[1] Abramson David, 1999, ASIA PACIFIC J OPERA
[2] Alves Larissa, 2021, VARIATIONAL FULL BAY
[3] ANDREWS DF, 1974, J ROY STAT SOC B MET, V36, P99
[4] Simulated annealing for maximum A Posteriori parameter estimation of hidden Markov models
Andrieu, C
Doucet, A
[J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2000, 46 (03) : 994 - 1004
[5] [Anonymous], 1999, Matrix Variate Distributions
[6] Atanov Andrei, 2020, Semiconditional normalizing flows for semi-supervised learning
[7] Bayesian Group-Sparse Modeling and Variational Inference
Babacan, S. Derin
Nakajima, Shinichi
Do, Minh N.
[J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (11) : 2906 - 2921
[8] Banerjee Onureena, 2007, MODEL SELECTION SPAR
[9] Variational Inference: A Review for Statisticians
Blei, David M.
Kucukelbir, Alp
McAuliffe, Jon D.
[J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) : 859 - 877
[10] Castelo R, 2006, J MACH LEARN RES, V7, P2621

← 1 2 3 4 5 →