Beyond Prediction: A Framework for Inference With Variational Approximations in Mixture Models

被引:14
作者
Westling, T. [1 ]
McCormick, T. H. [2 ,3 ]
机构
[1] Univ Penn, Ctr Causal Inference, Blockley Hall 509D,423 Guardian Dr, Philadelphia, PA 19104 USA
[2] Univ Washington, Dept Stat, Seattle, WA 98195 USA
[3] Univ Washington, Dept Sociol, Seattle, WA 98195 USA
基金
美国国家科学基金会;
关键词
Generalized linear mixed models; Profile M-estimation; ASYMPTOTIC NORMALITY; POPULATION-STRUCTURE; LIKELIHOOD; ALGORITHM;
D O I
10.1080/10618600.2019.1609977
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Variational inference is a popular method for estimating model parameters and conditional distributions in hierarchical and mixed models, which arise frequently in many settings in the health, social, and biological sciences. Variational inference in a frequentist context works by approximating intractable conditional distributions with a tractable family and optimizing the resulting lower bound on the log-likelihood. The variational objective function is typically less computationally intensive to optimize than the true likelihood, enabling scientists to fit rich models even with extremely large datasets. Despite widespread use, little is known about the general theoretical properties of estimators arising from variational approximations to the log-likelihood, which hinders their use in inferential statistics. In this article, we connect such estimators to profile M-estimation, which enables us to provide regularity conditions for consistency and asymptotic normality of variational estimators. Our theory also motivates three methodological improvements to variational inference: estimation of the asymptotic model-robust covariance matrix, a one-step correction that improves estimator efficiency, and an empirical assessment of consistency. We evaluate the proposed results using simulation studies and data on marijuana use from the National Longitudinal Study of Youth. for this article are available online.
引用
收藏
页码:778 / 789
页数:12
相关论文
共 32 条
[1]  
Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
[2]   Fast and Elegant Numerical Linear Algebra Using the RcppEigen Package [J].
Bates, Douglas ;
Eddelbuettel, Dirk .
JOURNAL OF STATISTICAL SOFTWARE, 2013, 52 (05) :1-24
[3]   ASYMPTOTIC NORMALITY OF MAXIMUM LIKELIHOOD AND ITS VARIATIONAL APPROXIMATION FOR STOCHASTIC BLOCKMODELS [J].
Bickel, Peter ;
Choi, David ;
Chang, Xiangyu ;
Zhang, Hai .
ANNALS OF STATISTICS, 2013, 41 (04) :1922-1943
[4]   Variational Inference: A Review for Statisticians [J].
Blei, David M. ;
Kucukelbir, Alp ;
McAuliffe, Jon D. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) :859-877
[5]   Latent Dirichlet allocation [J].
Blei, DM ;
Ng, AY ;
Jordan, MI .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (4-5) :993-1022
[6]  
Boyd Stephen, 2004, Convex Optimization, DOI 10.1017/CBO9780511804441
[7]  
Bureau of Labor Statistics U. S. Department of Labor, 2013, NAT LONG SURV YOUTH
[8]   A LIMITED MEMORY ALGORITHM FOR BOUND CONSTRAINED OPTIMIZATION [J].
BYRD, RH ;
LU, PH ;
NOCEDAL, J ;
ZHU, CY .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1995, 16 (05) :1190-1208
[9]  
Davidian Marie., 1995, NONLINEAR MODELS REP, V62
[10]   Mixed-membership models of scientific publications [J].
Erosheva, E ;
Fienberg, S ;
Lafferty, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 :5220-5227