Build, Compute, Critique, Repeat: Data Analysis with Latent Variable Models

被引:97
作者
Blei, David M. [1 ]
机构
[1] Princeton Univ, Dept Comp Sci, Princeton, NJ 08540 USA
来源
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 1 | 2014年 / 1卷
基金
美国国家科学基金会;
关键词
latent variable models; graphical models; variational inference; predictive sample reuse; posterior predictive checks; HIDDEN MARKOV-MODELS; STATISTICAL-ANALYSIS; BAYES INFERENCE; ALGORITHMS; FUTURE; FIT;
D O I
10.1146/annurev-statistics-022513-115657
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
We survey latent variable models for solving data-analysis problems. A latent variable model is a probabilistic model that encodes hidden patterns in the data. We uncover these patterns from their conditional distribution and use them to summarize data and form predictions. Latent variable models are important in many fields, including computational biology, natural language processing, and social network analysis. Our perspective is that models are developed iteratively: We build a model, use it to analyze data, assess how it succeeds and fails, revise it, and repeat. We describe how new research has transformed these essential activities. First, we describe probabilistic graphical models, a language for formulating latent variable models. Second, we describe mean field variational inference, a generic algorithm for approximating conditional distributions. Third, we describe how to use our analyses to solve problems: exploring the data, forming predictions, and pointing us in the direction of improved models.
引用
收藏
页码:203 / 232
页数:30
相关论文
共 117 条
  • [1] Airoldi EM, 2008, J MACH LEARN RES, V9, P1981
  • [2] [Anonymous], 1999, Learning in Graphical Models
  • [3] [Anonymous], 2010, Bayesian Nonparametrics
  • [4] [Anonymous], 1999, Learning in Graphical Models
  • [5] [Anonymous], 2006, Pattern recognition and machine learning
  • [6] [Anonymous], 2013, Stan: A C++ Library for Probability and Sampling
  • [7] MIXTURES OF DIRICHLET PROCESSES WITH APPLICATIONS TO BAYESIAN NONPARAMETRIC PROBLEMS
    ANTONIAK, CE
    [J]. ANNALS OF STATISTICS, 1974, 2 (06) : 1152 - 1174
  • [8] Asuncion A., 2009, C UNC ART INT UAI QU, P27, DOI DOI 10.1080/10807030390248483
  • [9] Attias H, 1999, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, P21
  • [10] Attias H, 2000, ADV NEUR IN, V12, P209