Learning Deep Generative Models

被引:206
作者
Salakhutdinov, Ruslan [1 ,2 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 3G4, Canada
[2] Univ Toronto, Dept Stat Sci, Toronto, ON M5S 3G4, Canada
来源
ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, VOL 2 | 2015年 / 2卷
关键词
deep learning; deep belief networks; deep Boltzmann machines; graphical models;
D O I
10.1146/annurev-statistics-010814-020120
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Building intelligent systems that are capable of extracting high-level representations from high-dimensional sensory data lies at the core of solving many artificial intelligence-related tasks, including object recognition, speech perception, and language understanding. Theoretical and biological arguments strongly suggest that building such systems requires models with deep architectures that involve many layers of nonlinear processing. In this article, we review several popular deep learning models, including deep belief networks and deep Boltzmann machines. We show that (a) these deep generative models, which contain many layers of latent variables and millions of parameters, can be learned efficiently, and (b) the learned high-level feature representations can be successfully applied in many application domains, including visual object recognition, information retrieval, classification, and regression tasks.
引用
收藏
页码:361 / 385
页数:25
相关论文
共 54 条
[31]  
LeCun Y, 2004, PROC CVPR IEEE, P97
[32]  
Lee H., 2009, P 26 ANN INT C MACH, DOI DOI 10.1145/1553374.1553453
[33]   The role of the primary visual cortex in higher level vision [J].
Lee, TS ;
Mumford, D ;
Romero, R ;
Lamme, VAF .
VISION RESEARCH, 1998, 38 (15-16) :2429-2454
[34]   Learning to Represent Spatial Transformations with Factored Higher-Order Boltzmann Machines [J].
Memisevic, Roland ;
Hinton, Geoffrey E. .
NEURAL COMPUTATION, 2010, 22 (06) :1473-1492
[35]   Acoustic Modeling Using Deep Belief Networks [J].
Mohamed, Abdel-rahman ;
Dahl, George E. ;
Hinton, Geoffrey .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2012, 20 (01) :14-22
[36]  
Nair V., 2009, Advances in Neural Information Processing Systems 21 (NIPS'08), P1145
[37]   Annealed importance sampling [J].
Neal, RM .
STATISTICS AND COMPUTING, 2001, 11 (02) :125-139
[38]  
Ranzato M, 2007, PROC CVPR IEEE, P1429
[39]   A STOCHASTIC APPROXIMATION METHOD [J].
ROBBINS, H ;
MONRO, S .
ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (03) :400-407
[40]   LEARNING REPRESENTATIONS BY BACK-PROPAGATING ERRORS [J].
RUMELHART, DE ;
HINTON, GE ;
WILLIAMS, RJ .
NATURE, 1986, 323 (6088) :533-536