Optimal Architectures in a Solvable Model of Deep Networks

被引:0
作者
Kadmon, Jonathan [1 ,2 ]
Sompolinsky, Haim [1 ,2 ,3 ]
机构
[1] Hebrew Univ Jerusalem, Racah Inst Phys, Jerusalem, Israel
[2] Hebrew Univ Jerusalem, ELSC, Jerusalem, Israel
[3] Harvard Univ, Ctr Brain Sci, Cambridge, MA 02138 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 29 (NIPS 2016) | 2016年 / 29卷
关键词
REPRESENTATIONS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks have received a considerable attention due to the success of their training for real world machine learning applications. They are also of great interest to the understanding of sensory processing in cortical sensory hierarchies. The purpose of this work is to advance our theoretical understanding of the computational benefits of these architectures. Using a simple model of clustered noisy inputs and a simple learning rule, we provide analytically derived recursion relations describing the propagation of the signals along the deep network. By analysis of these equations, and defining performance measures, we show that these model networks have optimal depths. We further explore the dependence of the optimal architecture on the system parameters.
引用
收藏
页数:9
相关论文
共 14 条
[1]  
[Anonymous], 1986, P 1986 PARALLEL DIST
[2]  
[Anonymous], 2008, Advances in neural information processing systems
[3]  
[Anonymous], 2013, EXACT SOLUTIONS NONL
[4]   Sparseness and Expansion in Sensory Representations [J].
Babadi, Baktash ;
Sompolinsky, Haim .
NEURON, 2014, 83 (05) :1213-1226
[5]   NEURAL NETWORKS AND PRINCIPAL COMPONENT ANALYSIS - LEARNING FROM EXAMPLES WITHOUT LOCAL MINIMA [J].
BALDI, P ;
HORNIK, K .
NEURAL NETWORKS, 1989, 2 (01) :53-58
[6]  
Bhand Maneesh., 2011, Advances in Neural Information Processing Sys- tems, P1971
[7]   Large-Margin Classification in Infinite Neural Networks [J].
Cho, Youngmin ;
Saul, Lawrence K. .
NEURAL COMPUTATION, 2010, 22 (10) :2678-2697
[8]  
Cohen William W, 2008, EXTRACTING COMPOSING
[9]   LAYERED NEURAL NETWORKS [J].
DOMANY, E ;
KINZEL, W ;
MEIR, R .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1989, 22 (12) :2081-2102
[10]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507