Kernel Analysis of Deep Networks

被引:0
作者
Montavon, Gregoire [1 ]
Braun, Mikio L. [1 ]
Mueller, Klaus-Robert [1 ,2 ]
机构
[1] Tech Univ Berlin, Machine Learning Grp, D-10587 Berlin, Germany
[2] Univ Calif Los Angeles, Inst Pure & Appl Math, Los Angeles, CA 90095 USA
关键词
deep networks; kernel principal component analysis; representations; RECEPTIVE-FIELDS; ARCHITECTURE;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
When training deep networks it is common knowledge that an efficient and well generalizing representation of the problem is formed. In this paper we aim to elucidate what makes the emerging representation successful. We analyze the layer-wise evolution of the representation in a deep network by building a sequence of deeper and deeper kernels that subsume the mapping performed by more and more layers of the deep network and measuring how these increasingly complex kernels fit the learning problem. We observe that deep networks create increasingly better representations of the learning problem and that the structure of the deep network controls how fast the representation of the task is formed layer after layer.
引用
收藏
页码:2563 / 2581
页数:19
相关论文
共 38 条
[31]   LEARNING REPRESENTATIONS BY BACK-PROPAGATING ERRORS [J].
RUMELHART, DE ;
HINTON, GE ;
WILLIAMS, RJ .
NATURE, 1986, 323 (6088) :533-536
[32]   Input space versus feature space in kernel-based methods [J].
Schölkopf, B ;
Mika, S ;
Burges, CJC ;
Knirsch, P ;
Müller, KR ;
Rätsch, G ;
Smola, AJ .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (05) :1000-1017
[33]   Nonlinear component analysis as a kernel eigenvalue problem [J].
Scholkopf, B ;
Smola, A ;
Muller, KR .
NEURAL COMPUTATION, 1998, 10 (05) :1299-1319
[34]  
Scholkopf B., 2001, Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond
[35]  
Serre T, 2005, PROC CVPR IEEE, P994
[36]   Mathematics of the Neural Response [J].
Smale, S. ;
Rosasco, L. ;
Bouvrie, J. ;
Caponnetto, A. ;
Poggio, T. .
FOUNDATIONS OF COMPUTATIONAL MATHEMATICS, 2010, 10 (01) :67-91
[37]   The connection between regularization operators and support vector kernels [J].
Smola, AJ ;
Scholkopf, B ;
Muller, KR .
NEURAL NETWORKS, 1998, 11 (04) :637-649
[38]  
Wibisono Andre, 2010, LEARNING INVARIANCE