Higher Order Contractive Auto-Encoder

被引:0
作者
Rifai, Salah [1 ]
Mesnil, Gregoire [1 ]
Vincent, Pascal [1 ]
Muller, Xavier [1 ]
Bengio, Yoshua [1 ]
Dauphin, Yann [1 ]
Glorot, Xavier [1 ]
机构
[1] Univ Montreal, Dept IRO, Montreal, PQ H2C 3J7, Canada
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II | 2011年 / 6912卷
关键词
Unsupervised feature learning; deep learning; manifold;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a novel regularizer when training an auto-encoder for unsupervised feature extraction. We explicitly encourage the latent representation to contract the input space by regularizing the norm of the Jacobian (analytically) and the Hessian (stochastically) of the encoder's output with respect to its input, at the training points. While the penalty on the Jacobian's norm ensures robustness to tiny corruption of samples in the input space, constraining the norm of the Hessian extends this robustness when moving further away from the sample. From a manifold learning perspective, balancing this regularization with the auto-encoder's reconstruction objective yields a representation that varies most when moving along the data manifold in input space, and is most insensitive in directions orthogonal to the manifold. The second order regularization, using the Hessian, penalizes curvature, and thus favors smooth manifold. We show that our proposed technique, while remaining computationally efficient, yields representations that are significantly better suited for initializing deep architectures than previously proposed approaches, beating state-of-the-art performance on a number of datasets.
引用
收藏
页码:645 / 660
页数:16
相关论文
共 25 条
[1]  
[Anonymous], 2008, P 25 INT C MACH LEAR
[2]  
[Anonymous], 2011, INT C MACH LEARN
[3]  
[Anonymous], 2008, P ICML, DOI 10.1145/1390156.1390294
[4]  
[Anonymous], 2006, BOOK REV IEEE T NEUR
[5]  
[Anonymous], P 13 INT C ART INT S
[6]  
[Anonymous], 1977, Solution of illposed problems
[7]  
[Anonymous], 2007, IEEE INT C ICML
[8]  
Bengio Y., 2006, Advances in Neural Information Processing Systems, V19, DOI DOI 10.7551/MITPRESS/7503.003.0024
[9]  
Bengio Y., 2006, NIPS, V18
[10]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127