Learning a good representation with unsymmetrical auto-encoder

被引:15
作者
Sun, Yanan [1 ]
Mao, Hua [1 ]
Guo, Quan [1 ]
Yi, Zhang [1 ]
机构
[1] Sichuan Univ, Coll Comp Sci, Machine Intelligence Lab, Chengdu 610065, Peoples R China
基金
美国国家科学基金会;
关键词
Auto-encoder; Neural networks; Feature learning; Deep learning; Unsupervised learning;
D O I
10.1007/s00521-015-1939-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Auto-encoders play a fundamental role in unsupervised feature learning and learning initial parameters of deep architectures for supervised tasks. For given input samples, robust features are used to generate robust representations from two perspectives: (1) invariant to small variation of samples and (2) reconstruction by decoders with minimal error. Traditional auto-encoders with different regularization terms have symmetrical numbers of encoder and decoder layers, and sometimes parameters. We investigate the relation between the number of layers and propose an unsymmetrical structure, i.e., an unsymmetrical auto-encoder (UAE), to learn more effective features. We present empirical results of feature learning using the UAE and state-of-the-art auto-encoders for classification tasks with a range of datasets. We also analyze the gradient vanishing problem mathematically and provide suggestions for the appropriate number of layers to use in UAEs with a logistic activation function. In our experiments, UAEs demonstrated superior performance with the same configuration compared to other autoencoders.
引用
收藏
页码:1361 / 1367
页数:7
相关论文
共 26 条
[1]  
[Anonymous], 2012, UNSUPERVISED TRANSF
[2]  
[Anonymous], 2008, Advances in neural information processing systems
[3]  
[Anonymous], 2009, TECHNICAL REPORT
[4]  
[Anonymous], 2006, NIPS
[5]  
[Anonymous], 1995, Advances in Neural Information Processing Systems
[6]  
[Anonymous], 2009, ICML
[7]  
[Anonymous], 2009, NIPS
[8]   NEURAL NETWORKS AND PRINCIPAL COMPONENT ANALYSIS - LEARNING FROM EXAMPLES WITHOUT LOCAL MINIMA [J].
BALDI, P ;
HORNIK, K .
NEURAL NETWORKS, 1989, 2 (01) :53-58
[9]   Contrastive Learning and Neural Oscillations [J].
Baldi, Pierre ;
Pineda, Fernando .
NEURAL COMPUTATION, 1991, 3 (04) :526-545
[10]   What Size Net Gives Valid Generalization? [J].
Baum, Eric B. ;
Haussler, David .
NEURAL COMPUTATION, 1989, 1 (01) :151-160