Stacked Robust Autoencoder for Classification

Cited by: 4
Authors
Mehta, Janki [1]
Gupta, Kavya [1]
Gogna, Anupriya [1]
Majumdar, Angshul [1]
Anand, Saket [1]
Affiliations
[1] Indraprastha Inst Informat Technol, Delhi, India
Source
NEURAL INFORMATION PROCESSING, ICONIP 2016, PT III | 2016 / Vol. 9949
Keywords
Autoencoder; Deep learning; Classification; Robust estimation; ALGORITHM; REGRESSION;
DOI
10.1007/978-3-319-46675-0_66
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work we propose an l_p-norm data fidelity constraint for training the autoencoder. Usually the Euclidean distance is used for this purpose; we generalize the l_2-norm to the l_p-norm, where smaller values of p make the problem robust to outliers. The ensuing optimization problem is solved using the Augmented Lagrangian approach. The proposed l_p-norm autoencoder has been tested on benchmark deep learning datasets: MNIST, CIFAR-10 and SVHN. We have seen that the proposed robust autoencoder yields better results than the standard autoencoder (l_2-norm) and the deep belief network for all of these problems.
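As a rough illustration of why a smaller p gives robustness to outliers, the following sketch computes an l_p data-fidelity cost on a toy reconstruction and reports how much of that cost comes from a single gross outlier. The function names `lp_fidelity` and `outlier_share` and the toy data are assumptions made for this example; they are not taken from the paper, and the Augmented Lagrangian solver the authors describe is not reproduced here.

```python
import numpy as np


def lp_fidelity(x, x_hat, p=2.0):
    """Sum of |x_i - x_hat_i|**p: the p-th power of the l_p norm of the residual."""
    if p <= 0:
        raise ValueError("p must be positive")
    residual = np.abs(np.asarray(x, dtype=float) - np.asarray(x_hat, dtype=float))
    return float(np.sum(residual ** p))


def outlier_share(x, x_hat, outlier_index, p):
    """Fraction of the l_p fidelity cost contributed by a single entry."""
    residual = np.abs(np.asarray(x, dtype=float) - np.asarray(x_hat, dtype=float))
    return float(residual[outlier_index] ** p / np.sum(residual ** p))


# Toy data (hypothetical): three small residuals and one gross outlier.
x = np.array([1.0, 2.0, 3.0, 4.0])
x_hat = np.array([1.1, 2.1, 3.1, 9.0])

for p in (2.0, 1.0, 0.5):
    print(f"p={p}: cost={lp_fidelity(x, x_hat, p):.3f}, "
          f"outlier share={outlier_share(x, x_hat, 3, p):.3f}")
```

With p = 2 the single large residual accounts for nearly all of the cost, so a fit driven by that cost is pulled toward the outlier. As p decreases, the outlier's share of the cost shrinks relative to the well-fit entries, which is the behaviour the abstract refers to as robustness.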
Pages: 600-607
Page count: 8
Related papers
17 in total
[1]   [Anonymous], 2011, ICML
[2]   [Anonymous], 2009, ICML
[3]   NEURAL NETWORKS AND PRINCIPAL COMPONENT ANALYSIS - LEARNING FROM EXAMPLES WITHOUT LOCAL MINIMA [J].
BALDI, P ;
HORNIK, K .
NEURAL NETWORKS, 1989, 2 (01) :53-58
[4]   IMPROVED ALGORITHM FOR DISCRETE L1 LINEAR-APPROXIMATION [J].
BARRODALE, I ;
ROBERTS, FDK .
SIAM JOURNAL ON NUMERICAL ANALYSIS, 1973, 10 (05) :839-848
[5]   ALTERNATIVES TO LEAST-SQUARES [J].
BRANHAM, RL .
ASTRONOMICAL JOURNAL, 1982, 87 (06) :928-937
[6]   Nonconvex Splitting for Regularized Low-Rank plus Sparse Decomposition [J].
Chartrand, Rick .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2012, 60 (11) :5810-5819
[7]   ROBUST ESTIMATION OF LOCATION PARAMETER [J].
HUBER, PJ .
ANNALS OF MATHEMATICAL STATISTICS, 1964, 35 (01) :73-&
[8]   A maximum likelihood approach to least absolute deviation regression [J].
Li, YB ;
Arce, GR .
EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (12) :1762-1769
[9]   On the choice of Compressed Sensing priors and sparsifying transforms for MR image reconstruction: An experimental study [J].
Majumdar, Angshul ;
Ward, Rabab K. .
SIGNAL PROCESSING-IMAGE COMMUNICATION, 2012, 27 (09) :1035-1048
[10]   Stacked Convolutional Auto-Encoders for Hierarchical Feature Extraction [J].
Masci, Jonathan ;
Meier, Ueli ;
Ciresan, Dan ;
Schmidhuber, Juergen .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT I, 2011, 6791 :52-59