Building feature space of extreme learning machine with sparse denoising stacked-autoencoder

被引:48
作者
Cao, Le-le [1 ]
Huang, Wen-bing [1 ]
Sun, Fu-chun [1 ]
机构
[1] Tsinghua Univ, Dept Comp Sci & Technol, Tsinghua Natl Lab Informat Sci & Technol TNList, Beijing 100084, Peoples R China
关键词
Extreme learning machine (ELM); Ridge regression; Feature space; Stacked autoencoder (SAE); Classification; Regression; FACE RECOGNITION; BELIEF NETWORKS; DEEP; REGRESSION;
D O I
10.1016/j.neucom.2015.02.096
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The random-hidden-node extreme learning machine (ELM) is a much more generalized cluster of single-hidden-layer feed-forward neural networks (SLFNs) which has three parts: random projection, nonlinear transformation, and ridge regression (RR) model. Networks with deep architectures have demonstrated state-of-the-art performance in a variety of settings, especially with computer vision tasks. Deep learning algorithms such as stacked autoencoder (SAE) and deep belief network (DEN) are built on learning several levels of representation of the input. Beyond simply learning features by stacking autoencoders (AE), there is a need for increasing its robustness to noise and reinforcing the sparsity of weights to make it easier to discover interesting and prominent features. The sparse AE and denoising AE was hence developed for this purpose. This paper proposes an approach: SSDAE-RR (stacked sparse denoising autoencoder - ridge regression) that effectively integrates the advantages in SAE, sparse AE, denoising AE, and the RR implementation in ELM algorithm. We conducted experimental study on real-world classification (binary and multiclass) and regression problems with different scales among several relevant approaches: SSDAE-RR, ELM, DBN, neural network (NN), and SAE. The performance analysis shows that the SSDAE-RR tends to achieve a better generalization ability on relatively large datasets (large sample size and high dimension) that were not pre-processed for feature abstraction. For 16 out of 18 tested datasets, the performance of SSDAE-RR is more stable than other tested approaches. We also note that the sparsity regularization and denoising mechanism seem to be mandatory for constructing interpretable feature representations. The fact that a SSDAE-RR approach often has a comparable training time to ELM makes it useful in some real applications. (C) 2015 Elsevier B.V. All rights reserved.
引用
收藏
页码:60 / 71
页数:12
相关论文
共 58 条
  • [31] Enhanced random search based incremental extreme learning machine
    Huang, Guang-Bin
    Chen, Lei
    [J]. NEUROCOMPUTING, 2008, 71 (16-18) : 3460 - 3468
  • [32] Convex incremental extreme learning machine
    Huang, Guang-Bin
    Chen, Lei
    [J]. NEUROCOMPUTING, 2007, 70 (16-18) : 3056 - 3062
  • [33] Extreme learning machine: Theory and applications
    Huang, Guang-Bin
    Zhu, Qin-Yu
    Siew, Chee-Kheong
    [J]. NEUROCOMPUTING, 2006, 70 (1-3) : 489 - 501
  • [34] Universal approximation using incremental constructive feedforward networks with random hidden nodes
    Huang, Guang-Bin
    Chen, Lei
    Siew, Chee-Kheong
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2006, 17 (04): : 879 - 892
  • [35] An Insight into Extreme Learning Machines: Random Neurons, Random Features and Kernels
    Huang, Guang-Bin
    [J]. COGNITIVE COMPUTATION, 2014, 6 (03) : 376 - 390
  • [36] Extreme Learning Machine for Regression and Multiclass Classification
    Huang, Guang-Bin
    Zhou, Hongming
    Ding, Xiaojian
    Zhang, Rui
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2012, 42 (02): : 513 - 529
  • [37] Extreme learning machines: a survey
    Huang, Guang-Bin
    Wang, Dian Hui
    Lan, Yuan
    [J]. INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2011, 2 (02) : 107 - 122
  • [38] Optimization method based extreme learning machine for classification
    Huang, Guang-Bin
    Ding, Xiaojian
    Zhou, Hongming
    [J]. NEUROCOMPUTING, 2010, 74 (1-3) : 155 - 163
  • [39] Kasun LLC, 2013, IEEE INTELL SYST, V28, P31
  • [40] Lan Y., 2010, P EUR S ART NEUR NET, P327