Improving Deep Neural Networks with Multilayer Maxout Networks

Cited by: 0
Authors
Sun, Weichen [1]
Su, Fei [1, 2]
Wang, Leiquan [1, 3]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing, Peoples R China
[2] Beijing Univ Posts & Telecommun, Beijing Key Lab Network Syst & Network Culture, Beijing, Peoples R China
[3] China Univ Petr Huadong, Sch Comp & Commun Engn, Qingdao, Peoples R China
Source
2014 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING CONFERENCE | 2014
Keywords
Deep learning; Convolutional neural network; Maxout; Representation learning; Image classification;
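DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract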
To enhance the discriminability of convolutional neural networks (CNNs) and to facilitate their optimization, this paper proposes a multilayer structured variant of the maxout unit, named the Multilayer Maxout Network (MMN). CNNs with maxout units employ linear convolution filters followed by maxout units to abstract representations from less abstract ones. Our model instead applies MMNs as the activation functions of CNNs to abstract representations; it thereby inherits the advantages of both maxout units and deep neural networks, and is also a more general nonlinear function approximator. Experimental results show that the proposed model yields better performance than some state-of-the-art methods on three image classification benchmark datasets (CIFAR-10, CIFAR-100 and MNIST). Furthermore, the influence of MMNs in different hidden layers is analyzed, and a trade-off scheme between accuracy and computing resources is given.
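As a rough illustration of the idea in the abstract, the sketch below implements a standard maxout unit (the element-wise maximum over k affine pieces) and stacks several of them. The stacking in `multilayer_maxout` is only an assumption about what "multilayer" means here: the abstract does not specify the MMN architecture, and the function names, the dense (non-convolutional) layers, and all shapes are hypothetical choices made for this example.

```python
import numpy as np


def maxout(x, W, b):
    """Standard maxout unit: element-wise max over k affine pieces.

    x: (n, d_in) inputs
    W: (k, d_in, d_out) weights, one matrix per linear piece
    b: (k, d_out) biases
    Returns an array of shape (n, d_out).
    """
    z = np.einsum("nd,kdo->nko", x, W) + b  # (n, k, d_out)
    return z.max(axis=1)


def multilayer_maxout(x, layers):
    """Hypothetical stack of maxout layers (an assumption, not the
    paper's confirmed architecture): each layer feeds the next."""
    for W, b in layers:
        x = maxout(x, W, b)
    return x


rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))  # 4 samples, 8 features
layers = [
    (rng.standard_normal((3, 8, 6)), np.zeros((3, 6))),  # k = 3 pieces
    (rng.standard_normal((3, 6, 5)), np.zeros((3, 5))),
]
y = multilayer_maxout(x, layers)  # shape (4, 5)
```

With a single piece (k = 1) a maxout unit reduces to a plain affine map, and with the two pieces `I` and `-I` it computes the absolute value, which gives some intuition for why maxout is described as a general piecewise-linear function approximator.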
Pages: 334-337
Number of pages: 4