Global-connected network with generalized ReLU activation

Cited by: 26
Authors
Chen, Zhi [1]
Ho, Pin-Han [1]
Affiliations
[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada
Keywords
CNN; Computer vision; Deep learning; Activation; NEURAL-NETWORKS; DROPOUT;
DOI
10.1016/j.patcog.2019.07.006
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Recent progress has shown that exploiting hidden layer neurons in convolutional neural networks (CNNs), combined with a carefully designed activation function, can yield better classification results in the field of computer vision. The paper first introduces a novel deep learning (DL) architecture aiming to mitigate the gradient-vanishing problem, in which the earlier hidden layer neurons are directly connected with the last hidden layer and fed into the softmax layer for classification. We then design a generalized linear rectifier function as the activation function that can approximate arbitrary complex functions via training of the parameters. We show that our design can achieve similar performance in a number of object recognition and video action benchmark tasks, such as MNIST, CIFAR-10/100, SVHN, Fashion-MNIST, STL-10, and UCF YouTube Action Video datasets, with significantly fewer parameters and a shallower network infrastructure, which is not only promising for training in terms of computation burden and memory usage, but is also applicable to low-computation, low-memory mobile scenarios for inference. © 2019 Elsevier Ltd. All rights reserved.
Pages: 10