Global-connected network with generalized ReLU activation

被引：28

作者：

Chen, Zhi ^{[1
]}

Ho, Pin-Han ^{[1
]}

机构：

[1] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON N2L 3G1, Canada

来源：

PATTERN RECOGNITION | 2019年 / 96卷

关键词：

CNN; Computer vision; Deep learning; Activation; NEURAL-NETWORKS; DROPOUT;

D O I：

10.1016/j.patcog.2019.07.006

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent Progress has shown that exploitation of hidden layer neurons in convolutional neural networks (CNN) incorporating with a carefully designed activation function can yield better classification results in the field of computer vision. The paper firstly introduces a novel deep learning (DL) architecture aiming to mitigate the gradient-vanishing problem, in which the earlier hidden layer neurons could be directly connected with the last hidden layer and fed into the softmax layer for classification. We then design a generalized linear rectifier function as the activation function that can approximate arbitrary complex functions via training of the parameters. We will show that our design can achieve similar performance in a number of object recognition and video action benchmark tasks, such as MNIST, CIFAR-10/100, SVHN, Fashion-MNIST, STL-10, and UCF YoutTube Action Video datasets, under significantly less number of parameters and shallower network infrastructure, which is not only promising in training in terms of computation burden and memory usage, but is also applicable to low-computation, low-memory mobile scenarios for inference. (C) 2019 Elsevier Ltd. All rights reserved.

引用

页数：10

共 38 条

[1]

[Anonymous], 18 INT C ART INT STA

[2]

[Anonymous], 2014 IEEE INT C LEAR

[3]

[Anonymous], ICLR 2016

[4]

[Anonymous], 2015, INT C MACH LEARN ICM

[5]

[Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]

[6]

[Anonymous], 2016, P BRIT MACH VIS C

[7]

[Anonymous], 2016, EUR C COMP VIS ECCV

[8] Text/non-text image classification in the wild with convolutional neural networks [J].

Bai, Xiang ;

Shi, Baoguang ;

Zhang, Chengquan ;

Cai, Xuan ;

Qi, Li .

PATTERN RECOGNITION, 2017, 66 :437-446

[9] The dropout learning algorithm [J].

Baldi, Pierre ;

Sadowski, Peter .

ARTIFICIAL INTELLIGENCE, 2014, 210 :78-122

[10] Representation Learning: A Review and New Perspectives [J].

Bengio, Yoshua ;

Courville, Aaron ;

Vincent, Pascal .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828

← 1 2 3 4 →