A Novel Connectivity of Deep Convolutional Neural Networks

被引:0
作者
Shen, Zhixi [1 ]
Liu, Yong [1 ]
机构
[1] Chongqing Univ, Coll Automat, Chongqing 400044, Peoples R China
来源
2017 CHINESE AUTOMATION CONGRESS (CAC) | 2017年
基金
中国国家自然科学基金;
关键词
Sparse residual network; convolutional neural networks; image recognition; multiplicity;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Residual network(ResNet) is an effective instance and a significant extension of deep convolutional neural network. ResNet utilizes skip-connection between input layers and output layers to solve the vanishing gradient problem. Due to the powerfulness of skip-connection, the gradient can flow directly through the identity function from later layers to the earlier layers. However, skip-connection makes ResNet overemphasizes the effect of the original input feature. In this paper, we further research the learning behavior of deep residual networks(ResNet) and proposed a novel connectivity pattern to replace skip-connection, which builds upon the success of interpreting the ResNet as an exponential ensemble of relatively shallow networks. The proposed method is implemented by making more input layers link with the output layer. The proposed method is called spare residual network(SRN). it generates model that is wider rather than deeper. The experimental results show that the proposed SRN can achieve impressive results on CIFAR-10, and outperform most of the existing architecture.
引用
收藏
页码:7779 / 7783
页数:5
相关论文
共 20 条
[1]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[2]   Learning Deep Architectures for AI [J].
Bengio, Yoshua .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2009, 2 (01) :1-127
[4]   A Multiobjective Sparse Feature Learning Model for Deep Neural Networks [J].
Gong, Maoguo ;
Liu, Jia ;
Li, Hao ;
Cai, Qing ;
Su, Linzhi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (12) :3263-3277
[5]  
He K., 2016, PROC CVPR IEEE, P630, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38, DOI 10.1109/CVPR.2016.90]
[6]  
He K., 2016, P IEEE COMPUTER SOC, P770
[7]  
Ioffe Sergey, 2015, PROC INT C MACH LEAR, V37, P448, DOI DOI 10.48550/ARXIV.1502.03167
[8]  
JIA Y, 2014, P 22 ACM INT C MULT, DOI [DOI 10.1145/2647868.2654889, 10.1145/2647868.2654889]
[9]  
Krizhevsky A., 2009, LEARNING MULTIPLE LA
[10]   ImageNet Classification with Deep Convolutional Neural Networks [J].
Krizhevsky, Alex ;
Sutskever, Ilya ;
Hinton, Geoffrey E. .
COMMUNICATIONS OF THE ACM, 2017, 60 (06) :84-90