Deep Convolutional Network Based on Pyramid Architecture

被引:5
作者
Lv, Enhui [1 ]
Wang, Xuesong [1 ]
Cheng, Yuhu [1 ]
Yu, Qiang [2 ]
机构
[1] China Univ Min & Technol, Sch Informat & Control Engn, Xuzhou 221116, Jiangsu, Peoples R China
[2] China Univ Min & Technol, Sch Elect & Power Engn, Xuzhou 221116, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Deep convolution network; pyramid architecture; feature map dimension; gradient dispersion; NEURAL-NETWORKS;
D O I
10.1109/ACCESS.2018.2860785
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep convolutional network demonstrates that the classification accuracy can be remarkably improved by increasing the number of network layers, however, increases the accuracy by 1% of costs nearly doubling the number of layers. Meanwhile, gradient dispersion will occur in the training process, which leads to performance degradation. In order to solve the problem of training difficulty with the increased number of layers, we focus on network architecture and propose a deep convolutional network based on the pyramid structure. In the network architecture, as the number of layers increased, the feature map dimensions (i.e., the number of channels) are gradually increased at each layer to distribute the burden concentrated at locations of structural units affected by downsampling, such that all units are equally distributed. By exploring the sequence between the stacked elements inside the structural unit, we design a pyramidal building block, as its shape gradually widens from the top downwards, which is called the deep pyramid convolutional network (DPCNet). Experimental results on CIFAR-10 and CIFAR-100 datasets have shown that DPCNet has the superior generalization capability and can effectively improve the image classification accuracy.
引用
收藏
页码:43125 / 43135
页数:11
相关论文
共 39 条
[1]  
[Anonymous], 2015, PROC CVPR IEEE, DOI 10.1109/CVPR.2015.7299173
[2]  
[Anonymous], 2016, Lecture Notes in Computer Science, DOI [10.1007/978-3-319-46493-0_38, DOI 10.1007/978-3-319-46493-0_38]
[3]  
[Anonymous], 2016, P MACHINE LEARNING R
[4]  
[Anonymous], 2016, ARXIV160507648
[5]  
[Anonymous], 2017, INCEPTION RECURRENT
[6]   On the Complexity of Neural Network Classifiers: A Comparison Between Shallow and Deep Architectures [J].
Bianchini, Monica ;
Scarselli, Franco .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (08) :1553-1565
[7]   Fast Deep Neural Networks With Knowledge Guided Training and Predicted Regions of Interests for Real-Time Video Object Detection [J].
Cao, Wenming ;
Yuan, Jianhe ;
He, Zhihai ;
Zhang, Zhi ;
He, Zhiquan .
IEEE ACCESS, 2018, 6 :8990-8999
[8]   Triplet-Based Deep Hashing Network for Cross-Modal Retrieval [J].
Deng, Cheng ;
Chen, Zhaojia ;
Liu, Xianglong ;
Gao, Xinbo ;
Tao, Dacheng .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (08) :3893-3903
[9]   Facial landmark localization by enhanced convolutional neural network [J].
Deng, Weihong ;
Fang, Yuke ;
Xu, Zhenqi ;
Hu, Jiani .
NEUROCOMPUTING, 2018, 273 :222-229
[10]  
Girshick R., 2015, P IEEE INT C COMPUTE, P1440, DOI [10.1109/ICCV.2015.169, DOI 10.1109/ICCV.2015.169]