CSPNet: A New Backbone that can Enhance Learning Capability of CNN

被引:3140
作者
Wang, Chien-Yao [1 ]
Liao, Hong-Yuan Mark [1 ,2 ]
Wu, Yueh-Hua [1 ,3 ]
Chen, Ping-Yang [4 ]
Hsieh, Jun-Wei [5 ]
Yeh, I-Hau [6 ]
机构
[1] Acad Sinica, Inst Informat Sci, Taipei, Taiwan
[2] Providence Univ, Dept Comp Sci & Informat Engn, Taichung, Taiwan
[3] Natl Taiwan Univ, Dept Comp Sci & Informat Engn, Taipei, Taiwan
[4] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
[5] Natl Chiao Tung Univ, Coll Artificial Intelligence & Green Energy, Hsinchu, Taiwan
[6] Elan Microelect Corp, Hsinchu, Taiwan
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020) | 2020年
关键词
D O I
10.1109/CVPRW50498.2020.00203
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural networks have enabled state-of-the-art approaches to achieve incredible results on computer vision tasks such as object detection. However, such success greatly relies on costly computation resources, which hinders people with cheap devices from appreciating the advanced technology. In this paper, we propose Cross Stage Partial Network (CSPNet) to mitigate the problem that previous works require heavy inference computations from the network architecture perspective. We attribute the problem to the duplicate gradient information within network optimization. The proposed networks respect the variability of the gradients by integrating feature maps from the beginning and the end of a network stage, which, in our experiments, reduces computations by 20% with equivalent or even superior accuracy on the ImageNet dataset, and significantly outperforms state-of-the-art approaches in terms of AP50 on the MS COCO object detection dataset. The CSPNet is easy to implement and general enough to cope with architectures based on ResNet, ResNeXt, and DenseNet.
引用
收藏
页码:1571 / 1580
页数:10
相关论文
共 43 条
[1]  
[Anonymous], 2013, Darknet: Open source neural networks in C
[2]   Hierarchical Shot Detector [J].
Cao, Jiale ;
Pang, Yanwei ;
Han, Jungong ;
Li, Xuelong .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9704-9713
[3]   HarDNet: A Low Memory Traffic Network [J].
Chao, Ping ;
Kao, Chao-Yang ;
Ruan, Yu-Shan ;
Huang, Chien-Hsiang ;
Lin, Youn-Long .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3551-3560
[4]  
Chen Liang-Chieh, 2019, P IEEE INT C COMPUTE
[5]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[6]  
Deng Y., 2017, Light-Head R-CNN: In Defense of Two-Stage Object Detector
[7]   CenterNet: Keypoint Triplets for Object Detection [J].
Duan, Kaiwen ;
Bai, Song ;
Xie, Lingxi ;
Qi, Honggang ;
Huang, Qingming ;
Tian, Qi .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6568-6577
[8]  
Gao Shang-Hua, 2020, IEEE T. Pattern Anal. Mach. Intell.
[9]  
Goodfellow I.J., 2013, Maxout networks
[10]   Deep Pyramidal Residual Networks [J].
Han, Dongyoon ;
Kim, Jiwhan ;
Kim, Junmo .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6307-6315