CLMIP: cross-layer manifold invariance based pruning method of deep convolutional neural network for real-time road type recognition

被引:16
作者
Zheng, Qinghe [1 ]
Tian, Xinyu [2 ]
Yang, Mingqiang [1 ]
Su, Huake [3 ]
机构
[1] Shandong Univ, Sch Informat Sci & Engn, Qingdao 266237, Peoples R China
[2] Shandong Management Univ, Coll Mech & Elect Engn, Jinan 250357, Peoples R China
[3] Xidian Univ, Sch Microelect, Xian 710126, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Deep learning; Road type recognition; Model pruning; Model compression; Inference speedup;
D O I
10.1007/s11045-020-00736-x
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Recently, deep learning based models have demonstrated the superiority in a variety of visual tasks like object detection and instance segmentation. In practical applications, deploying advanced networks into real-time applications such as autonomous driving is still challenging due to expensive computational cost and memory footprint. In this paper, to reduce the size of deep convolutional neural network (CNN) and accelerate its reasoning, we propose a cross-layer manifold invariance based pruning method named CLMIP for network compression to help it complete real-time road type recognition in low-cost vision system. Manifolds are higher-dimensional analogues of curves and surfaces, which can be self-organized to reflect the data distribution and characterize the relationship between data. Therefore, we hope to guarantee the generalization ability of deep CNN by maintaining the consistency of the data manifolds of each layer in the network, and then remove the parameters with less influence on the manifold structure. Therefore, CLMIP can be regarded as a tool to further investigate the dependence of model structure on network optimization and generalization. To the best of our knowledge, this is the first time to prune deep CNN based on the invariance of data manifolds. During experimental process, we use the python based keyword crawler program to collect 102 first-view videos of car cameras, including 137 200 images (320 x 240) of four road scenes (urban road, off-road, trunk road and motorway). Finally, the classification results have demonstrated that CLMIP can achieve state-of-the-art performance with a speed of 26 FPS on NVIDIA Jetson Nano.
引用
收藏
页码:239 / 262
页数:24
相关论文
共 64 条
[1]  
[Anonymous], BRIT MACH VIS C BMVC
[2]  
[Anonymous], 2015, J MACH LEARN RES, DOI DOI 10.17863/CAM.6477
[3]  
[Anonymous], 2016, PATTERN RECOGNITION
[4]   Learning distributions of shape trajectories from longitudinal datasets: a hierarchical model on a manifold of diffeomorphisms [J].
Bone, Alexandre ;
Colliot, Olivier ;
Durrleman, Stanley .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :9271-9280
[5]   Scene classification using a hybrid generative/discriminative approach [J].
Bosch, Anna ;
Zisserman, Andrew ;
Munoz, Xavier .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (04) :712-727
[6]   Contextualizing Object Detection and Classification [J].
Chen, Qiang ;
Song, Zheng ;
Dong, Jian ;
Huang, Zhongyang ;
Hua, Yang ;
Yan, Shuicheng .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (01) :13-27
[7]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[8]  
Chollet F, IEEE C COMP VIS PATT, P1800
[9]   Convolutional Neural Network With Data Augmentation for SAR Target Recognition [J].
Ding, Jun ;
Chen, Bo ;
Liu, Hongwei ;
Huang, Mengyuan .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2016, 13 (03) :364-368
[10]  
Ding XH, 2018, AAAI CONF ARTIF INTE, P6797