Retraining-free methods for fast on-the-fly pruning of convolutional neural networks

被引:11
作者
Ashouri, Amir H. [1 ]
Abdelrahman, Tarek S. [2 ,3 ]
Dos Remedios, Alwyn [4 ]
机构
[1] Univ Toronto, ECE Dept, Toronto, ON, Canada
[2] Univ Toronto, Elect & Comp Engn, Toronto, ON, Canada
[3] Univ Toronto, Comp Sci, Toronto, ON, Canada
[4] Qualcomm Inc, Markham, ON, Canada
关键词
Deep learning; Convolutional neural networks; Sparsity; Pruning;
D O I
10.1016/j.neucom.2019.08.063
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We explore retraining-free pruning of CNNs. We propose and evaluate three model-independent methods for sparsification of model weights. Our methods are magnitude-based, efficient, and can be applied on-the-fly during model load time, which is necessary in some deployment contexts. We evaluate the effectiveness of these methods in introducing sparsity with minimal loss of inference accuracy using five state-of-the-art pretrained CNNs. The evaluation shows that the methods reduce the number of weights by up to 73% (i.e., compression factor of 3.7 x) without incurring more than 5% loss in Top-5 accuracy. These results also hold for quantized versions of the CNNs. We develop a classifier to determine which of the three methods is most suited for a given model. Finally, we employ additional, but impractical in our deployment context, fine-tuning and show that it gains only 8% in sparsity. This indicates that our on-the-fly methods capture much of the sparsity than can be attained without retraining, yet remain efficient and straight-forward to use. (C) 2019 Elsevier B.V. All rights reserved.
引用
收藏
页码:56 / 69
页数:14
相关论文
共 50 条
[41]   TOWARDS THINNER CONVOLUTIONAL NEURAL NETWORKS THROUGH GRADUALLY GLOBAL PRUNING [J].
Wang, Zhengtao ;
Zhu, Ce ;
Xia, Zhiqiang ;
Guo, Qi ;
Liu, Yipeng .
2017 24TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2017, :3939-3943
[42]   Hybrid-Grained Pruning and Hardware Acceleration for Convolutional Neural Networks [J].
Li, Yu ;
Cao, Shan ;
Zhao, Beining ;
Zhang, Wei ;
Jiang, Zhiyuan .
2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
[43]   LEAST-SQUARES BASED LAYERWISE PRUNING OF CONVOLUTIONAL NEURAL NETWORKS [J].
Mauch, Lukas ;
Yang, Bin .
2018 IEEE STATISTICAL SIGNAL PROCESSING WORKSHOP (SSP), 2018, :60-64
[44]   Compression of Deep Convolutional Neural Networks Using Effective Channel Pruning [J].
Guo, Qingbei ;
Wu, Xiao-Jun ;
Zhao, Xiuyang .
IMAGE AND GRAPHICS, ICIG 2019, PT I, 2019, 11901 :760-772
[45]   Selective Pruning of Sparsity-Supported Energy-Efficient Accelerator for Convolutional Neural Networks [J].
Liu, Chia-Chi ;
Zhang, Xuezhi ;
Wey, I-Chyn ;
Teo, T. Hui .
2023 IEEE 16TH INTERNATIONAL SYMPOSIUM ON EMBEDDED MULTICORE/MANY-CORE SYSTEMS-ON-CHIP, MCSOC, 2023, :454-461
[46]   A fast magnitude estimation method based on deep convolutional neural networks [J].
Wang ZiFa ;
Liao JiAn ;
Wang YanWei ;
Wei DongLiang ;
Zhao DengKe .
CHINESE JOURNAL OF GEOPHYSICS-CHINESE EDITION, 2023, 66 (01) :272-288
[47]   DeepQGHO: Quantized Greedy Hyperparameter Optimization in Deep Neural Networks for on-the-Fly Learning [J].
Chowdhury, Anjir Ahmed ;
Hossen, Md Abir ;
Azam, Md Ali ;
Rahman, Md Hafizur .
IEEE ACCESS, 2022, 10 :6407-6416
[48]   Sequential Convolutional Recurrent Neural Networks for Fast Automatic Modulation Classification [J].
Liao, Kaisheng ;
Zhao, Yaodong ;
Gu, Jie ;
Zhang, Yaping ;
Zhong, Yi .
IEEE ACCESS, 2021, 9 :27182-27188
[49]   FAST AIRCRAFT DETECTION IN SATELLITE IMAGES BASED ON CONVOLUTIONAL NEURAL NETWORKS [J].
Wu, Hui ;
Zhang, Hui ;
Zhang, Jinfang ;
Xu, Fanjiang .
2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, :4210-4214
[50]   DeepStreamOS: Fast open-Set classification for convolutional neural networks [J].
Chambers, Lorraine ;
Gaber, Mohamed Medhat .
PATTERN RECOGNITION LETTERS, 2022, 154 :75-82