A One-step Pruning-recovery Framework for Acceleration of Convolutional Neural Networks

被引:0
|
作者
Wang, Dong [1 ]
Bai, Xiao [1 ]
Zhou, Lei [1 ]
Zhou, Jun [2 ]
机构
[1] Beihang Univ, Sch Comp Sci & Engn, Beijing Adv Innovat Ctr Big Data & Brain Comp, Jiangxi Res Inst, Beijing, Peoples R China
[2] Griffith Univ, Sch Informat & Commun Technol, Nathan, Qld, Australia
来源
2019 IEEE 31ST INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2019) | 2019年
基金
中国国家自然科学基金;
关键词
filter pruning; network pruning; cnn acceleration;
D O I
10.1109/ICTAI.2019.00111
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Acceleration of convolutional neural network has received increasing attention during the past several years. Among various acceleration techniques, filter pruning has its inherent merit by effectively reducing the number of convolution filters. However, most filter pruning methods resort to tedious and time-consuming layer-by-layer pruning-recovery strategy to avoid a significant drop of accuracy. In this paper, we present an efficient filter pruning framework to solve this problem. Our method accelerates the network in one-step pruning-recovery manner with a novel optimization objective function, which achieves higher accuracy with much less cost compared with existing pruning methods. Furthermore, our method allows network compression with global filter pruning. Given a global pruning rate, it can adaptively determine the pruning rate for each single convolutional layer, while these rates are often set as hyper-parameters in previous approaches. Evaluated on VGG-16 and RcsNct-50 using ImageNet, our approach outperforms several state-of-the-art methods with less accuracy drop under the same and even much fewer floating-point operations (FLOPs).
引用
收藏
页码:768 / 775
页数:8
相关论文
共 45 条
  • [1] Filter pruning via annealing decaying for deep convolutional neural networks acceleration
    Huang, Jiawen
    Xiong, Liyan
    Huang, Xiaohui
    Chen, Qingsen
    Huang, Peng
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
  • [2] Empirical evaluation of filter pruning methods for acceleration of convolutional neural network
    Dheeraj Kumar
    Mayuri A. Mehta
    Vivek C. Joshi
    Rachana S. Oza
    Ketan Kotecha
    Jerry Chun-Wei Lin
    Multimedia Tools and Applications, 2024, 83 : 54699 - 54727
  • [3] HILP: hardware-in-loop pruning of convolutional neural networks towards inference acceleration
    Dong Li
    Qianqian Ye
    Xiaoyue Guo
    Yunda Sun
    Li Zhang
    Neural Computing and Applications, 2024, 36 : 8825 - 8842
  • [4] HILP: hardware-in-loop pruning of convolutional neural networks towards inference acceleration
    Li, Dong
    Ye, Qianqian
    Guo, Xiaoyue
    Sun, Yunda
    Zhang, Li
    NEURAL COMPUTING & APPLICATIONS, 2024, 36 (15) : 8825 - 8842
  • [5] Empirical evaluation of filter pruning methods for acceleration of convolutional neural network
    Kumar, Dheeraj
    Mehta, Mayuri A.
    Joshi, Vivek C.
    Oza, Rachana S.
    Kotecha, Ketan
    Lin, Jerry Chun-Wei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (18) : 54699 - 54727
  • [6] Fpar: filter pruning via attention and rank enhancement for deep convolutional neural networks acceleration
    Chen, Yanming
    Wu, Gang
    Shuai, Mingrui
    Lou, Shubin
    Zhang, Yiwen
    An, Zhulin
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2024, 15 (07) : 2973 - 2985
  • [7] Blending Pruning Criteria for Convolutional Neural Networks
    He, Wei
    Huang, Zhongzhan
    Liang, Mingfu
    Liang, Senwei
    Yang, Haizhao
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 3 - 15
  • [8] Discriminative Layer Pruning for Convolutional Neural Networks
    Jordao, Artur
    Lie, Maiko
    Schwartz, William Robson
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2020, 14 (04) : 828 - 837
  • [9] CAPTOR: A Class Adaptive Filter Pruning Framework for Convolutional Neural Networks in Mobile Applications
    Qin, Zhuwei
    Yu, Fuxun
    Liu, Chenchen
    Chen, Xiang
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 444 - 449
  • [10] FPC: Filter pruning via the contribution of output feature map for deep convolutional neural networks acceleration
    Chen, Yanming
    Wen, Xiang
    Zhang, Yiwen
    He, Qiang
    KNOWLEDGE-BASED SYSTEMS, 2022, 238