Automatic Compression Ratio Allocation for Pruning Convolutional Neural Networks

被引:0
|
作者
Liu, Yunfeng [1 ]
Kong, Huihui [1 ]
Yu, Peihua [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China
来源
ICVISP 2019: PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON VISION, IMAGE AND SIGNAL PROCESSING | 2019年
基金
中国国家自然科学基金;
关键词
Neural Networks; Network Pruning; Model Compression; Computer Vision;
D O I
10.1145/3387168.3387184
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Convolutional neural networks (CNNs) have demonstrated significant performance improvement in many application scenarios. However, the high computational complexity and model size have limited its application on the mobile and embedded devices. Various approaches have been proposed to compress CNNs. Filter pruning is widely considered as a promising solution, which can significantly speed up the inference and reduce memory consumption. To this end, most approaches tend to prune filters by manually allocating compression ratio, which highly relies on individual expertise and not friendly to non-professional users. In this paper, we propose an Automatic Compression Ratio Allocation (ACRA) scheme based on binary search algorithm to prune convolutional neural networks. Specifically, ACRA provides two strategies for allocating compression ratio automatically. First, uniform pruning strategy allocates the same compression ratio to each layer, which is obtained by binary search based on target FLOPs reduction of the whole networks. Second, sensitivity-based pruning strategy allocates appropriate compression ratio to each layer based on the sensitivity to accuracy. Experimental results from VGG11 and VGG-16, demonstrate that our scheme can reduce FLOPs significantly while maintaining a high accuracy level. Specifically, for the VGG16 on CIFAR-10 dataset, we reduce 29.18% FLOPs with only 1.24% accuracy decrease.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] Automatic Habitat Mapping using Convolutional Neural Networks
    Diegues, Andre
    Pinto, Jose
    Ribeiro, Pedro
    2018 IEEE/OES AUTONOMOUS UNDERWATER VEHICLE WORKSHOP (AUV), 2018,
  • [32] Automatic vehicle type classification with convolutional neural networks
    Roecker, Max N.
    Costa, Yandre M. G.
    Almeida, Joao L. R.
    Matsushita, Gustavo H. G.
    2018 25TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2018,
  • [33] New pruning techniques for constructive neural networks with application to image compression
    Ma, L
    Khorasani, K
    SIGNAL PROCESSING, SENSOR FUSION, AND TARGET RECOGNITION IX, 2000, 4052 : 298 - 308
  • [34] Compression of Deep Neural Networks by combining pruning and low rank decomposition
    Goyal, Saurabh
    Choudhury, Anamitra Roy
    Sharma, Vivek
    2019 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2019, : 952 - 958
  • [35] An optimal-score-based filter pruning for deep convolutional neural networks
    Sawant, Shrutika S.
    Bauer, J.
    Erick, F. X.
    Ingaleshwar, Subodh
    Holzer, N.
    Ramming, A.
    Lang, E. W.
    Goetz, Th
    APPLIED INTELLIGENCE, 2022, 52 (15) : 17557 - 17579
  • [36] FILTER PRUNING BASED ON LOCAL GRADIENT ACTIVATION MAPPING IN CONVOLUTIONAL NEURAL NETWORKS
    Intraraprasit, Monthon
    Chitsobhuk, Orachat
    INTERNATIONAL JOURNAL OF INNOVATIVE COMPUTING INFORMATION AND CONTROL, 2023, 19 (06): : 1697 - 1715
  • [37] CHANNEL PRUNING VIA GRADIENT OF MUTUAL INFORMATION FOR LIGHTWEIGHT CONVOLUTIONAL NEURAL NETWORKS
    Lee, Min Kyu
    Lee, Seunghyun
    Lee, Sang Hyuk
    Song, Byung Cheol
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1751 - 1755
  • [38] A Low-Complexity Modified ThiNet Algorithm for Pruning Convolutional Neural Networks
    Tofigh, Sadegh
    Ahmad, M. Omair
    Swamy, M. N. S.
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 1012 - 1016
  • [39] Filter pruning via annealing decaying for deep convolutional neural networks acceleration
    Huang, Jiawen
    Xiong, Liyan
    Huang, Xiaohui
    Chen, Qingsen
    Huang, Peng
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2025, 28 (02):
  • [40] An optimal-score-based filter pruning for deep convolutional neural networks
    Shrutika S. Sawant
    J. Bauer
    F. X. Erick
    Subodh Ingaleshwar
    N. Holzer
    A. Ramming
    E. W. Lang
    Th. Götz
    Applied Intelligence, 2022, 52 : 17557 - 17579