SASL: Saliency-Adaptive Sparsity Learning for Neural Network Acceleration

被引:21
|
作者
Shi, Jun [1 ]
Xu, Jianfeng [2 ]
Tasaka, Kazuyuki [2 ]
Chen, Zhibo [1 ]
机构
[1] Univ Sci & Technol China, CAS Key Lab Technol Geospatial Informat Proc & Ap, Hefei 230027, Peoples R China
[2] KDDI Res Inc, Fujimino 3568502, Japan
关键词
Training; Acceleration; Biological neural networks; Optimization; Computational modeling; Predictive models; Convolutional neural network (CNN); sparsity learning; adaptive; acceleration; compression;
D O I
10.1109/TCSVT.2020.3013170
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Accelerating the inference of CNNs is critical to their deployment in real-world applications. Among all pruning approaches, the methods of implementing a sparsity learning framework have shown effectiveness as they learn and prune the models in an end-to-end data-driven manner. However, these works impose the same sparsity regularization on all filters indiscriminately, which can hardly result in an optimal structure-sparse network. In this paper, we propose a Saliency-Adaptive Sparsity Learning (SASL) approach for further optimization. A novel and effective estimation of each filter, i.e., saliency, is designed, which is measured from two aspects: the importance for prediction performance and the consumed computational resources. During sparsity learning, the regularization strength is adjusted according to the saliency, so our optimized format can better preserve the prediction performance while zeroing out more computation-heavy filters. The calculation for saliency introduces minimum overhead to the training process, which means our SASL is very efficient. During the pruning phase, in order to optimize the proposed data-dependent criterion, a hard sample mining strategy is utilized, which shows higher effectiveness and efficiency. Extensive experiments demonstrate the superior performance of our method. Notably, on ILSVRC-2012 dataset, our approach can reduce 49.7% FLOPs of ResNet-50 with very negligible 0.39% top-1 and 0.05% top-5 accuracy degradation.
引用
收藏
页码:2008 / 2019
页数:12
相关论文
共 50 条
  • [21] LassoNet: A Neural Network with Feature Sparsity
    Lemhadri, Ismael
    Ruan, Feng
    Abraham, Louis
    Tibshirani, Robert
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [22] FUZZY ADAPTIVE LEARNING CONTROL NETWORK WITH ONLINE NEURAL LEARNING
    LIN, CT
    LIN, CJ
    LEE, CSG
    FUZZY SETS AND SYSTEMS, 1995, 71 (01) : 25 - 45
  • [23] Adaptive neural network control and learning for robot manipulator
    Wu, Y. (xyuwu@scut.edu.cn), 1600, Chinese Mechanical Engineering Society (49):
  • [24] Neural network motion controller with fuzzy adaptive learning
    Jezernik, Karel
    Rodic, Miran
    Safaric, Riko
    Elektrotehniski Vestnik/Electrotechnical Review, 1995, 62 (3-4): : 182 - 190
  • [25] Recurrent Neural Network Learning by Adaptive Genetic Operators
    Chihi, Hanen
    Arous, Najet
    2012 6TH INTERNATIONAL CONFERENCE ON SCIENCES OF ELECTRONICS, TECHNOLOGIES OF INFORMATION AND TELECOMMUNICATIONS (SETIT), 2012, : 832 - 834
  • [26] Adaptive learning schemes for the modified probabilistic neural network
    Zaknich, A
    Desilva, CJS
    ICA(3)PP 97 - 1997 3RD INTERNATIONAL CONFERENCE ON ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, 1997, : 597 - 610
  • [27] Backpropagation Neural Network with Adaptive Learning Rate for Classification
    Jullapak, Rujira
    Thammano, Arit
    ADVANCES IN NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, ICNC-FSKD 2022, 2023, 153 : 493 - 499
  • [28] NONLINEAR DYNAMIC CALIBRATION AND CORRECTION OF ACCELERATION SENSOR BASED ON ADAPTIVE NEURAL NETWORK
    Xiao, Shuo
    Wang, Shengzhi
    Zhuang, Jiayu
    Huang, Zhenzhen
    Zhang, Guopeng
    FRACTALS-COMPLEX GEOMETRY PATTERNS AND SCALING IN NATURE AND SOCIETY, 2022, 30 (02)
  • [29] Learning the Sparsity for ReRAM: Mapping and Pruning Sparse Neural Network for ReRAM based Accelerator
    Lin, Jilan
    Zhu, Zhenhua
    Wang, Yu
    Xie, Yuan
    24TH ASIA AND SOUTH PACIFIC DESIGN AUTOMATION CONFERENCE (ASP-DAC 2019), 2019, : 639 - 644
  • [30] Objects Classification by Learning-Based Visual Saliency Model and Convolutional Neural Network
    Li, Na
    Zhao, Xinbo
    Yang, Yongjia
    Zou, Xiaochun
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2016, 2016