Learning to Prune Deep Neural Networks via Layer-wise Optimal Brain Surgeon

Cited by: 0
Authors
Dong, Xin [1]
Chen, Shangyu [1]
Pan, Sinno Jialin [1]
Affiliations
[1] Nanyang Technological University, Singapore
Source
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017) | 2017, Vol. 30
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Developing slim yet accurate deep neural networks has become crucial for real-world applications, especially those deployed in embedded systems. Although previous work along this line has shown promising results, most existing methods either fail to significantly compress a well-trained deep network or require heavy retraining of the pruned network to restore its prediction performance. In this paper, we propose a new layer-wise pruning method for deep neural networks. In our method, the parameters of each individual layer are pruned independently, based on second-order derivatives of a layer-wise error function with respect to the corresponding parameters. We prove that the final drop in prediction performance after pruning is bounded by a linear combination of the reconstruction errors incurred at each layer. By controlling the layer-wise errors properly, one only needs to perform light retraining on the pruned network to recover its original prediction performance. We conduct extensive experiments on benchmark datasets to demonstrate the effectiveness of our pruning method compared with several state-of-the-art baseline methods. Code for our work is released at: https://github.com/csyhhu/L-OBS.
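To make the idea of second-order, layer-wise pruning concrete, the following is a minimal sketch of one Optimal-Brain-Surgeon-style pruning step on a single linear unit. It is not the authors' implementation (their code is at the URL above). The function name `obs_prune_one`, the `damp` parameter, and the specific layer-wise error (half the mean squared change in the unit's output, whose Hessian is `X.T @ X / n`) are assumptions chosen for illustration.

```python
import numpy as np

def obs_prune_one(w, X, damp=1e-6):
    """Prune one weight of a linear unit with an OBS-style criterion.

    w: (d,) weight vector; X: (n, d) inputs to the layer.
    Assumed layer-wise error: E(dw) = mean((X @ dw) ** 2) / 2,
    so its Hessian with respect to w is H = X^T X / n.
    """
    n, d = X.shape
    # A small damping term keeps H invertible (an illustrative choice).
    H = X.T @ X / n + damp * np.eye(d)
    H_inv = np.linalg.inv(H)
    diag = np.diag(H_inv)
    # Saliency: estimated increase in layer-wise error if weight q is removed.
    saliency = w ** 2 / (2.0 * diag)
    q = int(np.argmin(saliency))
    # Compensating update on the remaining weights; it drives w[q] to zero.
    w_new = w - (w[q] / diag[q]) * H_inv[:, q]
    w_new[q] = 0.0  # remove floating-point residue
    return w_new, q, float(saliency[q])

def layer_error(w_old, w_new, X):
    """Half the mean squared change in the unit's output."""
    return float(np.mean((X @ (w_new - w_old)) ** 2) / 2.0)

rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))
w = np.array([1.0, -0.5, 0.02, 2.0])

w_pruned, q, predicted = obs_prune_one(w, X)
w_naive = w.copy()
w_naive[q] = 0.0  # magnitude-style removal without compensation

print("pruned index:", q)
print("predicted error:", predicted)
print("OBS error:", layer_error(w, w_pruned, X))
print("naive error:", layer_error(w, w_naive, X))
```

Because the assumed error is quadratic, the saliency closely matches the error actually incurred, and the compensated removal never does worse than simply zeroing the same weight. Pruning a full layer would repeat this selection over all of the layer's weights, which this sketch does not attempt.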
Pages: 11