Robust Neural Pruning with Gradient Sampling Optimization for Residual Neural Networks

Cited by: 0
Authors
Yun, Juyoung [1]
Affiliation
[1] SUNY Stony Brook, Dept Comp Sci, New York, NY 11794 USA
Source
2024 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN 2024 | 2024
Keywords
Neural Networks; Optimization; Neural Pruning
DOI
10.1109/IJCNN60899.2024.10650301
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This research embarks on pioneering the integration of gradient sampling optimization techniques, particularly StochGradAdam, into the pruning process of neural networks. Our main objective is to address the significant challenge of maintaining accuracy in pruned neural models, critical in resource-constrained scenarios. Through extensive experimentation, we demonstrate that gradient sampling significantly preserves accuracy during and after the pruning process compared to traditional optimization methods. Our study highlights the pivotal role of gradient sampling in robust learning and maintaining crucial information post substantial model simplification. The results across CIFAR-10 datasets and residual neural architectures validate the versatility and effectiveness of our approach. This work presents a promising direction for developing efficient neural networks without compromising performance, even in environments with limited computational resources.
Pages: 10