Combination pruning method based on reinforcement learning and 3σ criterion

被引:2
作者
Xu S.-M. [1 ]
Li Y. [1 ]
Yuan Q.-L. [1 ]
机构
[1] School of Information Science and Engineering, East China University of Science and Technology, Shanghai
来源
Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science) | 2023年 / 57卷 / 03期
关键词
combination pruning; deep neural network; model compression; reinforcement learning; sparsity rate;
D O I
10.3785/j.issn.1008-973X.2023.03.006
中图分类号
学科分类号
摘要
In order to resolve the problem that deep neural network with complex structure and redundant parameters could not be deployed to the resource constrained embedded system, an efficient combination pruning method based on reinforcement learning and 3σ criterion was proposed, which was inspired by the effect of sparsity rate on performance. Firstly, an optimal global sparsity rate was determined according to the influence of sparsity rate on accuracy, which could achieve a good balance between sparsity rate and accuracy. Secondly, under the guidance of optimal global sparsity rate, the reinforcement learning method was used to search the optimal pruning rate of each convolutional layer automatically, and the unimportant weights were cut off on the basis of the pruning rate. Then, the weight pruning threshold of each fully connected layer was determined by 3σ criterion, and for each fully connected layer, the weight which below the threshold would be pruned. Finally, the accuracy of model recognition was restored by retraining. Experimental results showed that the proposed pruning method could compress the parameters of VGG16, ResNet56 and ResNet50 network by 83.33%, 70.1% and 80.9% respectively, and the model’s recognition accuracy could be reduced by 1.55%, 1.98% and 1.86% respectively. © 2023 Zhejiang University. All rights reserved.
引用
收藏
页码:486 / 494
页数:8
相关论文
共 34 条
  • [1] KRIZHEVSKY A, SUTSKEVER I, HINTON G E., ImageNet classification with deep convolutional neural networks [C], Proceedings of the 25th International Conference on Neural Information Processing Systems, pp. 1097-1105, (2012)
  • [2] ZHANG L B, HUANG S L, LIU W., Intra-class part swapping for fine-grained image classification [C], Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 3209-3218, (2021)
  • [3] REN S K, HE K M, GIRSHICK R, Et al., Object detection networks on convolutional feature maps [J], IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 7, pp. 1476-1481, (2017)
  • [4] REDMON J, DIVVALA S, GIRSHICK R, Et al., You only look once: unified, real-time object detection [C], Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 779-788, (2016)
  • [5] BLAIVAS M, ARNTFIELD R, WHITE M., Creation and testing of a deep learning algorithm to automatically identify and label vessels, nerves, tendons, and bones on cross-sectional point-of-care ultrasound scans for peripheral intravenous catheter placement by novices [J], Journal of Ultrasound in Medicine, 39, 9, pp. 1721-1727, (2020)
  • [6] LONG J, SHELHAMER E, DARRELL T., Fully convolutional networks for semantic segmentation [C], Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431-3440, (2015)
  • [7] TANZI L, PIAZZOLLA P, PORPIGLIA F, Et al., Real-time deep learning semantic segmentation during intra-operative surgery for 3D augmented reality assistance [J], International Journal of Computer Assisted Radiology and Surgery, 16, pp. 1435-1445, (2021)
  • [8] ZHANG Zhe-han, FANG Wei, DU Li-li, Et al., Semantic segmentation of remote sensing image based on coding-decoding convolutional neural network [J], Acta Optica Sinica, 40, 3, pp. 46-55, (2020)
  • [9] LV Yong-fa, Mobile phone surface detect detection algorithm based on deep learning, (2020)
  • [10] JIANG Y, WANG W, ZHAO C., A machine vision-based realtime anomaly detection method for industrial products using deep learning [C], 2019 Chinese Automation Congress, pp. 4842-4847, (2019)