Holistic Filter Pruning for Efficient Deep Neural Networks

Cited by: 13

Authors:

Enderich, Lukas [1]

Timm, Fabian [2]

Burgard, Wolfram [3]

Affiliations:

[1] Robert Bosch GmbH, D-71229 Leonberg, Germany

[2] Robert Bosch GmbH, D-71272 Renningen, Germany

[3] Univ Freiburg, D-79110 Freiburg, Germany

Source:

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2021) | 2021

DOI:

10.1109/WACV48630.2021.00264

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Deep neural networks (DNNs) are usually over-parameterized to increase the likelihood of getting adequate initial weights by random initialization. Consequently, trained DNNs have many redundancies which can be pruned from the model to reduce complexity and improve the ability to generalize. Structural sparsity, as achieved by filter pruning, directly reduces the tensor sizes of weights and activations and is thus particularly effective for reducing complexity. We propose Holistic Filter Pruning (HFP), a novel approach for common DNN training that is easy to implement and makes it possible to specify accurate pruning rates for the number of both parameters and multiplications. After each forward pass, the current model size is calculated and compared to the desired target size. By gradient descent, a global solution can be found that allocates the pruning budget over the individual layers such that the desired target size is met. In various experiments, we give insights into the training and achieve state-of-the-art performance on CIFAR-10 and ImageNet.
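
The size check described in the abstract (compute the current model size, compare it to the target) can be illustrated with a small sketch. This is not the authors' implementation: the `ConvLayer` type, the `model_size` function, the two-layer example network, and the 0.3 target are all hypothetical, biases are ignored, and the gradient-based allocation of the pruning budget is not shown. It only shows how the numbers of parameters and multiplications of a plain chain of convolutions depend on how many filters each layer keeps.

```python
from dataclasses import dataclass


@dataclass
class ConvLayer:
    """Hypothetical description of one convolution in a plain layer chain."""
    in_channels: int   # input channels before pruning
    out_channels: int  # number of filters before pruning
    kernel_size: int   # square kernel, k x k
    out_hw: int        # output feature map is out_hw x out_hw


def model_size(layers, kept):
    """Return (parameters, multiplications) if layer i keeps kept[i] filters.

    Removing filters from one layer also removes the matching input
    channels of the next layer, so both tensor dimensions shrink.
    Biases are ignored (an assumption of this sketch).
    """
    params = 0
    mults = 0
    c_in = layers[0].in_channels
    for layer, c_out in zip(layers, kept):
        weights = c_in * c_out * layer.kernel_size ** 2
        params += weights
        mults += weights * layer.out_hw ** 2  # one multiply per weight per output position
        c_in = c_out
    return params, mults


# Hypothetical two-layer network and pruning configuration.
layers = [ConvLayer(3, 16, 3, 32), ConvLayer(16, 32, 3, 16)]
full_params, full_mults = model_size(layers, [l.out_channels for l in layers])
cur_params, cur_mults = model_size(layers, [8, 16])

target = 0.3  # assumed target: 30% of the original size for both measures
param_gap = cur_params / full_params - target
mult_gap = cur_mults / full_mults - target
print(f"parameter gap: {param_gap:+.3f}, multiplication gap: {mult_gap:+.3f}")
```

A positive gap means the current configuration is still larger than the target for that measure, and a negative gap means it is already smaller. Halving the filters in both layers here reduces parameters and multiplications by different factors, which is why the two rates are tracked separately.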

Pages: 2595-2604

Page count: 10
