DMCP: Differentiable Markov Channel Pruning for Neural Networks

被引：184

作者：

Guo, Shaopeng ^{[1
]}

Wang, Yujie ^{[1
]}

Li, Quanquan ^{[1
]}

Yan, Junjie ^{[1
]}

机构：

[1] SenseTime Res, Hong Kong, Peoples R China

来源：

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2020年

关键词：

D O I：

10.1109/CVPR42600.2020.00161

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Recent works imply that the channel pruning can be regarded as searching optimal sub-structure from unpruned networks. However, existing works based on this observation require training and evaluating a large number of structures, which limits their application. In this paper, we propose a novel differentiable method for channel pruning, named Differentiable Markov Channel Pruning (DMCP), to efficiently search the optimal sub-structure. Our method is differentiable and can be directly optimized by gradient descent with respect to standard task loss and budget regularization (e.g. FLOPs constraint). In DMCP, we model the channel pruning as a Markov process, in which each state represents for retaining the corresponding channel during pruning, and transitions between states denote the pruning process. In the end, our method is able to implicitly select the proper number of channels in each layer by the Markov process with optimized transitions. To validate the effectiveness of our method, we perform extensive experiments on Imagenet with ResNet and MobilenetV2. Results show our method can achieve consistent improvement than state-of-the-art pruning methods in various FLOPs settings.

引用

页码：1536 / 1544

页数：9

共 23 条

[1]

[Anonymous], 2017, P INT C LEARN REPR T

[2]

[Anonymous], 2016, INT C LEARNING REPRE

[3]

[Anonymous], 2018, PROC INT C LEARN REP

[4]

[Anonymous], 1993, NIPS 1993

[5]

Ding Xiaohan, 2019, ABS190504748 CORR

[6]

Durdanovic Igor, 2016, ABS160808710 CORR

[7] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[8]

He Y., 2018, arXiv:1808.07471

[9]

He Yang, 2018, ABS181100250 CORR

[10] AMC: AutoML for Model Compression and Acceleration on Mobile Devices [J].

He, Yihui ;

Lin, Ji ;

Liu, Zhijian ;

Wang, Hanrui ;

Li, Li-Jia ;

Han, Song .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :815-832

← 1 2 3 →