Anonymous Model Pruning for Compressing Deep Neural Networks

被引:6
作者
Zhang, Lechun [1 ,2 ]
Chen, Guangyao [2 ]
Shi, Yemin [2 ]
Zhang, Quan [2 ]
Tan, Mingkui [4 ]
Wang, Yaowei [2 ,3 ]
Tian, Yonghong [2 ,3 ]
Huang, Tiejun [2 ,3 ]
机构
[1] Peking Univ, Sch ECE, Shenzhen Grad Sch, Shenzhen, Peoples R China
[2] Peking Univ, Inst Digital Media, Beijing 100871, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[4] South China Univ Technol, Guangzhou 518055, Peoples R China
来源
THIRD INTERNATIONAL CONFERENCE ON MULTIMEDIA INFORMATION PROCESSING AND RETRIEVAL (MIPR 2020) | 2020年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
network compression; knowledge distillation; pruning;
D O I
10.1109/MIPR49039.2020.00040
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Many deep neural network compression algorithms need to fine-tune on source dataset, which makes them unpractical when the source datasets are unavailable. Although data-free methods can overcome this problem, they often suffer from a huge loss of accuracy. In this paper, we propose a novel approach named Anonymous-Model Pruning (AMP), which seeks to compress the network without the source data and the accuracy can be guaranteed without too much loss. AMP compresses deep neural networks via searching pruning rate automatically and fine-tuning the compressed model under the teacher-student diagram. The key innovations are that the pruning rate is automatically determined, and the fine-tuning process is under the guidance of uncompressed network instead of labels. Even without the source dataset, compared with existing pruning methods, our proposed method can still achieve comparable accuracy with similar pruning rate. For example, for ResNet50, our AMP method only incur 0.76% loss in top-1 accuracy with 32.72% pruning rate.
引用
收藏
页码:161 / 164
页数:4
相关论文
共 22 条
[1]  
[Anonymous], 5 INT C LEARN REPPR
[2]  
[Anonymous], 2016, INT C LEARNING REPRE
[3]  
[Anonymous], INT C MACHINE LEARNI
[4]  
[Anonymous], 2015, ACS SYM SER
[5]  
Ba LJ, 2014, ADV NEUR IN, V27
[6]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Hao Z., 2016, ECCV
[9]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[10]  
He Y., 2018, arXiv:1808.07471