Adversarial Network Compression

被引:8
作者
Belagiannis, Vasileios [1 ]
Farshad, Azade [1 ,2 ]
Galasso, Fabio [1 ]
机构
[1] Innovat OSRAM GmbH, Garching, Germany
[2] Tech Univ Munich, Garching, Germany
来源
COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV | 2019年 / 11132卷
关键词
D O I
10.1007/978-3-030-11018-5_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network compression has recently received much attention due to the computational requirements of modern deep models. In this work, our objective is to transfer knowledge from a deep and accurate model to a smaller one. Our contributions are threefold: (i) we propose an adversarial network compression approach to train the small student network to mimic the large teacher, without the need for labels during training; (ii) we introduce a regularization scheme to prevent a trivially-strong discriminator without reducing the network capacity and (iii) our approach generalizes on different teacher-student models. In an extensive evaluation on five standard datasets, we show that our student has small accuracy drop, achieves better performance than other knowledge transfer approaches and it surpasses the performance of the same network trained with labels. In addition, we demonstrate state-of-the-art results compared to other compression strategies.
引用
收藏
页码:431 / 449
页数:19
相关论文
共 65 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]  
[Anonymous], 2018, ARXIV180103924
[3]  
[Anonymous], 2017, ARXIV170205464
[4]  
[Anonymous], 2016, ARXIV161105431
[5]  
[Anonymous], 2016, ARXIV160600915
[6]  
[Anonymous], 2016, INT C LEARNING REPRE
[7]  
[Anonymous], 2016, LECT NOTES COMPUT SC, DOI DOI 10.1007/978-3-319-46484-8_29
[8]  
[Anonymous], 2016, BMVC
[9]  
[Anonymous], 2017, NISP PRUNING NETWORK
[10]  
[Anonymous], 2018, P INT C LEARN REPR M