Adversarial Network Compression

被引：8

作者：

Belagiannis, Vasileios ^{[1
]}

Farshad, Azade ^{[1
,2
]}

Galasso, Fabio ^{[1
]}

机构：

[1] Innovat OSRAM GmbH, Garching, Germany

[2] Tech Univ Munich, Garching, Germany

来源：

COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV | 2019年 / 11132卷

关键词：

D O I：

10.1007/978-3-030-11018-5_37

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Neural network compression has recently received much attention due to the computational requirements of modern deep models. In this work, our objective is to transfer knowledge from a deep and accurate model to a smaller one. Our contributions are threefold: (i) we propose an adversarial network compression approach to train the small student network to mimic the large teacher, without the need for labels during training; (ii) we introduce a regularization scheme to prevent a trivially-strong discriminator without reducing the network capacity and (iii) our approach generalizes on different teacher-student models. In an extensive evaluation on five standard datasets, we show that our student has small accuracy drop, achieves better performance than other knowledge transfer approaches and it surpasses the performance of the same network trained with labels. In addition, we demonstrate state-of-the-art results compared to other compression strategies.

引用

页码：431 / 449

页数：19

共 65 条

[1]

Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265

[2]

[Anonymous], 2018, ARXIV180103924

[3]

[Anonymous], 2017, ARXIV170205464

[4]

[Anonymous], 2016, ARXIV161105431

[5]

[Anonymous], 2016, ARXIV160600915

[6]

[Anonymous], 2016, INT C LEARNING REPRE

[7]

[Anonymous], 2016, LECT NOTES COMPUT SC, DOI DOI 10.1007/978-3-319-46484-8_29

[8]

[Anonymous], 2016, BMVC

[9]

[Anonymous], 2017, NISP PRUNING NETWORK

[10]

[Anonymous], 2018, P INT C LEARN REPR M

← 1 2 3 4 5 6 7 →