Teacher-free Distillation via Regularizing Intermediate Representation

Cited by: 4
Authors
Li, Lujun [1]
Liang, Shiuan-Ni [1]
Yang, Ya [1]
Jin, Zhe [2]
Affiliations
[1] Monash Univ, Sch Engn, Subang Jaya, Malaysia
[2] Anhui Univ, Sch Artificial Intelligence, Hefei, Peoples R China
Source
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022
Keywords
Knowledge distillation;
DOI
10.1109/IJCNN55064.2022.9892575
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feature distillation always leads to significant performance improvements, but requires extra training budgets. To address the problem, we propose TFD, a simple and effective Teacher-Free Distillation framework, which seeks to reuse the privileged features within the student network itself. Specifically, TFD squeezes feature knowledge in the deeper layers into the shallow ones by minimizing feature loss. Thanks to the narrow gap between these self-features, TFD only needs to adopt a simple ℓ2 loss without complex transformations. Extensive experiments on recognition benchmarks show that our framework can achieve performance superior to teacher-based feature distillation methods. On the ImageNet dataset, our approach achieves 0.8% gains for ResNet18, which surpasses other state-of-the-art training techniques.
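The feature loss described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it reads the feature loss as a mean squared (L2) difference, assumes the shallow and deep features have already been flattened to the same length, and uses hypothetical names (`l2_feature_loss`, `total_loss`, `weight`). The abstract does not say which layers are paired, how shapes are matched, or how the loss is weighted.

```python
from typing import Sequence


def l2_feature_loss(shallow: Sequence[float], deep: Sequence[float]) -> float:
    """Mean squared difference between two flattened feature vectors.

    Assumption: both features already have the same length; shape
    matching between layers is not specified in the abstract.
    """
    if len(shallow) != len(deep):
        raise ValueError("features must have the same length in this sketch")
    if len(shallow) == 0:
        raise ValueError("features must be non-empty")
    return sum((s - d) ** 2 for s, d in zip(shallow, deep)) / len(shallow)


def total_loss(
    task_loss: float,
    shallow: Sequence[float],
    deep: Sequence[float],
    weight: float = 1.0,  # hypothetical balancing factor
) -> float:
    """Task loss plus the weighted self-feature loss (no teacher network)."""
    return task_loss + weight * l2_feature_loss(shallow, deep)


# Toy values standing in for features taken from one student network.
shallow_feat = [0.0, 1.0, 2.0, 3.0]
deep_feat = [1.0, 1.0, 2.0, 5.0]

feature_loss = l2_feature_loss(shallow_feat, deep_feat)
combined = total_loss(0.5, shallow_feat, deep_feat, weight=0.1)
```

In an actual training loop the deeper feature would typically act as the target for the shallower one, but that detail is likewise an assumption here.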
Pages: 6