Deep Ensemble Learning by Diverse Knowledge Distillation for Fine-Grained Object Classification

被引:9
作者
Okamoto, Naoki [1 ]
Hirakawa, Tsubasa [1 ]
Yamashita, Takayoshi [1 ]
Fujiyoshi, Hironobu [1 ]
机构
[1] Chubu Univ, Kasugai, Aichi, Japan
来源
COMPUTER VISION, ECCV 2022, PT XI | 2022年 / 13671卷
关键词
Ensemble learning; Knowledge distillation;
D O I
10.1007/978-3-031-20083-0_30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble of networks with bidirectional knowledge distillation does not significantly improve on the performance of ensemble of networks without bidirectional knowledge distillation. We think that this is because there is a relationship between the knowledge in knowledge distillation and the individuality of networks in the ensemble. In this paper, we propose a knowledge distillation for ensemble by optimizing the elements of knowledge distillation as hyperparameters. The proposed method uses graphs to represent diverse knowledge distillations. It automatically designs the knowledge distillation for the optimal ensemble by optimizing the graph structure to maximize the ensemble accuracy. Graph optimization and evaluation experiments using Stanford Dogs, Stanford Cars, CUB-200-2011, CIFAR-10, and CIFAR-100 show that the proposed method achieves higher ensemble accuracy than conventional ensembles.
引用
收藏
页码:502 / 518
页数:17
相关论文
共 39 条
[1]   Variational Information Distillation for Knowledge Transfer [J].
Ahn, Sungsoo ;
Hu, Shell Xu ;
Damianou, Andreas ;
Lawrence, Neil D. ;
Dai, Zhenwen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9155-9163
[2]   Optuna: A Next-generation Hyperparameter Optimization Framework [J].
Akiba, Takuya ;
Sano, Shotaro ;
Yanase, Toshihiko ;
Ohta, Takeru ;
Koyama, Masanori .
KDD'19: PROCEEDINGS OF THE 25TH ACM SIGKDD INTERNATIONAL CONFERENCCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2019, :2623-2631
[3]  
[Anonymous], 2009, CIFAR-100 Dataset
[4]   Grad-CAM plus plus : Generalized Gradient-based Visual Explanations for Deep Convolutional Networks [J].
Chattopadhay, Aditya ;
Sarkar, Anirban ;
Howlader, Prantik ;
Balasubramanian, Vineeth N. .
2018 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2018), 2018, :839-847
[5]  
Chen GB, 2017, ADV NEUR IN, V30
[6]   Exploiting Joint Robustness to Adversarial Perturbations [J].
Dabouei, Ali ;
Soleymani, Sobhan ;
Taherkhani, Fariborz ;
Dawson, Jeremy ;
Nasrabadi, Nasser M. .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :1119-1128
[7]   Diversity with Cooperation: Ensemble Methods for Few-Shot Classification [J].
Dvornik, Nikita ;
Schmid, Cordelia ;
Mairal, Julien .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3722-3730
[8]   Attention Branch Network: Learning of Attention Mechanism for Visual Explanation [J].
Fukui, Hiroshi ;
Hirakawa, Tsubasa ;
Yamashita, Takayoshi ;
Fujiyoshi, Hironobu .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :10697-10706
[9]  
Furlanello T, 2018, PR MACH LEARN RES, V80
[10]  
Ghasemzadeh Hassan, 2020, ASS ADVANCEMENT ARTI