A Category-Aware Curriculum Learning for Data-Free Knowledge Distillation

被引：0

作者：

Li, Xiufang ^{[1
]}

Jiao, Licheng ^{[1
]}

Sun, Qigong ^{[2
,3
]}

Liu, Fang ^{[1
]}

Liu, Xu ^{[1
]}

Li, Lingling ^{[1
]}

Chen, Puhua ^{[1
]}

Yang, Shuyuan ^{[1
]}

机构：

[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China

[2] SenseTime Res, Shanghai 200032, Peoples R China

[3] Shanghai AI Lab, Shanghai 200032, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

关键词：

Generators; Training; Knowledge engineering; Data models; Training data; Task analysis; Monitoring; Data generation; knowledge distillation; category-aware; curriculum learning; image classification;

D O I：

10.1109/TMM.2024.3395844

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Constructing effective proxy data is one of the core challenges in data-free knowledge distillation. The existing models ignore the influence of the category entanglement of the generated data on the distillation. To alleviate this issue, imitating the human learning process, a new category-aware curriculum learning mechanism is proposed in this paper to perform data-free knowledge distillation, called CCL-D. The main ideology of this category-aware curriculum learning mechanism is to provide a new learning mode for data generation and network training, which enables the model to realize the knowledge distillation process from easy to difficult through automated curriculum learning. In this novel learning mechanism, a category-aware monitoring module is proposed to constrain the category attribute of generated data. Based on this monitoring module, the curriculum learning process for data generation and network training is designed and applied. Initially, the generator is guided to obtain new data with clear category features. The utilization of data with apparent category features is easy for student network training, and it enables the student network to learn clear and significant category features at the early training stage. Subsequently, the generator is guided to generate data with category entanglement. Utilizing these new data with category entanglement problems can improve the recognition ability of the student network to interclass interference and enhance network robustness. The effectiveness of the CCL-D is verified on the six benchmark experimental datasets (MNIST, CIFAR-10, CIFAR-100, SVHN, Caltech-101, Tiny-Imagenet).

引用

页码：9603 / 9618

页数：16

共 66 条

[1]

Barber D, 2004, ADV NEUR IN, V16, P201

[2]

Bengio Y, 2009, INT C MACHINE LEARNI, P41, DOI [DOI 10.1145/1553374.1553380, 10.1145/1553374.1553380]

[3]

Binici K, 2022, AAAI CONF ARTIF INTE, P6089

[4] Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data [J].

Binici, Kuluhan ;

Nam Trung Pham ;

Mitra, Tulika ;

Leman, Karianto .

2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, :3625-3633

[5] Data-Free Learning of Student Networks [J].

Chen, Hanting ;

Wang, Yunhe ;

Xu, Chang ;

Yang, Zhaohui ;

Liu, Chuanjian ;

Shi, Boxin ;

Xu, Chunjing ;

Xu, Chao ;

Tian, Qi .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :3513-3521

[6] Dual-Awareness Attention for Few-Shot Object Detection [J].

Chen, Tung-, I ;

Liu, Yueh-Cheng ;

Su, Hung-Ting ;

Chang, Yu-Cheng ;

Lin, Yu-Hsiang ;

Yeh, Jia-Fong ;

Chen, Wen-Chin ;

Hsu, Winston H. .

IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 :291-301

[7]

Chen X, 2016, 30 C NEURAL INFORM P, V29

[8] Data-Free Network Quantization With Adversarial Knowledge Distillation [J].

Choi, Yoojin ;

Choi, Jihwan ;

El-Khamy, Mostafa ;

Lee, Jungwon .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, :3047-3057

[9] CDFI: Compression-Driven Network Design for Frame Interpolation [J].

Ding, Tianyu ;

Liang, Luming ;

Zhu, Zhihui ;

Zharkov, Ilya .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :7997-8007

[10]

Do K., 2022, Advances in Neural Information Processing Systems, V35, P10055

← 1 2 3 4 5 6 7 →