A Category-Aware Curriculum Learning for Data-Free Knowledge Distillation

Cited by: 0
Authors
Li, Xiufang [1]
Jiao, Licheng [1]
Sun, Qigong [2, 3]
Liu, Fang [1]
Liu, Xu [1]
Li, Lingling [1]
Chen, Puhua [1]
Yang, Shuyuan [1]
Affiliations
[1] Xidian Univ, Int Res Ctr Intelligent Percept & Computat, Key Lab Intelligent Percept & Image Understanding, Minist Educ, Xian 710071, Peoples R China
[2] SenseTime Res, Shanghai 200032, Peoples R China
[3] Shanghai AI Lab, Shanghai 200032, Peoples R China
Keywords
Generators; Training; Knowledge engineering; Data models; Training data; Task analysis; Monitoring; Data generation; knowledge distillation; category-aware; curriculum learning; image classification
DOI
10.1109/TMM.2024.3395844
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Constructing effective proxy data is one of the core challenges in data-free knowledge distillation. Existing models ignore the influence that category entanglement in the generated data has on distillation. To alleviate this issue, this paper proposes a new category-aware curriculum learning mechanism for data-free knowledge distillation, called CCL-D, which imitates the human learning process. The main idea of this mechanism is to provide a new learning mode for data generation and network training, enabling the model to carry out knowledge distillation from easy to difficult through automated curriculum learning. Within this mechanism, a category-aware monitoring module is proposed to constrain the category attributes of the generated data. Based on this monitoring module, the curriculum learning process for data generation and network training is designed and applied. Initially, the generator is guided to produce data with clear category features. Such data are easy for the student network to train on and let it learn clear and significant category features at the early training stage. Subsequently, the generator is guided to generate data with category entanglement. Training on these entangled data improves the student network's ability to recognize interclass interference and enhances its robustness. The effectiveness of CCL-D is verified on six benchmark datasets (MNIST, CIFAR-10, CIFAR-100, SVHN, Caltech-101, Tiny-ImageNet).
Pages: 9603-9618
Page count: 16
Related papers
66 entries in total
  • [1] [Anonymous], 2020, Uncertainty in Artificial Intelligence
  • [2] Barber D, 2004, ADV NEUR IN, V16, P201
  • [3] Bengio Y., 2009, P 26 ANN INT C MACH, P41, DOI 10.1145/1553374.1553380
  • [4] Binici K, 2022, AAAI CONF ARTIF INTE, P6089
  • [5] Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data
    Binici, Kuluhan
    Nam Trung Pham
    Mitra, Tulika
    Leman, Karianto
    [J]. 2022 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION (WACV 2022), 2022, : 3625 - 3633
  • [6] Data-Free Learning of Student Networks
    Chen, Hanting
    Wang, Yunhe
    Xu, Chang
    Yang, Zhaohui
    Liu, Chuanjian
    Shi, Boxin
    Xu, Chunjing
    Xu, Chao
    Tian, Qi
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 3513 - 3521
  • [7] Dual-Awareness Attention for Few-Shot Object Detection
    Chen, Tung-I
    Liu, Yueh-Cheng
    Su, Hung-Ting
    Chang, Yu-Cheng
    Lin, Yu-Hsiang
    Yeh, Jia-Fong
    Chen, Wen-Chin
    Hsu, Winston H.
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 291 - 301
  • [8] Chen Xi, 2016, NEURAL INFORM PROCES, V29
  • [9] Data-Free Network Quantization With Adversarial Knowledge Distillation
    Choi, Yoojin
    Choi, Jihwan
    El-Khamy, Mostafa
    Lee, Jungwon
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW 2020), 2020, : 3047 - 3057
  • [10] CDFI: Compression-Driven Network Design for Frame Interpolation
    Ding, Tianyu
    Liang, Luming
    Zhu, Zhihui
    Zharkov, Ilya
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 7997 - 8007