Relation-Guided Adversarial Learning for Data-Free Knowledge Transfer

Cited by: 0
Authors
Liang, Yingping [1 ]
Fu, Ying [1 ]
Affiliations
[1] Beijing Inst Technol, Sch Comp Sci & Technol, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Knowledge distillation; Data-free distillation; Transfer learning; Model quantization; Incremental learning; Generative model;
DOI
10.1007/s11263-024-02303-4
CLC Number (Chinese Library Classification)
TP18 [Theory of Artificial Intelligence];
Subject Classification Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Data-free knowledge distillation transfers knowledge by recovering training data from a pre-trained model. Despite the recent success of methods that seek global data diversity, the diversity within each class and the similarity among different classes are largely overlooked, resulting in data homogeneity and limited performance. In this paper, we introduce a novel Relation-Guided Adversarial Learning (RGAL) method with triplet losses, which addresses the homogeneity problem from two aspects. Specifically, our method aims to promote both intra-class diversity and inter-class confusion of the generated samples. To this end, we design two phases: an image synthesis phase and a student training phase. In the image synthesis phase, we construct an optimization process that pushes apart samples with the same labels and pulls together samples with different labels, yielding intra-class diversity and inter-class confusion, respectively. Then, in the student training phase, we perform the opposite optimization, which adversarially attempts to reduce the distance between samples of the same class and enlarge the distance between samples of different classes. To mitigate the conflict between seeking high global diversity and maintaining inter-class confusion, we propose a focal weighted sampling strategy that selects the negatives in the triplets unevenly within a finite range of distances. RGAL shows significant improvement over previous state-of-the-art methods in accuracy and data efficiency. Moreover, RGAL can be plugged into state-of-the-art methods for various data-free knowledge transfer applications. Experiments on various benchmarks demonstrate the effectiveness and generalizability of our proposed method on various tasks, specifically data-free knowledge distillation, data-free quantization, and non-exemplar incremental learning. Our code will be made publicly available to the community.
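The following is a minimal PyTorch sketch of the relation objectives described in the abstract, reconstructed from the abstract alone: an inverted triplet loss for the image synthesis phase, a standard triplet loss for the student training phase, and a distance-bounded focal sampling of negatives. All function and parameter names (relation_triplet_loss, focal_sample_negatives, margin, d_max, gamma) are illustrative assumptions, not the paper's actual implementation.

```python
# Hypothetical sketch reconstructed from the abstract; names and defaults
# are assumptions, not the authors' released code.
import torch
import torch.nn.functional as F


def focal_sample_negatives(dist, neg_mask, d_max=2.0, gamma=2.0):
    """Pick one negative index per anchor, weighting negatives unevenly
    within a finite distance range (the 'focal weighted sampling' idea):
    weights grow with distance up to d_max and are zero beyond it."""
    in_range = neg_mask & (dist < d_max)
    weights = (dist / d_max).clamp(max=1.0).pow(gamma) * in_range
    # Fall back to uniform sampling over all negatives when no negative
    # falls inside the range (assumes the batch contains >= 2 classes).
    empty = weights.sum(dim=1) == 0
    weights[empty] = neg_mask[empty].float()
    return torch.multinomial(weights, 1).squeeze(1)


def relation_triplet_loss(feats, labels, margin=1.0, synthesis_phase=True):
    """Triplet-style relation loss over a batch of feature vectors.

    synthesis_phase=True : push same-class samples apart and pull
        different-class samples together (intra-class diversity and
        inter-class confusion, used when optimizing generated images).
    synthesis_phase=False: the opposite, a standard triplet loss, applied
        adversarially when training the student.
    Assumes each class in the batch has at least two samples."""
    dist = torch.cdist(feats, feats)                    # (B, B) pairwise L2
    same = labels.unsqueeze(0) == labels.unsqueeze(1)
    eye = torch.eye(len(labels), dtype=torch.bool, device=feats.device)
    pos_mask, neg_mask = same & ~eye, ~same

    # Hardest (farthest) same-class positive; focally sampled negative.
    pos_idx = (dist * pos_mask - ~pos_mask * 1e9).argmax(dim=1)
    neg_idx = focal_sample_negatives(dist, neg_mask)
    d_pos = dist.gather(1, pos_idx.unsqueeze(1)).squeeze(1)
    d_neg = dist.gather(1, neg_idx.unsqueeze(1)).squeeze(1)

    if synthesis_phase:                                 # inverted triplet
        return F.relu(d_neg - d_pos + margin).mean()
    return F.relu(d_pos - d_neg + margin).mean()        # standard triplet
```

In use, such a term would presumably be added to the image-inversion objective during synthesis and to the distillation objective during student training, with the two opposing directions selected by the synthesis_phase flag.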
Pages: 2868-2885
Number of pages: 18
Related Papers
50 in total
  • [1] R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning
    Gao, Qiankun
    Zhao, Chen
    Ghanem, Bernard
    Zhang, Jian
    COMPUTER VISION, ECCV 2022, PT XXIII, 2022, 13683 : 423 - 439
  • [2] Robustness and Diversity Seeking Data-Free Knowledge Distillation
    Han, Pengchao
    Park, Jihong
    Wang, Shiqiang
    Liu, Yejun
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 2740 - 2744
  • [3] Parameterized data-free knowledge distillation for heterogeneous federated learning
    Guo, Cheng
    He, Qianqian
    Tang, Xinyu
    Liu, Yining
    Jie, Yingmo
    KNOWLEDGE-BASED SYSTEMS, 2025, 317
  • [4] A Category-Aware Curriculum Learning for Data-Free Knowledge Distillation
    Li, Xiufang
    Jiao, Licheng
    Sun, Qigong
    Liu, Fang
    Liu, Xu
    Li, Lingling
    Chen, Puhua
    Yang, Shuyuan
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 9603 - 9618
  • [5] Dual discriminator adversarial distillation for data-free model compression
    Zhao, Haoran
    Sun, Xin
    Dong, Junyu
    Manic, Milos
    Zhou, Huiyu
    Yu, Hui
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2022, 13 (05) : 1213 - 1230
  • [6] Dynamic data-free knowledge distillation by easy-to-hard learning strategy
    Li, Jingru
    Zhou, Sheng
    Li, Liangcheng
    Wang, Haishuai
    Bu, Jiajun
    Yu, Zhi
    INFORMATION SCIENCES, 2023, 642
  • [7] Data-free knowledge distillation in neural networks for regression
    Kang, Myeonginn
    Kang, Seokho
    EXPERT SYSTEMS WITH APPLICATIONS, 2021, 175
  • [8] Enhancing Data-Free Adversarial Distillation with Activation Regularization and Virtual Interpolation
    Qu, Xiaoyang
    Wang, Jianzong
    Xiao, Jing
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3340 - 3344
  • [9] Data-Free Solution of Electromagnetic PDEs Using Neural Networks and Extension to Transfer Learning
    Bhardwaj, Shubhendu
    Gaire, Pawan
    IEEE TRANSACTIONS ON ANTENNAS AND PROPAGATION, 2022, 70 (07) : 5179 - 5188