KD-DLGAN: Data Limited Image Generation via Knowledge Distillation

被引：14

作者：

Cui, Kaiwen ^{[1
]}

Yu, Yingchen ^{[1
]}

Zhan, Fangneng ^{[2
]}

Liao, Shengcai ^{[3
]}

Lu, Shijian ^{[1
]}

Xing, Eric ^{[4
]}

机构：

[1] Nanyang Technol Univ, Singapore, Singapore

[2] Max Planck Inst Informat, Saarbrucken, Germany

[3] Incept Inst Artificial Intelligence, Abu Dhabi, U Arab Emirates

[4] Mohamed bin Zayed Univ Artificial Intelligence, Abu Dhabi, U Arab Emirates

来源：

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR | 2023年

关键词：

D O I：

10.1109/CVPR52729.2023.00377

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Generative Adversarial Networks (GANs) rely heavily on large-scale training data for training high-quality image generation models. With limited training data, the GAN discriminator often suffers from severe overfitting which directly leads to degraded generation especially in generation diversity. Inspired by the recent advances in knowledge distillation (KD), we propose KD-DLGAN, a knowledge-distillation based generation framework that introduces pre-trained vision-language models for training effective data-limited generation models. KD-DLGAN consists of two innovative designs. The first is aggregated generative KD that mitigates the discriminator overfitting by challenging the discriminator with harder learning tasks and distilling more generalizable knowledge from the pre-trained models. The second is correlated generative KD that improves the generation diversity by distilling and preserving the diverse image-text correlation within the pre-trained models. Extensive experiments over multiple benchmarks show that KD-DLGAN achieves superior image generation with limited training data. In addition, KD-DLGAN complements the state-of-the-art with consistent and substantial performance gains. Note that codes will be released.

引用

页码：3872 / 3882

页数：11

共 54 条

[1] Robust Cross-Modal Representation Learning with Progressive Self-Distillation [J].

Andonian, Alex ;

Chen, Shixing ;

Hamid, Raffay .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :16409-16420

[2]

[Anonymous], 2022, P IEEE CVF C COMP VI

[3]

[Anonymous], 2022, P IEEE CVF C COMP VI, DOI DOI 10.1109/ECTC51906.2022.00250

[4]

[Anonymous], 2022, P IEEE CVF C COMP VI, DOI DOI 10.1109/ICDCS54860.2022.00095

[5]

Arjovsky M, 2017, PR MACH LEARN RES, V70

[6]

Ba LJ, 2014, ADV NEUR IN, V27

[7]

Brock Andrew, 2018, P 7 ACM INT S PERVAS, DOI DOI 10.1145/3205873.3205877

[8]

Bucila C., 2006, Proceedings of the 12th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P535, DOI DOI 10.1145/1150402.1150464

[9] Data-free Knowledge Distillation for Object Detection [J].

Chawla, Akshay ;

Yin, Hongxu ;

Molchanov, Pavlo ;

Alvarez, Jose .

2021 IEEE WINTER CONFERENCE ON APPLICATIONS OF COMPUTER VISION WACV 2021, 2021, :3288-3297

[10]

Chen GB, 2017, ADV NEUR IN, V30

← 1 2 3 4 5 6 →