Overcoming Catastrophic Forgetting for Fine-Tuning Pre-trained GANs

Cited by: 0
Authors
Zhang, Zeren [1 ]
Li, Xingjian [2 ]
Hong, Tao [1 ]
Wang, Tianyang [3 ]
Ma, Jinwen [1 ]
Xiong, Haoyi [2 ]
Xu, Cheng-Zhong [4 ]
Affiliations
[1] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
[3] Univ Alabama Birmingham, Birmingham, AL 35294 USA
[4] Univ Macau, Macau, Peoples R China
Keywords
Transfer Learning; Generative Adversarial Networks
DOI
10.1007/978-3-031-43424-2_18
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline Codes
081104; 0812; 0835; 1405
Abstract
The great transferability of DNNs has induced the popular "pre-training & fine-tuning" paradigm, by which a data-scarce task can be performed much more easily. However, compared to the existing efforts made in the context of supervised transfer learning, fewer explorations have been made into effectively fine-tuning pre-trained Generative Adversarial Networks (GANs). As reported in recent empirical studies, fine-tuning GANs faces a challenge of catastrophic forgetting similar to that in supervised transfer learning, which causes a severe capacity loss in the pre-trained model when adapting it to downstream datasets. While most existing approaches directly interfere with parameter updating, this paper introduces novel schemes from another perspective, i.e., inputs and features, and thus essentially focuses on the data aspect. First, we adopt a trust-region method to smooth the adaptation dynamics by progressively adjusting input distributions, aiming to avoid dramatic parameter changes, especially when the pre-trained GAN has no information about the target data. Second, we preserve the diversity of the fine-tuned GAN's generated results by explicitly encouraging generated images to encompass diversified spectral components in their deep features. We theoretically study the rationale of the proposed schemes and conduct extensive experiments on popular transfer learning benchmarks to demonstrate their superiority. The code and corresponding supplemental materials are available at https://github.com/zezeze97/Transfer-Pretrained-Gan.
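To make the two data-side schemes concrete, below is a minimal PyTorch sketch of how such interventions might look. The linear blending schedule in blended_latents and the singular-value entropy penalty in spectral_diversity_loss are illustrative assumptions, as are all names and hyperparameters; this is not the authors' implementation, which is available in the repository linked above.

import torch

def blended_latents(z_source, z_target, step, total_steps):
    # Trust-region-style schedule (assumed form): progressively shift the
    # generator's input distribution from the source latent prior toward
    # the target one, so early fine-tuning steps stay close to what the
    # pre-trained GAN already models and parameter changes stay small.
    alpha = min(1.0, step / total_steps)  # 0 -> source prior, 1 -> target prior
    return (1.0 - alpha) * z_source + alpha * z_target

def spectral_diversity_loss(features):
    # Diversity regularizer (assumed form): penalize a collapsed (low-rank)
    # batch of deep features by maximizing the entropy of its normalized
    # singular values, so generated images span diverse spectral components.
    feats = features - features.mean(dim=0, keepdim=True)  # (batch, dim)
    s = torch.linalg.svdvals(feats)                # singular values
    p = s / (s.sum() + 1e-8)                       # normalize to a distribution
    entropy = -(p * torch.log(p + 1e-8)).sum()     # high entropy = spread spectrum
    return -entropy                                # minimize the negative entropy

# Example use inside a fine-tuning step (feature_extractor is hypothetical):
#   z = blended_latents(z_source, z_target, step, total_steps)
#   fake = generator(z)
#   loss = adversarial_loss + lam * spectral_diversity_loss(feature_extractor(fake))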
Pages: 293-308
Number of pages: 16
Related Papers
50 records in total
  • [41] FINE-TUNING OF PRE-TRAINED END-TO-END SPEECH RECOGNITION WITH GENERATIVE ADVERSARIAL NETWORKS
    Haidar, Md Akmal
    Rezagholizadeh, Mehdi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 6204 - 6208
  • [42] Fine-tuning pre-trained neural networks for medical image classification in small clinical datasets
    Spolaor, Newton
    Lee, Huei Diana
    Mendes, Ana Isabel
    Nogueira, Conceicao Veloso
    Sabino Parmezan, Antonio Rafael
    Resende Takaki, Weber Shoity
    Rodrigues Coy, Claudio Saddy
    Wu, Feng Chung
    Fonseca-Pinto, Rui
    MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (09) : 27305 - 27329
  • [43] An Empirical Study of Parameter-Efficient Fine-Tuning Methods for Pre-trained Code Models
    Liu, Jiaxing
    Sha, Chaofeng
    Peng, Xin
    2023 38TH IEEE/ACM INTERNATIONAL CONFERENCE ON AUTOMATED SOFTWARE ENGINEERING, ASE, 2023, : 397 - 408
  • [44] APPT: Boosting Automated Patch Correctness Prediction via Fine-Tuning Pre-Trained Models
    Zhang, Quanjun
    Fang, Chunrong
    Sun, Weisong
    Liu, Yan
    He, Tieke
    Hao, Xiaodong
    Chen, Zhenyu
    IEEE TRANSACTIONS ON SOFTWARE ENGINEERING, 2024, 50 (03) : 474 - 494
  • [45] TraceGuard: Fine-Tuning Pre-Trained Model by Using Stego Images to Trace Its User
    Zhou, Limengnan
    Ren, Xingdong
    Qian, Cheng
    Sun, Guangling
    MATHEMATICS, 2024, 12 (21)
  • [46] Parameter-efficient fine-tuning of large-scale pre-trained language models
    Ding, Ning
    Qin, Yujia
    Yang, Guang
    Wei, Fuchao
    Yang, Zonghan
    Su, Yusheng
    Hu, Shengding
    Chen, Yulin
    Chan, Chi-Min
    Chen, Weize
    Yi, Jing
    Zhao, Weilin
    Wang, Xiaozhi
    Liu, Zhiyuan
    Zheng, Hai-Tao
    Chen, Jianfei
    Liu, Yang
    Tang, Jie
    Li, Juanzi
    Sun, Maosong
    NATURE MACHINE INTELLIGENCE, 2023, 5 (03) : 220+
  • [47] Comparative Study of Fine-Tuning of Pre-Trained Convolutional Neural Networks for Diabetic Retinopathy Screening
    Mohammadian, Saboora
    Karsaz, Ali
    Roshan, Yaser M.
    2017 24TH NATIONAL AND 2ND INTERNATIONAL IRANIAN CONFERENCE ON BIOMEDICAL ENGINEERING (ICBME), 2017, : 224 - 229
  • [48] Attempts on detecting Alzheimer's disease by fine-tuning pre-trained model with Gaze Data
    Nagasawa, Junichi
    Nakata, Yuichi
    Hiroe, Mamoru
    Zheng, Yujia
    Kawaguchi, Yutaka
    Maegawa, Yuji
    Hojo, Naoki
    Takiguchi, Tetsuya
    Nakayama, Minoru
    Uchimura, Maki
    Sonoda, Yuma
    Kowa, Hisatomo
    Nagamatsu, Takashi
    PROCEEDINGS OF THE 2024 ACM SYMPOSIUM ON EYE TRACKING RESEARCH & APPLICATIONS, ETRA 2024, 2024
  • [49] On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting
    Korbak, Tomasz
    Elsahar, Hady
    Kruszewski, German
    Dymetman, Marc
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022
  • [50] Fine-tuning pre-trained voice conversion model for adding new target speakers with limited data
    Koshizuka, Takeshi
    Ohmura, Hidefumi
    Katsurada, Kouichi
    INTERSPEECH 2021, 2021, : 1339 - 1343