Overcoming Catastrophic Forgetting for Fine-Tuning Pre-trained GANs

Cited by: 0
Authors
Zhang, Zeren [1 ]
Li, Xingjian [2 ]
Hong, Tao [1 ]
Wang, Tianyang [3 ]
Ma, Jinwen [1 ]
Xiong, Haoyi [2 ]
Xu, Cheng-Zhong [4 ]
Affiliations
[1] Peking Univ, Sch Math Sci, Beijing 100871, Peoples R China
[2] Baidu Inc, Beijing, Peoples R China
[3] Univ Alabama Birmingham, Birmingham, AL 35294 USA
[4] Univ Macau, Macau, Peoples R China
Keywords
Transfer Learning; Generative Adversarial Networks
DOI
10.1007/978-3-031-43424-2_18
CLC Number
TP18 [Artificial Intelligence Theory]
Subject Classification Codes
081104; 0812; 0835; 1405
Abstract
The strong transferability of DNNs has given rise to the popular "pre-training & fine-tuning" paradigm, which makes data-scarce tasks far easier to tackle. However, compared with the extensive work on supervised transfer learning, relatively little attention has been paid to effectively fine-tuning pre-trained Generative Adversarial Networks (GANs). As recent empirical studies report, fine-tuning GANs faces a challenge of catastrophic forgetting similar to that in supervised transfer learning, which causes a severe loss of the pre-trained model's capacity when adapting it to downstream datasets. While most existing approaches directly interfere with parameter updates, this paper introduces novel schemes from another perspective, namely inputs and features, and thus essentially focuses on the data aspect. First, we adopt a trust-region method that smooths the adaptation dynamics by progressively adjusting the input distributions, aiming to avoid dramatic parameter changes, especially when the pre-trained GAN has no information about the target data. Second, we aim to prevent the fine-tuned GAN from losing diversity in its generated results, which we achieve by explicitly encouraging generated images to encompass diversified spectral components in their deep features. We theoretically study the rationale of the proposed schemes and conduct extensive experiments on popular transfer learning benchmarks to demonstrate their superiority. The code and corresponding supplemental materials are available at https://github.com/zezeze97/Transfer-Pretrained-Gan.
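The abstract only summarizes the first scheme, so the following is a minimal PyTorch sketch of one way the "real" distribution seen during fine-tuning could be shifted progressively from source-like samples toward target data, in the trust-region spirit of avoiding dramatic early parameter changes. The schedule blend_coefficient, the mixing mechanism, and all names here are assumptions for illustration, not the authors' exact method (see the repository above for the actual implementation).

import torch

def blend_coefficient(step: int, total_steps: int) -> float:
    # Hypothetical trust-region-style schedule: target influence grows
    # slowly at first so early updates stay close to the pre-trained model.
    t = step / total_steps
    return t * t  # quadratic warm-up; an assumption for illustration

def mixed_real_batch(source_like: torch.Tensor,
                     target_batch: torch.Tensor,
                     step: int, total_steps: int) -> torch.Tensor:
    # Progressively replace source-like samples (e.g. outputs of the
    # frozen pre-trained generator) with real target images, so the
    # input distribution drifts smoothly instead of jumping.
    alpha = blend_coefficient(step, total_steps)
    mask = torch.rand(target_batch.size(0)) < alpha
    batch = source_like.clone()
    batch[mask] = target_batch[mask]
    return batch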
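For the second scheme, a plausible reading of "encouraging diversified spectral components in deep features" is a regularizer that penalizes a concentrated singular-value spectrum of a batch's feature matrix. The sketch below is such an illustrative proxy, again an assumption rather than the paper's exact objective.

import torch

def spectral_diversity_loss(features: torch.Tensor, eps: float = 1e-8) -> torch.Tensor:
    # features: (batch, dim) deep features of generated images.
    # Penalize a concentrated singular-value spectrum, pushing the
    # batch's features to span many directions (a diversity proxy).
    f = features - features.mean(dim=0, keepdim=True)  # center the batch
    s = torch.linalg.svdvals(f)                        # singular values
    p = s / (s.sum() + eps)                            # normalized spectrum
    entropy = -(p * torch.log(p + eps)).sum()          # spectral entropy
    return -entropy  # minimizing this encourages a flat (diverse) spectrum

Such a term would presumably be added to the generator loss with a small weight, e.g. loss = adv_loss + lam * spectral_diversity_loss(feats); the weighting and the choice of feature layer are likewise assumptions here.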
Pages: 293-308
Page count: 16