Retrospective Adversarial Replay for Continual Learning

被引:0
作者
Kumari, Lilly [1 ]
Wang, Shengjie [2 ]
Zhou, Tianyi [3 ]
Bilmes, Jeff [1 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] ByteDance, Beijing, Peoples R China
[3] Univ Maryland, College Pk, MD 20742 USA
来源
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022) | 2022年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continual learning is an emerging research challenge in machine learning that addresses the problem where models quickly fit the most recently trained-on data but suffer from catastrophic forgetting of previous data due to distribution shifts it does this by maintaining a small historical replay buffer in replay-based methods. To avoid these problems, this paper proposes a method, "Retrospective Adversarial Replay (RAR)", that synthesizes adversarial samples near the forgetting boundary. RAR perturbs a buffered sample towards its nearest neighbor drawn from the current task in a latent representation space. By replaying such samples, we are able to refine the boundary between previous and current tasks, hence combating forgetting and reducing bias towards the current task. To mitigate the severity of a small replay buffer, we develop a novel MixUp-based strategy to increase replay variation by replaying mixed augmentations. Combined with RAR, this achieves a holistic framework that helps to alleviate catastrophic forgetting. We show that this excels on broadly-used benchmarks and outperforms other continual learning baselines especially when only a small buffer is available. We conduct a thorough ablation study over each key component as well as a hyperparameter sensitivity analysis to demonstrate the effectiveness and robustness of RAR.
引用
收藏
页数:15
相关论文
共 70 条
[1]  
Aljundi R., 2019, ARXIV
[2]   Memory Aware Synapses: Learning What (not) to Forget [J].
Aljundi, Rahaf ;
Babiloni, Francesca ;
Elhoseiny, Mohamed ;
Rohrbach, Marcus ;
Tuytelaars, Tinne .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :144-161
[3]  
Aljundi Rahaf, 2019, ARXIV
[4]   PERSEVERANCE OF SOCIAL THEORIES - THE ROLE OF EXPLANATION IN THE PERSISTENCE OF DISCREDITED INFORMATION [J].
ANDERSON, CA ;
LEPPER, MR ;
ROSS, L .
JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, 1980, 39 (06) :1037-1049
[5]  
Borsos Z., 2020, Advances in neural information processing systems, V33, P14879, DOI [10.48550/arXiv.2006.03875, DOI 10.48550/ARXIV.2006.03875]
[6]  
Buzzega P., 2020, Advances in neural information processing systems, V33, P15920
[7]   Mitigating Evasion Attacks to Deep Neural Networks via Region-based Classification [J].
Cao, Xiaoyu ;
Gong, Neil Zhenqiang .
33RD ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSAC 2017), 2017, :278-287
[8]  
Chaudhry A., 2019, ARXIV
[9]   Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence [J].
Chaudhry, Arslan ;
Dokania, Puneet K. ;
Ajanthan, Thalaiyasingam ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 :556-572
[10]  
Chaudhry Arslan, 2018, ARXIV