Dynamic Memory-Based Continual Learning with Generating and Screening

被引:0
作者
Tao, Siying [1 ]
Huang, Jinyang [1 ]
Zhang, Xiang [2 ]
Sun, Xiao [1 ,3 ]
Gu, Yu [4 ]
机构
[1] Hefei Univ Technol, Sch Comp Sci & Informat Engn, Hefei, Peoples R China
[2] Univ Sci & Technol China, Sch Cybers Sci & Technol, Hefei, Peoples R China
[3] Hefei Comprehens Natl Sci Ctr, Inst Artificial Intelligence, Hefei, Peoples R China
[4] Univ Elect Sci & Technol China, Sch Comp Sci & Engn, I Lab, Chengdu, Peoples R China
来源
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III | 2023年 / 14256卷
关键词
Continual Learning; Generative replay; Deep learning;
D O I
10.1007/978-3-031-44213-1_31
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks suffer from catastrophic forgetting when continually learning new tasks. Although simply replaying all previous data alleviates the problem, it requires large memory and even worse, often infeasible in real-world applications where access to past data is limited. Therefore, We propose a two-stage framework that dynamically reproduces data features of previous tasks to reduce catastrophic forgetting. Specifically, at each task step, we use a new memory module to learn the data distribution of the new task and reproduce pseudo-data from previous memory modules to learn together. This enables us to integrate new visual concepts with retaining learned knowledge to achieve a better stability-malleability balance. We introduce an N-step model fusion strategy to accelerate the memorization process of the memory module and a screening strategy to control the quantity and quality of generated data, reducing distribution differences. We experimented on CIFAR-100, MNIST, and SVHN datasets to demonstrate the effectiveness of our method.
引用
收藏
页码:365 / 376
页数:12
相关论文
共 50 条
  • [31] Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments
    Meloni, Enrico
    Betti, Alessandro
    Faggi, Lapo
    Marullo, Simone
    Tiezzi, Matteo
    Melacci, Stefano
    CONTINUAL SEMI-SUPERVISED LEARNING, CSSL 2021, 2022, 13418 : 62 - 74
  • [32] Memory-Based Learning and Fusion Attention for Few-Shot Food Image Generation Method
    Ma, Jinlin
    Wan, Yuetong
    Ma, Ziping
    APPLIED SCIENCES-BASEL, 2024, 14 (18):
  • [33] Continual Learning of Multi-modal Dynamics with External Memory
    Akgul, Abdullah
    Unal, Gozde
    Kandemir, Melih
    6TH ANNUAL LEARNING FOR DYNAMICS & CONTROL CONFERENCE, 2024, 242 : 40 - 51
  • [34] Distributionally Robust Memory Evolution With Generalized Divergence for Continual Learning
    Wang, Zhenyi
    Shen, Li
    Duan, Tiehang
    Suo, Qiuling
    Fang, Le
    Liu, Wei
    Gao, Mingchen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14337 - 14352
  • [35] Memory efficient data-free distillation for continual learning
    Li, Xiaorong
    Wang, Shipeng
    Sun, Jian
    Xu, Zongben
    PATTERN RECOGNITION, 2023, 144
  • [36] Continual Learning Based on Knowledge Distillation and Representation Learning
    Chen, Xiu-Yan
    Liu, Jian-Wei
    Li, Wen-Tao
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT IV, 2022, 13532 : 27 - 38
  • [37] Continual learning classification method with constant-sized memory cells based on the artificial immune system
    Li, Dong
    Liu, Shulin
    Gao, Furong
    Sun, Xin
    KNOWLEDGE-BASED SYSTEMS, 2021, 213
  • [38] Memory-Efficient Continual Learning Object Segmentation for Long Videos
    Nazemi, Amir
    Shafiee, Mohammad Javad
    Gharaee, Zahra
    Fieguth, Paul
    IEEE ACCESS, 2024, 12 : 97067 - 97084
  • [39] Continual learning of neural networks for quality prediction in production using memory aware synapses and weight transfer
    Tercan, Hasan
    Deibert, Philipp
    Meisen, Tobias
    JOURNAL OF INTELLIGENT MANUFACTURING, 2022, 33 (01) : 283 - 292
  • [40] Continual learning of neural networks for quality prediction in production using memory aware synapses and weight transfer
    Hasan Tercan
    Philipp Deibert
    Tobias Meisen
    Journal of Intelligent Manufacturing, 2022, 33 : 283 - 292