Continual learning with invertible generative models

Cited by: 3
Authors
Pomponi, Jary [1]
Scardapane, Simone [1]
Uncini, Aurelio [1]
Affiliations
[1] Sapienza Univ Rome, Dept Informat Engn Elect & Telecommun DIET, Rome, Italy
Keywords
Machine learning; Continual learning; Normalizing flow; Catastrophic forgetting; Neural networks
DOI
10.1016/j.neunet.2023.05.020
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks), and rehearsal strategies, where the network is constantly re-trained on past data. Generative models have also been applied for the latter, in order to have endless sources of data. In this paper, we propose a novel method that combines the strengths of regularization and generative-based rehearsal approaches. Our generative model consists of a normalizing flow (NF), a probabilistic and invertible neural network, trained on the internal embeddings of the network. By keeping a single NF throughout the training process, we show that our memory overhead remains constant. In addition, exploiting the invertibility of the NF, we propose a simple approach to regularize the network's embeddings with respect to past tasks. We show that our method performs favorably with respect to state-of-the-art approaches in the literature, with bounded computational power and memory overheads. (c) 2023 Elsevier Ltd. All rights reserved.
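Illustrative sketch (not from the paper): the abstract relies on two properties of a normalizing flow, namely that it is exactly invertible and that it assigns a tractable likelihood to its inputs. The NumPy example below shows both on a single RealNVP-style affine coupling layer applied to made-up embedding vectors. The class name `AffineCoupling`, the linear conditioner weights `W_s` and `W_t`, the dimensions, and the random data are all assumptions chosen for illustration; the authors' actual architecture, training loop, and regularizer are not described in this record and are not reproduced here.

```python
import numpy as np


class AffineCoupling:
    """One RealNVP-style coupling layer (hypothetical, for illustration only)."""

    def __init__(self, dim, rng):
        self.d = dim // 2  # the first d features pass through unchanged
        # Assumed tiny linear conditioner producing a log-scale and a shift.
        self.W_s = 0.1 * rng.standard_normal((self.d, dim - self.d))
        self.W_t = 0.1 * rng.standard_normal((self.d, dim - self.d))

    def forward(self, x):
        """Map embeddings x to latent z; also return log|det Jacobian|."""
        x1, x2 = x[:, :self.d], x[:, self.d:]
        log_s = np.tanh(x1 @ self.W_s)  # bounded log-scale for stability
        t = x1 @ self.W_t
        z2 = x2 * np.exp(log_s) + t
        return np.concatenate([x1, z2], axis=1), log_s.sum(axis=1)

    def inverse(self, z):
        """Exact inverse of forward: map latent z back to embedding space."""
        z1, z2 = z[:, :self.d], z[:, self.d:]
        log_s = np.tanh(z1 @ self.W_s)
        t = z1 @ self.W_t
        x2 = (z2 - t) * np.exp(-log_s)
        return np.concatenate([z1, x2], axis=1)


def log_likelihood(layer, x):
    """Change-of-variables log-density under a standard normal base."""
    z, log_det = layer.forward(x)
    dim = z.shape[1]
    log_base = -0.5 * (z ** 2).sum(axis=1) - 0.5 * dim * np.log(2 * np.pi)
    return log_base + log_det


rng = np.random.default_rng(0)
layer = AffineCoupling(dim=4, rng=rng)

embeddings = rng.standard_normal((8, 4))  # stand-in for network embeddings
z, log_det = layer.forward(embeddings)
reconstructed = layer.inverse(z)          # invertibility: recovers the input

# Sampling "pseudo-embeddings": draw from the base and invert the flow.
samples = layer.inverse(rng.standard_normal((3, 4)))
```

In a generative-rehearsal setting, samples drawn this way would stand in for stored past data, while the forward direction gives a likelihood that could be used to score embeddings. How the paper actually combines the two is not stated in the abstract beyond the description above.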
Pages: 606-616
Page count: 11