Continual learning with invertible generative models

被引:3
作者
Pomponi, Jary [1 ]
Scardapane, Simone [1 ]
Uncini, Aurelio [1 ]
机构
[1] Sapienza Univ Rome, Dept Informat Engn Elect & Telecommun DIET, Rome, Italy
关键词
Machine learning; Continual learning; Normalizing flow; Catastrophic forgetting; NEURAL-NETWORKS;
D O I
10.1016/j.neunet.2023.05.020
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks), and rehearsal strategies, where the network is constantly re-trained on past data. Generative models have also been applied for the latter, in order to have endless sources of data. In this paper, we propose a novel method that combines the strengths of regularization and generative-based rehearsal approaches. Our generative model consists of a normal-izing flow (NF), a probabilistic and invertible neural network, trained on the internal embeddings of the network. By keeping a single NF throughout the training process, we show that our memory overhead remains constant. In addition, exploiting the invertibility of the NF, we propose a simple approach to regularize the network's embeddings with respect to past tasks. We show that our method performs favorably with respect to state-of-the-art approaches in the literature, with bounded computational power and memory overheads.(c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:606 / 616
页数:11
相关论文
共 50 条
[31]   Continual learning for predictive maintenance: Overview and challenges [J].
Hurtado, Julio ;
Salvati, Dario ;
Semola, Rudy ;
Bosio, Mattia ;
Lomonaco, Vincenzo .
INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 19
[32]   Evaluating and Explaining Generative Adversarial Networks for Continual Learning under Concept Drift [J].
Guzy, Filip ;
Wozniak, Michal ;
Krawczyk, Bartosz .
21ST IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS ICDMW 2021, 2021, :295-303
[33]   Continual Deep Learning for Time Series Modeling [J].
Ao, Sio-Iong ;
Fayek, Haytham .
SENSORS, 2023, 23 (16)
[34]   Recent Advances of Foundation Language Models-based Continual Learning: A Survey [J].
Yang, Yutao ;
Zhou, Jie ;
Ding, Xuan wen ;
Huai, Tianyu ;
Liu, Shunyu ;
Chen, Qin ;
Xie, Yuan ;
He, Liang .
ACM COMPUTING SURVEYS, 2025, 57 (05)
[35]   Learning generative models of natural images [J].
Wu, JM ;
Lin, ZH .
NEURAL NETWORKS, 2002, 15 (03) :337-347
[36]   CRNet: A Fast Continual Learning Framework With Random Theory [J].
Li, Depeng ;
Zeng, Zhigang .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (09) :10731-10744
[37]   A Comprehensive Survey of Continual Learning: Theory, Method and Application [J].
Wang, Liyuan ;
Zhang, Xingxing ;
Su, Hang ;
Zhu, Jun .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 46 (08) :5362-5383
[38]   Continual Learning Approach for Remote Sensing Scene Classification [J].
Ammour, Nassim ;
Bazi, Yakoub ;
Alhichri, Haikel ;
Alajlan, Naif .
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 19
[39]   A Continual Learning Survey: Defying Forgetting in Classification Tasks [J].
De Lange, Matthias ;
Aljundi, Rahaf ;
Masana, Marc ;
Parisot, Sarah ;
Jia, Xu ;
Leonardis, Ales ;
Slabaugh, Greg ;
Tuytelaars, Tinne .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) :3366-3385
[40]   Continual Information Cascade Learning [J].
Zhou, Fan ;
Jing, Xin ;
Xu, Xovee ;
Zhong, Ting ;
Trajcevski, Goce ;
Wu, Jin .
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,