Lifelong learning with Shared and Private Latent Representations learned through synaptic intelligence

被引:3
作者
Yang, Yang [1 ]
Huang, Jie [1 ]
Hu, Dexiu [1 ]
机构
[1] PLA Strateg Support Force Informat Engn Univ, Zhengzhou 450001, Henan, Peoples R China
基金
中国国家自然科学基金;
关键词
Shared and Private Latent Representations; Synaptic Intelligence; Lifelong learning; Entire learning trajectory; Task-invariant; Task-specific;
D O I
10.1016/j.neunet.2023.04.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper explores a novel lifelong learning method with Shared and Private Latent Representations (SPLR), which are learned through synaptic intelligence. To solve a sequence of tasks, by considering the entire parameter learning trajectory, SPLR can learn task-invariant representation which changes little, and task-specific features that change greatly along the entire parameter updating trajectory. Therefore, in the lifelong learning scenarios, our model can obtain a task-invariant structure shared by all tasks and also contain some private properties that are task-specific to each task. To reduce the parameter quantity, a l1 regularization to promote sparsity is employed in the weights. We use multiple datasets under lifelong learning scenes to verify our SPLR, on these datasets it can get comparable performance compared with existing lifelong learning approaches, and learn a sparse network which means fewer parameters while requiring less model training time. (c) 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页码:165 / 177
页数:13
相关论文
共 42 条
[1]  
Adel T., 2020, INT C LEARNING REPRE
[2]   Entropy-based Stability-Plasticity for Lifelong Learning [J].
Araujo, Vladimir ;
Hurtado, Julio ;
Soto, Alvaro ;
Moens, Marie-Francine .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2022, 2022, :3720-3727
[3]  
Arslan C., 2019, INT C MACH LEARN
[4]  
Bachem O, 2015, PR MACH LEARN RES, V37, P209
[5]   Computational principles of synaptic memory consolidation [J].
Benna, Marcus K. ;
Fusi, Stefano .
NATURE NEUROSCIENCE, 2016, 19 (12) :1697-1706
[6]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[7]  
Blundell C, 2015, PR MACH LEARN RES, V37, P1613
[8]  
Broderick T., 2013, ADV NEURAL INFORM PR, P1727
[9]  
Chaudhry A, 2019, ICLR
[10]  
Chaudhuri K, 2009, P 26 ANN INT C MACH, P129