Neuro-Symbolic Continual Learning: Knowledge, Reasoning Shortcuts and Concept Rehearsal

被引:0
作者
Marconato, Emanuele [1 ,2 ]
Bontempo, Gianpaolo [1 ,3 ]
Ficarra, Elisa [3 ]
Calderara, Simone [3 ]
Passerini, Andrea [2 ]
Teso, Stefano [2 ,4 ]
机构
[1] Univ Pisa, Pisa, Italy
[2] Univ Trento, DISI, Trento, Italy
[3] Univ Modena & Reggio Emilia, Modena, Italy
[4] Univ Trento, CIMeC, Trento, Italy
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷
基金
欧盟地平线“2020”;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We introduce Neuro-Symbolic Continual Learning, where a model has to solve a sequence of neuro-symbolic tasks, that is, it has to map subsymbolic inputs to high-level concepts and compute predictions by reasoning consistently with prior knowledge. Our key observation is that neuro-symbolic tasks, although different, often share concepts whose semantics remains stable over time. Traditional approaches fall short: existing continual strategies ignore knowledge altogether, while stock neuro-symbolic architectures suffer from catastrophic forgetting. We show that leveraging prior knowledge by combining neurosymbolic architectures with continual strategies does help avoid catastrophic forgetting, but also that doing so can yield models affected by reasoning shortcuts. These undermine the semantics of the acquired concepts, even when detailed prior knowledge is provided upfront and inference is exact, and in turn continual performance. To overcome these issues, we introduce COOL, a COncept-level cOntinual Learning strategy tailored for neuro-symbolic continual problems that acquires high-quality concepts and remembers them over time. Our experiments on three novel benchmarks highlights how COOL attains sustained high performance on neuro-symbolic continual learning tasks in which other strategies fail.
引用
收藏
页数:22
相关论文
共 66 条
[1]  
Ahmed K., 2022, NEURIPS
[2]  
Ahmed K., 2022, UAI
[3]  
Aljundi R, 2019, ADV NEUR IN, V32
[4]   Memory Aware Synapses: Learning What (not) to Forget [J].
Aljundi, Rahaf ;
Babiloni, Francesca ;
Elhoseiny, Mohamed ;
Rohrbach, Marcus ;
Tuytelaars, Tinne .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :144-161
[5]  
Boschini M., 2022, IEEE Transactions on Pattern Analysis and Machine Intelligence
[6]  
Buzzega P., 2020, Advances in Neural Information Processing Systems, V33, P15920
[7]  
Caccia Lucas, 2022, ICLR
[8]   End-to-End Incremental Learning [J].
Castro, Francisco M. ;
Marin-Jimenez, Manuel J. ;
Guil, Nicolas ;
Schmid, Cordelia ;
Alahari, Karteek .
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 :241-257
[9]  
Choi Y., 2020, Probabilistic circuits: A unifying framework for tractable probabilistic models
[10]  
Community B. O., 2018, Blender-a 3D modelling and rendering package