Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence

被引:678
作者
Chaudhry, Arslan [1 ]
Dokania, Puneet K. [1 ]
Ajanthan, Thalaiyasingam [1 ]
Torr, Philip H. S. [1 ]
机构
[1] Univ Oxford, Oxford, England
来源
COMPUTER VISION - ECCV 2018, PT XI | 2018年 / 11215卷
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1007/978-3-030-01252-6_33
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Incremental learning (IL) has received a lot of attention recently, however, the literature lacks a precise problem definition, proper evaluation settings, and metrics tailored specifically for the IL problem. One of the main objectives of this work is to fill these gaps so as to provide a common ground for better understanding of IL. The main challenge for an IL algorithm is to update the classifier whilst preserving existing knowledge. We observe that, in addition to forgetting, a known issue while preserving knowledge, IL also suffers from a problem we call intransigence, its inability to update knowledge. We introduce two metrics to quantify forgetting and intransigence that allow us to understand, analyse, and gain better insights into the behaviour of IL algorithms. Furthermore, we present RWalk, a generalization of EWC++ (our efficient version of EWC [6]) and Path Integral [25] with a theoretically grounded KL-divergence based perspective. We provide a thorough analysis of various IL algorithms on MNIST and CIFAR-100 datasets. In these experiments, RWalk obtains superior results in terms of accuracy, and also provides a better trade-off for forgetting and intransigence.
引用
收藏
页码:556 / 572
页数:17
相关论文
共 25 条
[1]   Natural gradient works efficiently in learning [J].
Amari, S .
NEURAL COMPUTATION, 1998, 10 (02) :251-276
[2]  
[Anonymous], 2015, ICML
[3]  
[Anonymous], 1988, NEURAL NETW S1, DOI DOI 10.1016/0893-6080(88)90469-8
[4]  
Grosse R, 2016, PR MACH LEARN RES, V48
[5]  
Hinton G., 2014, NEURIPS
[6]  
Kingma D. P., P 3 INT C LEARN REPR
[7]  
Kirkpatrick J., 2016, P NATL ACAD SCI US P
[8]  
Krizhevsky A, 2009, LEARNING MULTIPLE LA
[9]   ON INFORMATION AND SUFFICIENCY [J].
KULLBACK, S ;
LEIBLER, RA .
ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (01) :79-86
[10]  
Le Roux N., 2007, NIPS