Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence

被引：678

作者：

Chaudhry, Arslan ^{[1
]}

Dokania, Puneet K. ^{[1
]}

Ajanthan, Thalaiyasingam ^{[1
]}

Torr, Philip H. S. ^{[1
]}

机构：

[1] Univ Oxford, Oxford, England

来源：

COMPUTER VISION - ECCV 2018, PT XI | 2018年 / 11215卷

基金：

英国工程与自然科学研究理事会;

关键词：

D O I：

10.1007/978-3-030-01252-6_33

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Incremental learning (IL) has received a lot of attention recently, however, the literature lacks a precise problem definition, proper evaluation settings, and metrics tailored specifically for the IL problem. One of the main objectives of this work is to fill these gaps so as to provide a common ground for better understanding of IL. The main challenge for an IL algorithm is to update the classifier whilst preserving existing knowledge. We observe that, in addition to forgetting, a known issue while preserving knowledge, IL also suffers from a problem we call intransigence, its inability to update knowledge. We introduce two metrics to quantify forgetting and intransigence that allow us to understand, analyse, and gain better insights into the behaviour of IL algorithms. Furthermore, we present RWalk, a generalization of EWC++ (our efficient version of EWC [6]) and Path Integral [25] with a theoretically grounded KL-divergence based perspective. We provide a thorough analysis of various IL algorithms on MNIST and CIFAR-100 datasets. In these experiments, RWalk obtains superior results in terms of accuracy, and also provides a better trade-off for forgetting and intransigence.

引用

页码：556 / 572

页数：17

共 25 条

[1] Natural gradient works efficiently in learning [J].

Amari, S .

NEURAL COMPUTATION, 1998, 10 (02) :251-276

[2]

[Anonymous], 2015, ICML

[3]

[Anonymous], 1988, NEURAL NETW S1, DOI DOI 10.1016/0893-6080(88)90469-8

[4]

Grosse R, 2016, PR MACH LEARN RES, V48

[5]

Hinton G., 2014, NEURIPS

[6]

Kingma D. P., P 3 INT C LEARN REPR

[7]

Kirkpatrick J., 2016, P NATL ACAD SCI US P

[8]

Krizhevsky A, 2009, LEARNING MULTIPLE LA

[9] ON INFORMATION AND SUFFICIENCY [J].

KULLBACK, S ;

LEIBLER, RA .

ANNALS OF MATHEMATICAL STATISTICS, 1951, 22 (01) :79-86

[10]

Le Roux N., 2007, NIPS

← 1 2 3 →