Learning Without Forgetting

Cited by: 649

Authors:

Li, Zhizhong [1]

Hoiem, Derek [1]

Affiliations:

[1] Univ Illinois, Dept Comp Sci, Champaign, IL 61801 USA

Source:

COMPUTER VISION - ECCV 2016, PT IV | 2016 / Vol. 9908

Keywords:

Convolutional neural networks; Transfer learning; Multitask learning; Deep learning; Visual recognition;

DOI:

10.1007/978-3-319-46493-0_37

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

When building a unified vision system or gradually adding new capabilities to a system, the usual assumption is that training data for all tasks is always available. However, as the number of tasks grows, storing and retraining on such data becomes infeasible. A new problem arises where we add new capabilities to a Convolutional Neural Network (CNN), but the training data for its existing capabilities are unavailable. We propose our Learning without Forgetting method, which uses only new task data to train the network while preserving the original capabilities. Our method performs favorably compared to commonly used feature extraction and fine-tuning adaption techniques and performs similarly to multitask learning that uses original task data we assume unavailable. A more surprising observation is that Learning without Forgetting may be able to replace fine-tuning as standard practice for improved new task performance.
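The abstract does not spell out the training objective. As a minimal sketch, assuming a distillation-style formulation, the idea of using only new-task data while preserving original capabilities could be expressed as a two-term loss: a standard classification term for the new task, plus a term that keeps the old-task outputs close to the responses the original network gave on the same new-task inputs. All names here (`lwf_style_loss`, `recorded_old_logits`, `temperature`, `old_weight`) and the default values are illustrative assumptions, not taken from the record above.

```python
import math


def softmax(logits, temperature=1.0):
    """Numerically stable softmax with an optional temperature."""
    scaled = [z / temperature for z in logits]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(target_probs, predicted_probs, eps=1e-12):
    """Cross-entropy between a target distribution and a prediction."""
    return -sum(t * math.log(p + eps)
                for t, p in zip(target_probs, predicted_probs))


def lwf_style_loss(new_logits, new_label, old_logits, recorded_old_logits,
                   temperature=2.0, old_weight=1.0):
    """Hypothetical combined loss for one new-task example.

    new_logits          -- current outputs of the new-task head
    new_label           -- ground-truth class index for the new task
    old_logits          -- current outputs of the old-task head
    recorded_old_logits -- old-task outputs of the ORIGINAL network on this
                           same new-task input, recorded before training
    """
    # New-task term: ordinary negative log-likelihood of the true class.
    new_probs = softmax(new_logits)
    new_term = -math.log(new_probs[new_label] + 1e-12)

    # Old-task term: no old-task data is needed, only the recorded responses.
    target = softmax(recorded_old_logits, temperature)
    current = softmax(old_logits, temperature)
    old_term = cross_entropy(target, current)

    return new_term + old_weight * old_term


# Old-task head unchanged vs. drifted, for the same new-task prediction.
unchanged = lwf_style_loss([2.0, 0.5], 0, [1.0, -1.0, 0.3], [1.0, -1.0, 0.3])
drifted = lwf_style_loss([2.0, 0.5], 0, [-1.0, 1.0, 0.3], [1.0, -1.0, 0.3])
print(unchanged < drifted)
```

In this sketch the old-task term is smallest when the current old-task outputs match the recorded ones, so drifting away from the original network's behavior is penalized even though no original training data is used.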

Citation:

Pages: 614-629

Page count: 16

