Learning Without Forgetting

被引:525
作者
Li, Zhizhong [1 ]
Hoiem, Derek [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Champaign, IL 61801 USA
来源
COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷
关键词
Convolutional neural networks; Transfer learning; Multitask learning; Deep learning; Visual recognition;
D O I
10.1007/978-3-319-46493-0_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When building a unified vision system or gradually adding new capabilities to a system, the usual assumption is that training data for all tasks is always available. However, as the number of tasks grows, storing and retraining on such data becomes infeasible. A new problem arises where we add new capabilities to a Convolutional Neural Network (CNN), but the training data for its existing capabilities are unavailable. We propose our Learning without Forgetting method, which uses only new task data to train the network while preserving the original capabilities. Our method performs favorably compared to commonly used feature extraction and fine-tuning adaption techniques and performs similarly to multitask learning that uses original task data we assume unavailable. A more surprising observation is that Learning without Forgetting may be able to replace fine-tuning as standard practice for improved new task performance.
引用
收藏
页码:614 / 629
页数:16
相关论文
共 27 条
  • [1] Adriana R., 2015, Proceedings of ICLR, V2, P1, DOI DOI 10.48550/ARXIV.1412.6550
  • [2] Agrawal P, 2014, LECT NOTES COMPUT SC, V8695, P329, DOI 10.1007/978-3-319-10584-0_22
  • [3] [Anonymous], 2014, Advances in Neural Information Processing Systems, DOI DOI 10.48550/ARXIV.1411.1792
  • [4] [Anonymous], 2013, INT C MACHINE LEARNI
  • [5] Factors of Transferability for a Generic ConvNet Representation
    Azizpour, Hossein
    Razavian, Ali Sharif
    Sullivan, Josephine
    Maki, Atsuto
    Carlsson, Stefan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (09) : 1790 - 1802
  • [6] Multitask learning
    Caruana, R
    [J]. MACHINE LEARNING, 1997, 28 (01) : 41 - 75
  • [7] Boosted multi-task learning
    Chapelle, Olivier
    Shivaswamy, Pannagadatta
    Vadrevu, Srinivas
    Weinberger, Kilian
    Zhang, Ya
    Tseng, Belle
    [J]. MACHINE LEARNING, 2011, 85 (1-2) : 149 - 173
  • [8] Chen T., 2016, P INT C LEA IN PRESS
  • [9] Donahue J, 2014, PR MACH LEARN RES, V32
  • [10] The PASCAL Visual Object Classes Challenge: A Retrospective
    Everingham, Mark
    Eslami, S. M. Ali
    Van Gool, Luc
    Williams, Christopher K. I.
    Winn, John
    Zisserman, Andrew
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) : 98 - 136