Learning Without Forgetting

被引:649
作者
Li, Zhizhong [1 ]
Hoiem, Derek [1 ]
机构
[1] Univ Illinois, Dept Comp Sci, Champaign, IL 61801 USA
来源
COMPUTER VISION - ECCV 2016, PT IV | 2016年 / 9908卷
关键词
Convolutional neural networks; Transfer learning; Multitask learning; Deep learning; Visual recognition;
D O I
10.1007/978-3-319-46493-0_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When building a unified vision system or gradually adding new capabilities to a system, the usual assumption is that training data for all tasks is always available. However, as the number of tasks grows, storing and retraining on such data becomes infeasible. A new problem arises where we add new capabilities to a Convolutional Neural Network (CNN), but the training data for its existing capabilities are unavailable. We propose our Learning without Forgetting method, which uses only new task data to train the network while preserving the original capabilities. Our method performs favorably compared to commonly used feature extraction and fine-tuning adaption techniques and performs similarly to multitask learning that uses original task data we assume unavailable. A more surprising observation is that Learning without Forgetting may be able to replace fine-tuning as standard practice for improved new task performance.
引用
收藏
页码:614 / 629
页数:16
相关论文
共 27 条
[21]   Knowledge Transfer in Deep Block-Modular Neural Networks [J].
Terekhov, Alexander V. ;
Montone, Guglielmo ;
O'Regan, J. Kevin .
BIOMIMETIC AND BIOHYBRID SYSTEMS, LIVING MACHINES 2015, 2015, 9222 :268-279
[22]  
Thrun S, 1998, LEARNING TO LEARN, P181
[23]   Simultaneous Deep Transfer Across Domains and Tasks [J].
Tzeng, Eric ;
Hoffman, Judy ;
Darrell, Trevor ;
Saenko, Kate .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :4068-4076
[24]   MatConvNet Convolutional Neural Networks for MATLAB [J].
Vedaldi, Andrea ;
Lenc, Karel .
MM'15: PROCEEDINGS OF THE 2015 ACM MULTIMEDIA CONFERENCE, 2015, :689-692
[25]  
Vinyals O., 2014, Advances in Neural Information Processing Systems
[26]  
Wah C., 2011, CALTECH UCSD BIRDS 2
[27]  
Zhou B., 2015, PLACES2 LAR IN PRESS