Modeling the Background for Incremental Learning in Semantic Segmentation

被引:221
作者
Cermelli, Fabio [1 ,2 ]
Mancini, Massimiliano [2 ,3 ,4 ]
Bulo, Samuel Rota [5 ]
Ricci, Elisa [3 ,6 ]
Caputo, Barbara [1 ,2 ]
机构
[1] Politecn Torino, Turin, Italy
[2] Italian Inst Technol, Genoa, Italy
[3] Fdn Bruno Kessler, Trento, Italy
[4] Sapienza Univ Rome, Rome, Italy
[5] Mapillary Res, Malmo, Sweden
[6] Univ Trento, Trento, Italy
来源
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020) | 2020年
关键词
D O I
10.1109/CVPR42600.2020.00925
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite their effectiveness in a wide range of tasks, deep architectures suffer from some important limitations. In particular, they are vulnerable to catastrophic forgetting, i.e. they perform poorly when they are required to update their model as new classes are available but the original training set is not retained. This paper addresses this problem in the context of semantic segmentation. Current strategies fail on this task because they do not consider a peculiar aspect of semantic segmentation: since each training step provides annotation only for a subset of all possible classes, pixels of the background class (i.e. pixels that do not belong to any other classes) exhibit a semantic distribution shift. In this work we revisit classical incremental learning methods, proposing a new distillation-based framework which explicitly accounts for this shift. Furthermore, we introduce a novel strategy to initialize classifier's parameters, thus preventing biased predictions toward the background class. We demonstrate the effectiveness of our approach with an extensive evaluation on the Pascal-VOC 2012 and ADE20K datasets, significantly outperforming state of the art incremental learning methods. Code can be found at https://github.com/fcdl94/MiB.
引用
收藏
页码:9230 / 9239
页数:10
相关论文
共 39 条
[1]   Memory Aware Synapses: Learning What (not) to Forget [J].
Aljundi, Rahaf ;
Babiloni, Francesca ;
Elhoseiny, Mohamed ;
Rohrbach, Marcus ;
Tuytelaars, Tinne .
COMPUTER VISION - ECCV 2018, PT III, 2018, 11207 :144-161
[2]  
[Anonymous], 2019, INT J COMPUTER ASSIS, DOI DOI 10.1007/978-3-030-05773-21
[3]  
[Anonymous], 2017, NEURIPS
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]   End-to-End Incremental Learning [J].
Castro, Francisco M. ;
Marin-Jimenez, Manuel J. ;
Guil, Nicolas ;
Schmid, Cordelia ;
Alahari, Karteek .
COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 :241-257
[6]   Riemannian Walk for Incremental Learning: Understanding Forgetting and Intransigence [J].
Chaudhry, Arslan ;
Dokania, Puneet K. ;
Ajanthan, Thalaiyasingam ;
Torr, Philip H. S. .
COMPUTER VISION - ECCV 2018, PT XI, 2018, 11215 :556-572
[7]  
Chen L., 2017, Rethinking Atrous Convolution for Semantic Image Segmentation
[8]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[9]   An algorithm for highway vehicle detection based on convolutional neural network [J].
Chen, Linkai ;
Ye, Feiyue ;
Ruan, Yaduan ;
Fan, Honghui ;
Chen, Qimei .
EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2018,
[10]   The Mobile App Usability Inspection (MAUi) Framework as a Guide for Minimal Viable Product (MVP) Testing in Lean Development Cycle [J].
Cheng, Lin Chou .
PROCEEDINGS OF CHIUXID 2016: BRIDGING THE GAPS IN THE HCI & UX WORLD, 2016, :1-11