Continual coarse-to-fine domain adaptation in semantic segmentation

被引:12
作者
Shenaj, Donald [1 ]
Barbato, Francesco [1 ]
Michieli, Umberto [1 ]
Zanuttigh, Pietro [1 ]
机构
[1] Univ Padua, Dept Informat Engn, Padua, Italy
关键词
Coarse-to-fine learning; Unsupervised domain adaptation; Semantic segmentation; Continual learning; Deep learning;
D O I
10.1016/j.imavis.2022.104426
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep neural networks are typically trained in a single shot for a specific task and data distribution, but in real world settings both the task and the domain of application can change. The problem becomes even more challenging in dense predictive tasks, such as semantic segmentation, and furthermore most approaches tackle the two problems separately. In this paper we introduce the novel task of coarse-to-fine learning of semantic segmentation architectures in presence of domain shift. We consider subsequent learning stages progressively refining the task at the semantic level; i.e., the finer set of semantic labels at each learning step is hierarchically derived from the coarser set of the previous step. We propose a new approach (CCDA) to tackle this scenario. First, we employ the maximum squares loss to align source and target domains and, at the same time, to balance the gradients between well-classified and harder samples. Second, we introduce a novel coarse-to-fine knowledge distillation constraint to transfer network capabilities acquired on a coarser set of labels to a set of finer labels. Finally, we design a coarse-to-fine weight initialization rule to spread the importance from each coarse class to the respective finer classes. To evaluate our approach, we design two benchmarks where source knowledge is extracted from the GTA5 dataset and it is transferred to either the Cityscapes or the IDD datasets, and we show how it outperforms the main competitors.
引用
收藏
页数:11
相关论文
共 43 条
  • [1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [2] Barbato F., 2021, ARXIV 2021 PREPRINT
  • [3] Barbato F., P IEEE C COMPUTER VI, P2835
  • [4] Cardace A., P WINT C APPL COMP V, P1160
  • [5] Cermelli F., P IEEE C COMP VIS PA, P9233
  • [6] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
    Chen, Liang-Chieh
    Papandreou, George
    Kokkinos, Iasonas
    Murphy, Kevin
    Yuille, Alan L.
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
  • [7] Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
  • [8] Chen M., P INT C COMPUTER VIS, P2090
  • [9] Chen Y.-C., P IEEE C COMPUTER VI, P1791
  • [10] Cordts M., P IEEE C COMPUTER VI, P3213