Continual coarse-to-fine domain adaptation in semantic segmentation

被引：12

作者：

Shenaj, Donald ^{[1
]}

Barbato, Francesco ^{[1
]}

Michieli, Umberto ^{[1
]}

Zanuttigh, Pietro ^{[1
]}

机构：

[1] Univ Padua, Dept Informat Engn, Padua, Italy

来源：

IMAGE AND VISION COMPUTING | 2022年 / 121卷

关键词：

Coarse-to-fine learning; Unsupervised domain adaptation; Semantic segmentation; Continual learning; Deep learning;

D O I：

10.1016/j.imavis.2022.104426

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep neural networks are typically trained in a single shot for a specific task and data distribution, but in real world settings both the task and the domain of application can change. The problem becomes even more challenging in dense predictive tasks, such as semantic segmentation, and furthermore most approaches tackle the two problems separately. In this paper we introduce the novel task of coarse-to-fine learning of semantic segmentation architectures in presence of domain shift. We consider subsequent learning stages progressively refining the task at the semantic level; i.e., the finer set of semantic labels at each learning step is hierarchically derived from the coarser set of the previous step. We propose a new approach (CCDA) to tackle this scenario. First, we employ the maximum squares loss to align source and target domains and, at the same time, to balance the gradients between well-classified and harder samples. Second, we introduce a novel coarse-to-fine knowledge distillation constraint to transfer network capabilities acquired on a coarser set of labels to a set of finer labels. Finally, we design a coarse-to-fine weight initialization rule to spread the importance from each coarse class to the respective finer classes. To evaluate our approach, we design two benchmarks where source knowledge is extracted from the GTA5 dataset and it is transferred to either the Cityscapes or the IDD datasets, and we show how it outperforms the main competitors.

引用

页数：11

共 43 条

[1] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
Badrinarayanan, Vijay
Kendall, Alex
Cipolla, Roberto
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
[2] Barbato F., 2021, ARXIV 2021 PREPRINT
[3] Barbato F., P IEEE C COMPUTER VI, P2835
[4] Cardace A., P WINT C APPL COMP V, P1160
[5] Cermelli F., P IEEE C COMP VIS PA, P9233
[6] DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Chen, Liang-Chieh
Papandreou, George
Kokkinos, Iasonas
Murphy, Kevin
Yuille, Alan L.
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) : 834 - 848
[7] Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[8] Chen M., P INT C COMPUTER VIS, P2090
[9] Chen Y.-C., P IEEE C COMPUTER VI, P1791
[10] Cordts M., P IEEE C COMPUTER VI, P3213

← 1 2 3 4 5 →