Temporal Consistency as Pretext Task in Unsupervised Domain Adaptation for Semantic Segmentation

被引:0
作者
Barbosa, Felipe [1 ]
Osorio, Fernando [1 ]
机构
[1] Univ Sao Paulo, Inst Math & Comp Sci, Sao Paulo, Brazil
关键词
Semantic segmentation; Unsupervised domain adaptation; Temporal consistency; Self-supervised learning; Review; DATASET;
D O I
10.1007/s10846-025-02220-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Intelligent and autonomous robots (and vehicles) largely adopt computer vision systems to help in localization, navigation and obstacle avoidance tasks. By integrating different deep learning techniques, such as Object Detection and Image Semantic Segmentation, these systems achieve high accuracy in the domain they were trained on. Nonetheless, robustly operating in different domains still poses a major challenge to vision-based perception. In this sense, Unsupervised Domain Adaptation (UDA) has recently gained momentum due to its importance to real-world applications. Specifically, it leverages the prompt availability of unlabeled data to design auxiliary sources of supervision to guide the knowledge transfer between domains. The advantages of such an approach are two-fold: avoiding going through exhaustive labeling processes, and enhancing adaptation performance. In this scenario, exploring temporal correlations in unlabeled video data stands as an interesting alternative, which has not yet been explored to its full potential. In this work, we propose a Self-supervised learning framework that employs Temporal Consistency from unlabeled video sequences as a pretext task for improving UDA for Semantic Segmentation (UDASS). A simple yet effective strategy, it has shown promising results in a real-to-real adaptation setting. Our results and discussions are expected to benefit both new and experienced researchers on the subject.
引用
收藏
页数:15
相关论文
共 41 条
[1]  
Barbosa F., 2023, ZED2 Dataset
[2]   Domain Adaptation for Semantic Segmentation with Maximum Squares Loss [J].
Chen, Minghao ;
Xue, Hongyang ;
Cai, Deng .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :2090-2099
[3]   No More Discrimination: Cross City Adaptation of Road Scene Segmenters [J].
Chen, Yi-Hsin ;
Chen, Wei-Yu ;
Chen, Yu-Ting ;
Tsai, Bo-Cheng ;
Wang, Yu-Chiang Frank ;
Sun, Min .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :2011-2020
[4]  
Contributors M., 2020, MMSegmentation: openmmlab semantic segmentation toolbox and benchmark
[5]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[6]  
Dai DX, 2018, IEEE INT C INTELL TR, P3819, DOI 10.1109/ITSC.2018.8569387
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]   The PASCAL Visual Object Classes Challenge: A Retrospective [J].
Everingham, Mark ;
Eslami, S. M. Ali ;
Van Gool, Luc ;
Williams, Christopher K. I. ;
Winn, John ;
Zisserman, Andrew .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 111 (01) :98-136
[9]  
Guan D., 2021, Domain Adaptive Video Segmentation via Temporal Consistency Regularization, DOI [10.1109/ICCV48922.2021.00795, DOI 10.1109/ICCV48922.2021.00795]
[10]   Geometric Unsupervised Domain Adaptation for Semantic Segmentation [J].
Guizilini, Vitor ;
Li, Jie ;
Ambrus, Rares ;
Gaidon, Adrien .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :8517-8527