Co-Training for Unsupervised Domain Adaptation of Semantic Segmentation Models

被引:6
作者
Gomez, Jose L. [1 ,2 ]
Villalonga, Gabriel [1 ]
Lopez, Antonio M. [1 ,2 ]
机构
[1] Univ Autonoma Barcelona UAB, Comp Vis Ctr CVC, Bellaterra 08193, Spain
[2] Univ Autonoma Barcelona UAB, Comp Sci Dept, Bellaterra 08193, Spain
关键词
domain adaptation; semi-supervised learning; semantic segmentation; autonomous driving;
D O I
10.3390/s23020621
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Semantic image segmentation is a core task for autonomous driving, which is performed by deep models. Since training these models draws to a curse of human-based image labeling, the use of synthetic images with automatically generated labels together with unlabeled real-world images is a promising alternative. This implies addressing an unsupervised domain adaptation (UDA) problem. In this paper, we propose a new co-training procedure for synth-to-real UDA of semantic segmentation models. It performs iterations where the (unlabeled) real-world training images are labeled by intermediate deep models trained with both the (labeled) synthetic images and the real-world ones labeled in previous iterations. More specifically, a self-training stage provides two domain-adapted models and a model collaboration loop allows the mutual improvement of these two models. The final semantic segmentation labels (pseudo-labels) for the real-world images are provided by these two models. The overall procedure treats the deep models as black boxes and drives their collaboration at the level of pseudo-labeled target images, i.e., neither modifying loss functions is required, nor explicit feature alignment. We test our proposal on standard synthetic and real-world datasets for onboard semantic segmentation. Our procedure shows improvements ranging from approximately 13 to 31 mIoU points over baselines.
引用
收藏
页数:28
相关论文
共 44 条
[1]  
[Anonymous], 2018, P EUROPEAN C COMPUTE
[2]  
[Anonymous], 1998, P 11 ANN C COMP LEAR
[3]  
[Anonymous], 2017, ARXIV170310593CS
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]   Rethinking Ensemble-Distillation for Semantic Segmentation Based Unsupervised Domain Adaption [J].
Chao, Chen-Hao ;
Cheng, Bo-Wun ;
Lee, Chun-Yi .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :2610-2620
[6]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[7]  
Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[8]   Semi-supervised Domain Adaptation based on Dual-level Domain Mixing for Semantic Segmentation [J].
Chen, Shuaijun ;
Jia, Xu ;
He, Jianzhong ;
Shi, Yongjie ;
Liu, Jianzhuang .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11013-11022
[9]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[10]  
Csurka G, 2017, ADV COMPUT VIS PATT, P1, DOI 10.1007/978-3-319-58347-1_1