Pull and concentrate: improving unsupervised semantic segmentation adaptation with cross- and intra-domain consistencies

被引:1
作者
Zhang, Jian-Wei [1 ]
Sun, Yifan [2 ]
Chen, Wei [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD &CG, Yuhangtang Rd, Hangzhou 310000, Zhejiang, Peoples R China
[2] Baidu Res, Malianwa St, Beijing 100080, Peoples R China
基金
中国国家自然科学基金;
关键词
Unsupervised domain adaptation; Semantic segmentation; Self-training; Consistency constraints; REPRESENTATION;
D O I
10.1007/s00530-023-01131-9
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Unsupervised domain adaptation (UDA) is an important solution for the cross-domain problem in semantic segmentation. Existing segmentation UDA methods mainly consider the domain shift as the major challenge. This paper, from a novel viewpoint, disentangles the cross-domain problem into two negative factors beyond the domain shift. Specifically, we find that apart from the domain shift factor, the dispersed within-class distribution on the target domain is another factor that compromises cross-domain segmentation. This paper finds that the neglected target domain distribution dispersion is a challenge as crucial as the domain shift. In response to the joint of these two negative factors, we propose a "Pull-and-Concentrate" (PuCo) method comprised of two consistencies: (1) A cross-domain consistency "pulls" the source and target domain distribution (of the same class) close to each other based on a novel statistical style transfer. (2) An intra-domain consistency "concentrates" the within-class distribution on the target domain in a new unsupervised teacher-student method. Both consistencies have the advantage of being robust (or insulated) from pseudo-label noises. This advantage allows PuCo to bring consistent improvement over a battery of pseudo-label-based UDA methods. For example, on GTA5 to Cityscapes and SYNTHIA to Cityscapes, PuCo achieves 60.3% and 57.2% mean IoU, respectively. Code is available at https://github.com/Jarvis73/PuCo.
引用
收藏
页码:2633 / 2650
页数:18
相关论文
共 77 条
[11]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223
[12]  
French Geoffrey, 2018, P ICLR
[13]   Image Style Transfer Using Convolutional Neural Networks [J].
Gatys, Leon A. ;
Ecker, Alexander S. ;
Bethge, Matthias .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :2414-2423
[14]  
Ghosh A., 2020, ICLR
[15]   AlphaMatch: Improving Consistency for Semi-supervised Learning with Alpha-divergence [J].
Gong, Chengyue ;
Wang, Dilin ;
Liu, Qiang .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13678-13687
[16]   Continuous Pseudo-Label Rectified Domain Adaptive Semantic Segmentation with Implicit Neural Representations [J].
Gong, Rui ;
Wang, Qin ;
Danelljan, Martin ;
Dai, Dengxin ;
Van Gool, Luc .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :7225-7235
[17]  
Grill J.-B., 2020, P ADV NEUR INF PROC, V33, P21271, DOI DOI 10.48550/ARXIV.2006.07733
[18]   MetaCorrection: Domain-aware Meta Loss Correction for Unsupervised Domain Adaptation in Semantic Segmentation [J].
Guo, Xiaoqing ;
Yang, Chen ;
Li, Baopu ;
Yuan, Yixuan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :3926-3935
[19]   Classes Matter: A Fine-Grained Adversarial Approach to Cross-Domain Semantic Segmentation [J].
Wang, Haoran ;
Shen, Tong ;
Zhang, Wei ;
Duan, Ling-Yu ;
Mei, Tao .
COMPUTER VISION - ECCV 2020, PT XIV, 2020, 12359 :642-659
[20]   Momentum Contrast for Unsupervised Visual Representation Learning [J].
He, Kaiming ;
Fan, Haoqi ;
Wu, Yuxin ;
Xie, Saining ;
Girshick, Ross .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :9726-9735