OTCLDA: Optimal Transport and Contrastive Learning for Domain Adaptive Semantic Segmentation

被引:3
作者
Fan, Qizhe [1 ]
Shen, Xiaoqin [1 ]
Ying, Shihui [2 ]
Du, Shaoyi [3 ,4 ]
机构
[1] Xian Univ Technol, Sch Sci, Xian 710054, Peoples R China
[2] Shanghai Univ, Sch Sci, Shanghai 200444, Peoples R China
[3] Xi An Jiao Tong Univ, Affiliated Hosp 2, Dept Ultrasound, Xian 710049, Peoples R China
[4] Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China
关键词
Semantic segmentation; Semantics; Probability distribution; Adaptation models; Training; Task analysis; Fans; Domain adaptation; semantic segmentation; optimal transport; contrastive learning; self-training; ADAPTATION;
D O I
10.1109/TITS.2024.3399399
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
Unsupervised domain adaptive (UDA) semantic segmentation aims to assign a predetermined semantic label to every single pixel of the unannotated target data by exploiting a model that is trained on the labeled source data. Numerous current methods only display concern for grouping similar features together but ignore dispersing those features across various classes, so that some feature representations can not be well-separated. Therefore, we propose to employ contrastive learning (CL) method to increase the similarity of pixel features, propelling similar features closer and dispelling different ones far away. Furthermore, due to the domain shift, the UDA model frequently has poor generalization on the target domain. Accordingly, we design an optimal transport (OT) module to enhance UDA by comparing and aligning sample distributions to minimize transport loss between them. By taking advantage of this, the domain shift can be efficaciously mitigated by bringing the target probability distribution closer to that of the source. Specially, due to its simplicity, our OT module can be integrated into various UDA methods. In light of the aforementioned viewpoints, we put forth an ingenious approach, named OTCLDA, which successfully combines OT and CL while enhancing the performance of the UDA model. Multitudinous experiments demonstrate the importance of our method involving OT and CL. It significantly gains mIoU of 75.1% on benchmark GTA -> Cityscapes, and 66.9% on SYNTHIA -> Cityscapes respectively, displaying a competitive performance compared with previous works. The source code of OTCLDA is publicly available at https://github.com/YYDSDD/OTCLDA.
引用
收藏
页码:14685 / 14697
页数:13
相关论文
共 96 条
[1]   Unsupervised Domain Adaptation With Optimal Transport in Multi-Site Segmentation of Multiple Sclerosis Lesions From MRI Data [J].
Ackaouy, Antoine ;
Courty, Nicolas ;
Vallee, Emmanuel ;
Commowick, Olivier ;
Barillot, Christian ;
Galassi, Francesca .
FRONTIERS IN COMPUTATIONAL NEUROSCIENCE, 2020, 14
[2]  
Andoni A, 2008, PROCEEDINGS OF THE NINETEENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P343
[3]   Self-supervised Augmentation Consistency for Adapting Semantic Segmentation [J].
Araslanov, Nikita ;
Roth, Stefan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15379-15389
[4]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[5]  
Benamou JD, 2000, NUMER MATH, V84, P375, DOI 10.1007/s002119900117
[6]   Sliced and Radon Wasserstein Barycenters of Measures [J].
Bonneel, Nicolas ;
Rabin, Julien ;
Peyre, Gabriel ;
Pfister, Hanspeter .
JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2015, 51 (01) :22-45
[7]   Displacement Interpolation Using Lagrangian Mass Transport [J].
Bonneel, Nicolas ;
van de Panne, Michiel ;
Paris, Sylvain ;
Heidrich, Wolfgang .
ACM TRANSACTIONS ON GRAPHICS, 2011, 30 (06)
[8]  
Cai Qi, 2020, Advances in Neural Information Processing Systems, V33
[9]   NUMERICAL METHODS FOR MATCHING FOR TEAMS AND WASSERSTEIN BARYCENTERS [J].
Carlier, Guillaume ;
Oberman, Adam ;
Oudet, Edouard .
ESAIM-MATHEMATICAL MODELLING AND NUMERICAL ANALYSIS-MODELISATION MATHEMATIQUE ET ANALYSE NUMERIQUE, 2015, 49 (06) :1621-1642
[10]   All about Structure: Adapting Structural Information across Domains for Boosting Semantic Segmentation [J].
Chang, Wei-Lun ;
Wang, Hui-Po ;
Peng, Wen-Hsiao ;
Chiu, Wei-Chen .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1900-1909