Class-Balanced Sampling and Discriminative Stylization for Domain Generalization Semantic Segmentation

被引：1

作者：

Liao, Muxin ^{[1
]}

Tian, Shishun ^{[2
]}

Wei, Binbin ^{[2
]}

Zhang, Yuhang ^{[2
]}

Zou, Wenbin ^{[3
]}

Li, Xia ^{[2
]}

机构：

[1] Jiangxi Agr Univ, Sch Comp Sci & Engn, Nanchang 330045, Peoples R China

[2] Shenzhen Univ, Coll Elect & Informat Engn, Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China

[3] Shenzhen Univ, Shenzhen Key Lab Adv Machine Learning & Applicat, Guangdong Key Lab Intelligent Informat Proc, Inst Artificial Intelligence & Adv Commun,Coll Ele, Shenzhen 518060, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2025年 / 26卷 / 02期

基金：

中国国家自然科学基金;

关键词：

Semantic segmentation; Semantics; Intelligent transportation systems; Training data; Training; Benchmark testing; Automobiles; Adaptation models; Tires; Motorcycles; Domain generalization; semantic segmentation; adaptively class-balanced sampling; class-discriminative stylization;

D O I：

10.1109/TITS.2024.3496538

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

Existing domain generalization semantic segmentation (DGSS) methods have achieved remarkable performance on unseen domains by generating stylized images to increase the diversity of training data. However, since the training data is usually class-imbalanced, uniform style randomization is unable to generate diverse minority classes. This means that models may overfit to the minority classes, resulting in suboptimal performance on the minority classes. In addition, the image-level style randomization may also corrupt the class-discriminative regions of objects, leading to a loss of the class-discriminative representation. To address these issues, a novel class-balanced sampling and discriminative stylization (CSDS) approach is proposed for DGSS. Specifically, first, a pixel-level class-balanced sampling (PCS) strategy is proposed to adaptively sample patches of the minority classes from the source domain images and paste the sampled patches on the input images. Unlike existing class sampling strategies that fix the minority classes, the PCS strategy dynamically determines the minority classes by estimating the class distribution after each sampling. Then, a class-discriminative style randomization (CSR) strategy is proposed to increase the style diversity of the sampled patches while preserving the class-discriminative regions. Finally, since the pasting positions of the sampled patches are uncertain, which may confuse the semantic relations between the classes, a semantic consistency constraint is proposed to ensure the learning of reliable semantic relations. Extensive experiments demonstrate that the proposed approach achieves superior performance compared to existing DGSS methods on multiple benchmarks. The source code has been released on https://github.com/seabearlmx/CSDS.

引用

页码：2596 / 2608

页数：13

共 60 条

[1] PASTA: Proportional Amplitude Spectrum Training Augmentation for Syn-to-Real Domain Generalization [J].

Chattopadhyay, Prithvijit ;

Sarangmath, Kartik ;

Vijaykumar, Vivek ;

Hoffman, Judy .

2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, :19231-19243

[2] Hyperbolic Uncertainty Aware Semantic Segmentation [J].

Chen, Bike ;

Peng, Wei ;

Cao, Xiaofeng ;

Roning, Juha .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) :1275-1290

[3] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[4] RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening [J].

Choi, Sungha ;

Jung, Sanghun ;

Yun, Huiwon ;

Kim, Joanne T. ;

Kim, Seungryong ;

Choo, Jaegul .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11575-11585

[5] Toward Generalizable Multispectral Pedestrian Detection [J].

Chu, Fuchen ;

Cao, Jiale ;

Song, Zhanjie ;

Shao, Zhuang ;

Pang, Yanwei ;

Li, Xuelong .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (05) :3739-3750

[6] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[7]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[8] HGFormer: Hierarchical Grouping Transformer for Domain Generalized Semantic Segmentation [J].

Ding, Jian ;

Xue, Nan ;

Xia, Gui-Song ;

Schiele, Bernt ;

Dai, Dengxin .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :15413-15423

[9] A Simple Recipe for Language-guided Domain Generalized Segmentation [J].

Fahes, Mohammad ;

Vu, Tuan-Hung ;

Bursuc, Andrei ;

Perez, Patrick ;

de Charette, Raoul .

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, :23428-23437

[10] An Active and Contrastive Learning Framework for Fine-Grained Off-Road Semantic Segmentation [J].

Gao, Biao ;

Zhao, Xijun ;

Zhao, Huijing .

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) :564-579

← 1 2 3 4 5 6 →