Considering representation diversity and prediction consistency for domain generalization semantic segmentation

被引：0

作者：

Liao, Muxin ^{[1
,2
]}

Tian, Shishun ^{[2
,5
]}

Zhang, Yuhang ^{[2
,5
]}

Hua, Guoguang ^{[2
,5
]}

Zou, Wenbin ^{[2
,3
,4
,5
]}

Li, Xia ^{[2
,5
]}

机构：

[1] Jiangxi Agr Univ, Sch Comp Sci & Engn, Lushan South Ave, Nanchang 330045, Jiangxi, Peoples R China

[2] Shenzhen Univ, Guangdong Key Lab Intelligent Informat Proc, Shenzhen 518060, Peoples R China

[3] Shenzhen Univ, Shenzhen Key Lab Adv Machine Learning & Applicat, Shenzhen 518060, Peoples R China

[4] Shenzhen Univ, Inst Artificial Intelligence & Adv Commun, Shenzhen 518060, Peoples R China

[5] Shenzhen Univ, Coll Elect & Informat Engn, Shenzhen 518060, Peoples R China

来源：

KNOWLEDGE-BASED SYSTEMS | 2024年 / 305卷

关键词：

Domain generalization; Semantic segmentation; Representation diversity; Prediction consistency; Adaptive style randomization; INVARIANT REPRESENTATION; ADAPTATION;

D O I：

10.1016/j.knosys.2024.112649

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Existing domain generalization semantic segmentation (DGSS) methods aim to learn domain-invariant representation from single or multiple domains by using the consistency constraint or normalization strategy. Although these methods can improve the generalization capability of the models, the representation capability maybe inevitably weakened since some effective information is eliminated, which may lead to the low discrimination capability of models. To simultaneously preserve the generalization and discrimination capability of models, we provide a novel perspective that encourages to learn the distinct representation while keeping the consistent prediction. Based on this, we propose a novel approach for domain generalization semantic segmentation by simultaneously considering the representation diversity and prediction consistency. The proposed approach consists of an adaptive style randomization (ASR) module, a representation diversity (RD) constraint, and a prediction consistency (PC) constraint. Specifically, first, since diverse stylized images are beneficial for the RD constraint to learn the distinct representation, the ASR module is proposed to leverage global and local style randomization to generate diverse stylized images which have the same content as the source domain images. Then, the distinct representation is learned from the source domain images and the stylized images by using the RD constraint to improve the discrimination capability of the model. Finally, the distinct representation is expected to generate consistent prediction by using the PC constraint for preserving the generalization capability of the model. Extensive experiments demonstrate that our approach achieves superior performance over current approaches on several benchmarks.

引用

页数：12

共 64 条

[1] Style Blind Domain Generalized Semantic Segmentation via Covariance Alignment and Semantic Consistence Contrastive Learning [J].

Ahn, Woo-Jin ;

Yang, Geun-Yeong ;

Choi, Hyun-Duck ;

Lim, Myo-Taeg .

2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2024, 2024, :3616-3626

[2] Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].

Chen, Liang-Chieh ;

Zhu, Yukun ;

Papandreou, George ;

Schroff, Florian ;

Adam, Hartwig .

COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851

[3] Decomposed adversarial domain generalization [J].

Chen, Sentao .

KNOWLEDGE-BASED SYSTEMS, 2023, 263

[4]

Chen XY, 2019, PR MACH LEARN RES, V97

[5] CrDoCo: Pixel-level Domain Transfer with Cross-Domain Consistency [J].

Chen, Yun-Chun ;

Lin, Yen-Yu ;

Yang, Ming-Hsuan ;

Huang, Jia-Bin .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :1791-1800

[6] ADPL: Adaptive Dual Path Learning for Domain Adaptation of Semantic Segmentation [J].

Cheng, Yiting ;

Wei, Fangyun ;

Bao, Jianmin ;

Chen, Dong ;

Zhang, Wenqiang .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) :9339-9356

[7] Dual Path Learning for Domain Adaptation of Semantic Segmentation [J].

Cheng, Yiting ;

Wei, Fangyun ;

Bao, Jianmin ;

Chen, Dong ;

Wen, Fang ;

Zhang, Wenqiang .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9062-9071

[8] RobustNet: Improving Domain Generalization in Urban-Scene Segmentation via Instance Selective Whitening [J].

Choi, Sungha ;

Jung, Sanghun ;

Yun, Huiwon ;

Kim, Joanne T. ;

Kim, Seungryong ;

Choo, Jaegul .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11575-11585

[9] The Cityscapes Dataset for Semantic Urban Scene Understanding [J].

Cordts, Marius ;

Omran, Mohamed ;

Ramos, Sebastian ;

Rehfeld, Timo ;

Enzweiler, Markus ;

Benenson, Rodrigo ;

Franke, Uwe ;

Roth, Stefan ;

Schiele, Bernt .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223

[10] Deformable Convolutional Networks [J].

Dai, Jifeng ;

Qi, Haozhi ;

Xiong, Yuwen ;

Li, Yi ;

Zhang, Guodong ;

Hu, Han ;

Wei, Yichen .

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773

← 1 2 3 4 5 6 7 →