Exploring Explicitly Disentangled Features for Domain Generalization

Cited by: 11

Authors:

Li, Jingwei [1,2]

Li, Yuan [1,2]

Wang, Huanjie [1,2]

Liu, Chengbao [1]

Tan, Jie [1,2]

Affiliations:

[1] Chinese Acad Sci, Inst Automat, Beijing 100080, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 101408, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023 / Vol. 33 / No. 11

Funding:

National Natural Science Foundation of China;

Keywords:

Domain generalization; feature disentanglement; Fourier transform; data augmentation;

DOI:

10.1109/TCSVT.2023.3269534

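Chinese Library Classification:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Subject classification codes: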

0808 ; 0809 ;

Abstract:

Domain generalization (DG) is a challenging task that aims to train a robust model using only labeled source data so that it generalizes well to unseen target data. The domain gap between the source and target data may degrade performance. Many methods try to overcome this difficulty by learning domain-invariant features. However, these methods require sophisticated network designs or training strategies, which makes them inefficient and complex. In this paper, we first analyze features and reclassify them into two categories: implicitly disentangled features and explicitly disentangled ones. Because we aim to design a generic DG algorithm that alleviates the problems above, we focus on explicitly disentangled features for their simplicity and interpretability. Based on our analysis, we find that the shape features of images are a simple and elegant choice. We extract shape features from two aspects. At the network level, we propose Multi-Scale Amplitude Mixing (MSAM), which uses the Fourier transform to strengthen shape features at different layers of the network. At the input level, we propose a new data augmentation method called Random Shape Warping (RSW), which encourages the model to concentrate on the global structures of objects. RSW randomly distorts local parts of the images while keeping the global structures unchanged, which further improves the robustness of the model. Our methods are simple yet efficient and can be conveniently used as plug-and-play modules. They can outperform state-of-the-art (SOTA) methods without bells and whistles.
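
The two ideas named in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract gives no formulas, so everything here is an assumption. `amplitude_mix` assumes that "amplitude mixing" means blending Fourier amplitudes of two inputs while keeping the phase of the first (the multi-scale part, applying this to feature maps at several layers, is not shown). `block_shift_warp` is a stand-in for local distortion that shifts coarse blocks by a few pixels; it is not RSW itself. The function names and the parameters `lam`, `grid`, and `max_shift` are hypothetical.

```python
import numpy as np


def amplitude_mix(x, y, lam):
    """Blend the Fourier amplitude of x with that of y, keeping the phase of x.

    x, y: real arrays of identical shape (H, W) or (C, H, W).
    lam:  hypothetical mixing weight in [0, 1]; lam = 0 returns x unchanged.
    """
    fx = np.fft.fft2(x, axes=(-2, -1))
    fy = np.fft.fft2(y, axes=(-2, -1))
    # Linear interpolation between the two amplitude spectra.
    amp = (1.0 - lam) * np.abs(fx) + lam * np.abs(fy)
    # Recombine the mixed amplitude with the original phase of x.
    mixed = amp * np.exp(1j * np.angle(fx))
    return np.fft.ifft2(mixed, axes=(-2, -1)).real


def block_shift_warp(img, rng, grid=4, max_shift=2):
    """Shift each coarse block of img by a small random offset.

    Assumed stand-in for a local distortion: pixels move by at most
    max_shift, so the overall layout of the image is preserved.
    """
    h, w = img.shape[-2:]
    block = np.ones((h // grid + 1, w // grid + 1), dtype=int)
    # One random (dy, dx) offset per grid cell, repeated over the cell.
    dy = np.kron(rng.integers(-max_shift, max_shift + 1, (grid, grid)), block)[:h, :w]
    dx = np.kron(rng.integers(-max_shift, max_shift + 1, (grid, grid)), block)[:h, :w]
    ys, xs = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    # Sample from the shifted coordinates, clamped to the image borders.
    ys = np.clip(ys + dy, 0, h - 1)
    xs = np.clip(xs + dx, 0, w - 1)
    return img[..., ys, xs]
```

In this sketch the phase spectrum is left untouched because it is commonly taken to carry structural information, which is consistent with the abstract's emphasis on shape; whether the paper mixes amplitudes in exactly this way is not stated in the record.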

Pages: 6360-6373

Page count: 14
