Exploring Explicitly Disentangled Features for Domain Generalization

被引：11

作者：

Li, Jingwei ^{[1
,2
]}

Li, Yuan ^{[1
,2
]}

Wang, Huanjie ^{[1
,2
]}

Liu, Chengbao ^{[1
]}

Tan, Jie ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, Beijing 100080, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 101408, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2023年 / 33卷 / 11期

基金：

中国国家自然科学基金;

关键词：

Domain generalization; feature disentanglement; Fourier transform; data augmentation;

D O I：

10.1109/TCSVT.2023.3269534

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Domain generalization (DG) is a challenging task that aims to train a robust model with only labeled source data and can generalize well on unseen target data. The domain gap between the source and target data may degrade the performance. A plethora of methods resort to obtaining domain-invariant features to overcome the difficulties. However, these methods require sophisticated network designs or training strategies, causing inefficiency and complexity. In this paper, we first analyze and reclassify the features into two categories, i.e., implicitly disentangled ones and explicitly disentangled counterparts. Since we aim to design a generic algorithm for DG to alleviate the problems mentioned above, we focus more on the explicitly disentangled features due to their simplicity and interpretability. We find out that the shape features of images are simple and elegant choices based on our analysis. We extract the shape features from two aspects. In the aspect of networks, we propose Multi-Scale Amplitude Mixing (MSAM) to strengthen shape features at different layers of the network by Fourier transform. In the aspect of inputs, we propose a new data augmentation method called Random Shape Warping (RSW) to facilitate the model to concentrate more on the global structures of the objects. RSW randomly distorts the local parts of the images and keeps the global structures unchanged, which can further improve the robustness of the model. Our methods are simple yet efficient and can be conveniently used as plug-and-play modules. They can outperform state-of-the-art (SOTA) methods without bells and whistles.

引用

页码：6360 / 6373

页数：14

共 50 条

[11] Exploring rounD Dataset for Domain Generalization in Autonomous Vehicle Trajectory Prediction
Zhang, Zikai
SENSORS, 2024, 24 (23)
[12] SADGFeat: Learning local features with layer spatial attention and domain generalization
Bai, Wenjing
Zhang, Yunzhou
Wang, Li
Liu, Wei
Hu, Jun
Huang, Guan
IMAGE AND VISION COMPUTING, 2024, 146
[13] Feature Stylization Adversarial Domain Generalization
Hu, Zhengzhong
2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
[14] Cross-modal domain generalization semantic segmentation based on fusion features
Yue, Wanlin
Zhou, Zhiheng
Cao, Yinglie
Liuman
KNOWLEDGE-BASED SYSTEMS, 2024, 302
[15] Exploring Regularization Methods for Domain Generalization in Accelerometer-Based Human Activity Recognition
Bento, Nuno
Rebelo, Joana
Carreiro, Andre V.
Ravache, Francois
Barandas, Marilia
SENSORS, 2023, 23 (14)
[16] DIFLF: A domain-invariant features learning framework for single-source domain generalization in mammogram classification
Xie, Wanfang
Liu, Zhenyu
Zhao, Litao
Wang, Meiyun
Tian, Jie
Liu, Jiangang
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2025, 261
[17] TOWARDS DOMAIN GENERALIZATION IN UNDERWATER OBJECT DETECTION
Liu, Hong
Song, Pinhao
Ding, Runwei
2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1971 - 1975
[18] Style Normalization and Restitution for Domain Generalization and Adaptation
Jin, Xin
Lan, Cuiling
Zeng, Wenjun
Chen, Zhibo
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3636 - 3651
[19] Latent Feature Disentanglement for Visual Domain Generalization
Gholami, Behnam
El-Khamy, Mostafa
Song, Kee-Bong
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5751 - 5763
[20] DOMAIN GENERALIZATION WITH FOURIER TRANSFORM AND SOFT THRESHOLDING
Pan, Hongyi
Wang, Bin
Zhang, Zheyuan
Zhu, Xin
Jha, Debesh
Cetin, Ahmet Enis
Spampinato, Concetto
Bagci, Ulas
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2106 - 2110

← 1 2 3 4 5 →