Exploring Explicitly Disentangled Features for Domain Generalization

Cited by: 11
Authors
Li, Jingwei [1 ,2 ]
Li, Yuan [1 ,2 ]
Wang, Huanjie [1 ,2 ]
Liu, Chengbao [1 ]
Tan, Jie [1 ,2 ]
Affiliations
[1] Chinese Acad Sci, Inst Automat, Beijing 100080, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 101408, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Domain generalization; feature disentanglement; Fourier transform; data augmentation;
DOI
10.1109/TCSVT.2023.3269534
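Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract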
Domain generalization (DG) is a challenging task that aims to train a robust model using only labeled source data so that it generalizes well to unseen target data. The domain gap between the source and target data may degrade performance. A plethora of methods resort to learning domain-invariant features to overcome this difficulty. However, these methods require sophisticated network designs or training strategies, which makes them inefficient and complex. In this paper, we first analyze and reclassify the features into two categories: implicitly disentangled features and explicitly disentangled features. Since we aim to design a generic DG algorithm that alleviates the problems mentioned above, we focus on the explicitly disentangled features because of their simplicity and interpretability. Based on our analysis, we find that the shape features of images are a simple and elegant choice. We extract the shape features in two ways. On the network side, we propose Multi-Scale Amplitude Mixing (MSAM), which uses the Fourier transform to strengthen shape features at different layers of the network. On the input side, we propose a new data augmentation method called Random Shape Warping (RSW), which encourages the model to concentrate on the global structures of objects. RSW randomly distorts local parts of the images while keeping the global structures unchanged, which can further improve the robustness of the model. Our methods are simple yet efficient and can be conveniently used as plug-and-play modules. They can outperform state-of-the-art (SOTA) methods without bells and whistles.
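The two ideas named in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the abstract does not give the details of MSAM or RSW, so the function names `amplitude_mix` and `random_local_warp`, the mixing weight `lam`, and the parameters `grid` and `max_shift` are all hypothetical. The first function assumes that "amplitude mixing" means blending the Fourier amplitude spectra of two samples while keeping the phase of the first, since phase is commonly taken to carry shape information. The second assumes that local distortion can be approximated by small random shifts applied per cell of a coarse grid.

```python
import numpy as np


def amplitude_mix(x_a, x_b, lam):
    """Blend the amplitude spectrum of x_b into x_a, keeping x_a's phase.

    Hypothetical sketch: x_a and x_b are real arrays of shape (..., H, W),
    and lam in [0, 1] is an assumed mixing weight (0 returns x_a unchanged).
    """
    f_a = np.fft.fft2(x_a, axes=(-2, -1))
    f_b = np.fft.fft2(x_b, axes=(-2, -1))
    amp_a, phase_a = np.abs(f_a), np.angle(f_a)
    amp_b = np.abs(f_b)
    # Linear interpolation between the two amplitude spectra.
    amp_mix = (1.0 - lam) * amp_a + lam * amp_b
    # Recombine the mixed amplitude with the original phase of x_a.
    f_mix = amp_mix * np.exp(1j * phase_a)
    return np.real(np.fft.ifft2(f_mix, axes=(-2, -1)))


def random_local_warp(img, grid=4, max_shift=2, rng=None):
    """Shift each cell of a coarse grid by a small random integer offset.

    Hypothetical sketch: img has shape (H, W) or (H, W, C). Small shifts
    distort local parts while the overall layout stays in place.
    """
    rng = np.random.default_rng() if rng is None else rng
    h, w = img.shape[:2]
    # One (dy, dx) offset per grid cell.
    dy = rng.integers(-max_shift, max_shift + 1, size=(grid, grid))
    dx = rng.integers(-max_shift, max_shift + 1, size=(grid, grid))
    # Map every pixel row/column to the grid cell that contains it.
    cell_y = np.arange(h) * grid // h
    cell_x = np.arange(w) * grid // w
    yy, xx = np.meshgrid(np.arange(h), np.arange(w), indexing="ij")
    src_y = np.clip(yy + dy[cell_y[:, None], cell_x[None, :]], 0, h - 1)
    src_x = np.clip(xx + dx[cell_y[:, None], cell_x[None, :]], 0, w - 1)
    # Nearest-neighbour resampling from the displaced coordinates.
    return img[src_y, src_x]
```

Under these assumptions, `amplitude_mix` could be applied to intermediate feature maps at several layers, and `random_local_warp` to input images before they reach the network. A smooth displacement field with interpolation would be a natural refinement of the piecewise-constant shifts used here.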
Pages: 6360-6373
Page count: 14
Related papers
50 in total
  • [11] Exploring rounD Dataset for Domain Generalization in Autonomous Vehicle Trajectory Prediction
    Zhang, Zikai
    SENSORS, 2024, 24 (23)
  • [12] SADGFeat: Learning local features with layer spatial attention and domain generalization
    Bai, Wenjing
    Zhang, Yunzhou
    Wang, Li
    Liu, Wei
    Hu, Jun
    Huang, Guan
    IMAGE AND VISION COMPUTING, 2024, 146
  • [13] Feature Stylization Adversarial Domain Generalization
    Hu, Zhengzhong
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [14] Cross-modal domain generalization semantic segmentation based on fusion features
    Yue, Wanlin
    Zhou, Zhiheng
    Cao, Yinglie
    Liuman
    KNOWLEDGE-BASED SYSTEMS, 2024, 302
  • [15] Exploring Regularization Methods for Domain Generalization in Accelerometer-Based Human Activity Recognition
    Bento, Nuno
    Rebelo, Joana
    Carreiro, Andre V.
    Ravache, Francois
    Barandas, Marilia
    SENSORS, 2023, 23 (14)
  • [16] DIFLF: A domain-invariant features learning framework for single-source domain generalization in mammogram classification
    Xie, Wanfang
    Liu, Zhenyu
    Zhao, Litao
    Wang, Meiyun
    Tian, Jie
    Liu, Jiangang
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2025, 261
  • [17] TOWARDS DOMAIN GENERALIZATION IN UNDERWATER OBJECT DETECTION
    Liu, Hong
    Song, Pinhao
    Ding, Runwei
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1971 - 1975
  • [18] Style Normalization and Restitution for Domain Generalization and Adaptation
    Jin, Xin
    Lan, Cuiling
    Zeng, Wenjun
    Chen, Zhibo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 24 : 3636 - 3651
  • [19] Latent Feature Disentanglement for Visual Domain Generalization
    Gholami, Behnam
    El-Khamy, Mostafa
    Song, Kee-Bong
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 32 : 5751 - 5763
  • [20] DOMAIN GENERALIZATION WITH FOURIER TRANSFORM AND SOFT THRESHOLDING
    Pan, Hongyi
    Wang, Bin
    Zhang, Zheyuan
    Zhu, Xin
    Jha, Debesh
    Cetin, Ahmet Enis
    Spampinato, Concetto
    Bagci, Ulas
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024, 2024, : 2106 - 2110