Exploring Explicitly Disentangled Features for Domain Generalization

被引:11
|
作者
Li, Jingwei [1 ,2 ]
Li, Yuan [1 ,2 ]
Wang, Huanjie [1 ,2 ]
Liu, Chengbao [1 ]
Tan, Jie [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Automat, Beijing 100080, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 101408, Peoples R China
基金
中国国家自然科学基金;
关键词
Domain generalization; feature disentanglement; Fourier transform; data augmentation;
D O I
10.1109/TCSVT.2023.3269534
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Domain generalization (DG) is a challenging task that aims to train a robust model with only labeled source data and can generalize well on unseen target data. The domain gap between the source and target data may degrade the performance. A plethora of methods resort to obtaining domain-invariant features to overcome the difficulties. However, these methods require sophisticated network designs or training strategies, causing inefficiency and complexity. In this paper, we first analyze and reclassify the features into two categories, i.e., implicitly disentangled ones and explicitly disentangled counterparts. Since we aim to design a generic algorithm for DG to alleviate the problems mentioned above, we focus more on the explicitly disentangled features due to their simplicity and interpretability. We find out that the shape features of images are simple and elegant choices based on our analysis. We extract the shape features from two aspects. In the aspect of networks, we propose Multi-Scale Amplitude Mixing (MSAM) to strengthen shape features at different layers of the network by Fourier transform. In the aspect of inputs, we propose a new data augmentation method called Random Shape Warping (RSW) to facilitate the model to concentrate more on the global structures of the objects. RSW randomly distorts the local parts of the images and keeps the global structures unchanged, which can further improve the robustness of the model. Our methods are simple yet efficient and can be conveniently used as plug-and-play modules. They can outperform state-of-the-art (SOTA) methods without bells and whistles.
引用
收藏
页码:6360 / 6373
页数:14
相关论文
共 50 条
  • [41] Domain-Specific Risk Minimization for Domain Generalization
    Zhang, Yi-Fan
    Wang, Jindong
    Liang, Jian
    Zhang, Zhang
    Yu, Baosheng
    Wang, Liang
    Tao, Dacheng
    Xie, Xing
    PROCEEDINGS OF THE 29TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2023, 2023, : 3409 - 3421
  • [42] Domain Adversarial Active Learning for Domain Generalization Classification
    Chen, Jianting
    Ding, Ling
    Yang, Yunxiao
    Di, Zaiyuan
    Xiang, Yang
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2025, 37 (01) : 226 - 238
  • [43] Domain-aware triplet loss in domain generalization
    Guo, Kaiyu
    Lovell, Brian C.
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 243
  • [44] Cross-Domain Gated Learning for Domain Generalization
    Dapeng Du
    Jiawei Chen
    Yuexiang Li
    Kai Ma
    Gangshan Wu
    Yefeng Zheng
    Limin Wang
    International Journal of Computer Vision, 2022, 130 : 2842 - 2857
  • [45] Inter-domain curriculum learning for domain generalization
    Kim, Daehee
    Kim, Jinkyu
    Lee, Jaekoo
    ICT EXPRESS, 2022, 8 (02): : 225 - 229
  • [46] Domain Attention Model for Domain Generalization in Object Detection
    He, Weixiong
    Zheng, Huicheng
    Lai, Jianhuang
    PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT IV, 2018, 11259 : 27 - 39
  • [47] Cross-domain Ensemble Distillation for Domain Generalization
    Lee, Kyungmoon
    Kim, Sungyeon
    Kwak, Suha
    COMPUTER VISION, ECCV 2022, PT XXV, 2022, 13685 : 1 - 20
  • [48] Contextual Distribution Alignment via Correlation Contrasting for Domain Generalization
    Lin, Huibin
    Zhang, Chun-Yang
    Philip Chen, C. L.
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 35 (04) : 3619 - 3632
  • [49] Cross-Domain Gated Learning for Domain Generalization
    Du, Dapeng
    Chen, Jiawei
    Li, Yuexiang
    Ma, Kai
    Wu, Gangshan
    Zheng, Yefeng
    Wang, Limin
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (11) : 2842 - 2857
  • [50] Respecting Domain Relations: Hypothesis Invariance for Domain Generalization
    Wang, Ziqi
    Loog, Marco
    van Gemert, Jan
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 9756 - 9763