Latent Feature Disentanglement for Visual Domain Generalization

Cited by: 4
Authors
Gholami, Behnam [1]
El-Khamy, Mostafa [1, 2]
Song, Kee-Bong [1]
Affiliations
[1] Samsung Semicond Inc, Samsung Device Solut Res Amer, San Diego, CA 92126 USA
[2] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt
Keywords
Domain generalization; latent feature; feature disentanglement; image-to-image translation; StarGAN; adversarial networks
DOI
10.1109/TIP.2023.3321511
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Despite remarkable success in a variety of computer vision applications, it is well known that deep learning can fail catastrophically when presented with out-of-distribution data, where there are usually style differences between the training and test images. Toward addressing this challenge, we consider the domain generalization problem, wherein predictors are trained using data drawn from a family of related training (source) domains and then evaluated on a distinct and unseen test domain. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalize imperfectly to test domains. Data augmentation has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to simple transformations such as rotation and brightness change. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in the image style). In this paper, taking advantage of multiple source domains, we propose a novel approach to express and formalize robustness to these kinds of real-world image perturbations. The three key ideas underlying our formulation are (1) leveraging disentangled representations of the images to define different factors of variation, (2) generating perturbed images by changing the factors that compose the representations of the images, and (3) enforcing the learner (classifier) to be invariant to such changes in the images. We use image-to-image translation models to demonstrate the efficacy of this approach. Based on this, we propose a domain-invariant regularization (DIR) loss function that enforces invariant prediction of targets (class labels) across domains, which yields improved generalization performance.
We demonstrate the effectiveness of our approach on several widely used datasets for the domain generalization problem, on all of which our results are competitive with the state-of-the-art.
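The three ideas in the abstract can be illustrated with a small numerical sketch. This is not the authors' DIR loss, whose exact form the abstract does not give. It assumes a generic consistency regularizer: cross-entropy on the original images plus a KL-divergence penalty between predictions on the original and style-perturbed images. The function names (`swap_style`, `invariance_penalty`, `regularized_loss`), the split of a latent code into "content" and "style" parts, and the `weight` parameter are all hypothetical.

```python
import numpy as np

def softmax(logits):
    # Row-wise softmax with max subtraction for numerical stability.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def swap_style(content, style, rng):
    # Assumed perturbation: keep each sample's content code and pair it
    # with the style code of another (randomly permuted) sample.
    perm = rng.permutation(len(style))
    return np.concatenate([content, style[perm]], axis=1)

def cross_entropy(probs, labels):
    # Mean negative log-likelihood of the true class.
    eps = 1e-12
    picked = probs[np.arange(len(labels)), labels]
    return float(-np.mean(np.log(picked + eps)))

def invariance_penalty(p_orig, p_pert):
    # Mean KL(p_orig || p_pert); zero when both predictions agree.
    eps = 1e-12
    kl = np.sum(p_orig * (np.log(p_orig + eps) - np.log(p_pert + eps)), axis=1)
    return float(np.mean(kl))

def regularized_loss(logits_orig, logits_pert, labels, weight=1.0):
    # Classification loss plus a weighted prediction-consistency term
    # (an assumed stand-in for a domain-invariant regularizer).
    p_orig = softmax(logits_orig)
    p_pert = softmax(logits_pert)
    return cross_entropy(p_orig, labels) + weight * invariance_penalty(p_orig, p_pert)
```

In this sketch, `logits_pert` would come from running the classifier on images decoded from the output of `swap_style`. When the classifier's predictions do not change under the style swap, the penalty vanishes and only the cross-entropy term remains.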
Pages: 5751-5763
Page count: 13