Latent Feature Disentanglement for Visual Domain Generalization

被引:4
|
作者
Gholami, Behnam [1 ]
El-Khamy, Mostafa [1 ,2 ]
Song, Kee-Bong [1 ]
机构
[1] Samsung Semicond Inc, Samsung Device Solut Res Amer, San Diego, CA 92126 USA
[2] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt
关键词
Domain generalization; latent feature; feature disentanglement; image to image translation; StarGAN; ADVERSARIAL NETWORKS;
D O I
10.1109/TIP.2023.3321511
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Despite remarkable success in a variety of computer vision applications, it is well-known that deep learning can fail catastrophically when presented with out-of-distribution data, where there are usually style differences between the training and test images. Toward addressing this challenge, we consider the domain generalization problem, wherein predictors are trained using data drawn from a family of related training (source) domains and then evaluated on a distinct and unseen test domain. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalizes imperfectly to test domains. Data augmentation has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to simple transformations like rotation, brightness change, etc. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in the image style). In this paper, taking the advantage of multiple source domains, we propose a novel approach to express and formalize robustness to these kind of real-world image perturbations. The three key ideas underlying our formulation are (1) leveraging disentangled representations of the images to define different factors of variations, (2) generating perturbed images by changing such factors composing the representations of the images, (3) enforcing the learner (classifier) to be invariant to such changes in the images. We use image-to-image translation models to demonstrate the efficacy of this approach. Based on this, we propose a domain-invariant regularization (DIR) loss function that enforces invariant prediction of targets (class labels) across domains which yields improved generalization performance. We demonstrate the effectiveness of our approach on several widely used datasets for the domain generalization problem, on all of which our results are competitive with the state-of-the-art.
引用
收藏
页码:5751 / 5763
页数:13
相关论文
共 50 条
  • [41] Treat Noise as Domain Shift: Noise Feature Disentanglement for Underwater Perception and Maritime Surveys in Side-Scan Sonar Images
    Yu, Yongcan
    Zhao, Jianhu
    Huang, Chao
    Zhao, Xi
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [42] HCVP: Leveraging Hierarchical Contrastive Visual Prompt for Domain Generalization
    Zhou, Guanglin
    Han, Zhongyi
    Chen, Shiming
    Huang, Biwei
    Zhu, Liming
    Liu, Tongliang
    Yao, Lina
    Zhang, Kun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1142 - 1152
  • [43] Learning generalized visual relations for domain generalization semantic segmentation
    Li, Zijun
    Liao, Muxin
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 267
  • [44] Domain generalization through latent distribution exploration for motor imagery EEG classification
    Song, Hao
    She, Qingshan
    Fang, Feng
    Liu, Su
    Chen, Yun
    Zhang, Yingchun
    NEUROCOMPUTING, 2025, 614
  • [45] Exploiting Low-Rank Structure from Latent Domains for Domain Generalization
    Xu, Zheng
    Li, Wen
    Niu, Li
    Xu, Dong
    COMPUTER VISION - ECCV 2014, PT III, 2014, 8691 : 628 - 643
  • [46] Evolving Domain Generalization via Latent Structure-Aware Sequential Autoencoder
    Qin, Tiexin
    Wang, Shiqi
    Li, Haoliang
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (12) : 14514 - 14527
  • [47] Generative feature style augmentation for domain generalization in medical image segmentation
    Huang, Yunzhi
    Han, Luyi
    Dou, Haoran
    PATTERN RECOGNITION, 2025, 162
  • [48] Frequency domain guided latent diffusion model for domain generalization in cross-machine fault diagnosis
    Liu, Xiaolin
    Liu, Fuzheng
    Geng, Xiangyi
    Fan, Longqing
    Jiang, Mingshun
    Zhang, Faye
    MEASUREMENT, 2025, 249
  • [49] A feature disentanglement and unsupervised domain adaptation of remaining useful life prediction for sensor-equipped machines
    Yan, Jianhai
    Ye, Zhi-Sheng
    He, Shuguang
    He, Zhen
    RELIABILITY ENGINEERING & SYSTEM SAFETY, 2024, 242
  • [50] Cross-Domain Few-Shot Learning Based on Feature Disentanglement for Hyperspectral Image Classification
    Qin, Boao
    Feng, Shou
    Zhao, Chunhui
    Li, Wei
    Tao, Ran
    Xiang, Wei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62 : 1 - 15