Latent Feature Disentanglement for Visual Domain Generalization

Cited by: 4

Authors:

Gholami, Behnam [1]

El-Khamy, Mostafa [1,2]

Song, Kee-Bong [1]

Affiliations:

[1] Samsung Semicond Inc, Samsung Device Solut Res Amer, San Diego, CA 92126 USA

[2] Alexandria Univ, Dept Elect Engn, Alexandria 21544, Egypt

Source:

IEEE TRANSACTIONS ON IMAGE PROCESSING | 2023 / Vol. 32

Keywords:

Domain generalization; latent feature; feature disentanglement; image-to-image translation; StarGAN; adversarial networks

DOI:

10.1109/TIP.2023.3321511

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Despite remarkable success in a variety of computer vision applications, it is well known that deep learning can fail catastrophically when presented with out-of-distribution data, where there are usually style differences between the training and test images. Toward addressing this challenge, we consider the domain generalization problem, wherein predictors are trained using data drawn from a family of related training (source) domains and then evaluated on a distinct and unseen test domain. Naively training a model on the aggregate set of data (pooled from all source domains) has been shown to perform suboptimally, since the information learned by that model might be domain-specific and generalize imperfectly to test domains. Data augmentation has been shown to be an effective approach to overcome this problem. However, its application has been limited to enforcing invariance to simple transformations such as rotation and brightness change. Such perturbations do not necessarily cover plausible real-world variations that preserve the semantics of the input (such as a change in the image style). In this paper, taking advantage of multiple source domains, we propose a novel approach to express and formalize robustness to these kinds of real-world image perturbations. The three key ideas underlying our formulation are (1) leveraging disentangled representations of the images to define different factors of variation, (2) generating perturbed images by changing the factors that compose the representations of the images, and (3) enforcing that the learner (classifier) is invariant to such changes in the images. We use image-to-image translation models to demonstrate the efficacy of this approach. Based on this, we propose a domain-invariant regularization (DIR) loss function that enforces invariant prediction of targets (class labels) across domains, which yields improved generalization performance. We demonstrate the effectiveness of our approach on several widely used datasets for the domain generalization problem, on all of which our results are competitive with the state-of-the-art.
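
The abstract does not state the form of the DIR loss, so the following is only a minimal sketch of idea (3) under stated assumptions: a standard cross-entropy term on the original image plus a KL-divergence penalty between the classifier's prediction on the original image and its predictions on style-translated copies. The function names, the choice of KL divergence, and the `weight` parameter are all hypothetical and are not taken from the paper; the translated logits are assumed to come from some image-to-image translation model that is not shown.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(probs, label):
    """Negative log-likelihood of the true class index."""
    return -math.log(probs[label])


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions of equal length."""
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))


def invariance_regularized_loss(logits_original, logits_translated, label, weight=1.0):
    """Classification loss plus an invariance penalty (assumed form, not the paper's).

    logits_original:   classifier logits for the source image.
    logits_translated: list of logit vectors, one per style-translated copy
                       of the same image (hypothetical translator output).
    label:             index of the true class.
    weight:            hypothetical trade-off between the two terms.
    """
    p = softmax(logits_original)
    classification_term = cross_entropy(p, label)
    if not logits_translated:
        return classification_term
    # Average disagreement between the original and each translated prediction.
    penalty = sum(kl_divergence(p, softmax(l)) for l in logits_translated)
    penalty /= len(logits_translated)
    return classification_term + weight * penalty
```

Under these assumptions the penalty vanishes when the classifier predicts the same distribution for an image and all of its translated copies, and grows as the predictions diverge, which is the invariance property the abstract describes.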


Pages: 5751-5763

Page count: 13
