A Causality-Aware Perspective on Domain Generalization via Domain Intervention

被引:1
作者
Shao, Youjia [1 ]
Wang, Shaohui [1 ]
Zhao, Wencang [1 ,2 ,3 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Automat & Elect Engn, Qingdao 266061, Peoples R China
[2] Qingdao Inst Intelligent Nav & Control, Qingdao 266071, Peoples R China
[3] Shandong Key Lab Autonomous Landing Deep Space Exp, Qingdao 266061, Peoples R China
关键词
domain generalization; causal inference; counterfactual representation; domain intervention;
D O I
10.3390/electronics13101891
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most mainstream statistical models will achieve poor performance in Out-Of-Distribution (OOD) generalization. This is because these models tend to learn the spurious correlation between data and will collapse when the domain shift exists. If we want artificial intelligence (AI) to make great strides in real life, the current focus needs to be shifted to the OOD problem of deep learning models to explore the generalization ability under unknown environments. Domain generalization (DG) focusing on OOD generalization is proposed, which is able to transfer the knowledge extracted from multiple source domains to the unseen target domain. We are inspired by intuitive thinking about human intelligence relying on causality. Unlike relying on plain probability correlations, we apply a novel causal perspective to DG, which can improve the OOD generalization ability of the trained model by mining the invariant causal mechanism. Firstly, we construct the inclusive causal graph for most DG tasks through stepwise causal analysis based on the data generation process in the natural environment and introduce the reasonable Structural Causal Model (SCM). Secondly, based on counterfactual inference, causal semantic representation learning with domain intervention (CSRDN) is proposed to train a robust model. In this regard, we generate counterfactual representations for different domain interventions, which can help the model learn causal semantics and develop generalization capacity. At the same time, we seek the Pareto optimal solution in the optimization process based on the loss function to obtain a more advanced training model. Extensive experimental results of Rotated MNIST and PACS as well as VLCS datasets verify the effectiveness of the proposed CSRDN. The proposed method can integrate causal inference into domain generalization by enhancing interpretability and applicability and brings a boost to challenging OOD generalization problems.
引用
收藏
页数:18
相关论文
共 58 条
[51]  
Vapnik V., 2013, The nature of statistical learning theory
[52]  
Volpi R, 2018, ADV NEUR IN, V31
[53]   Deep visual domain adaptation: A survey [J].
Wang, Mei ;
Deng, Weihong .
NEUROCOMPUTING, 2018, 312 :135-153
[54]  
Yan S, 2020, Arxiv, DOI arXiv:2001.00677
[55]   CausalVAE: Disentangled Representation Learning via Neural Structural Causal Models [J].
Yang, Mengyue ;
Liu, Furui ;
Chen, Zhitang ;
Shen, Xinwei ;
Hao, Jianye ;
Wang, Jun .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :9588-9597
[56]   Deep Stable Learning for Out-Of-Distribution Generalization [J].
Zhang, Xingxuan ;
Cui, Peng ;
Xu, Renzhe ;
Zhou, Linjun ;
He, Yue ;
Shen, Zheyan .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :5368-5378
[57]   Exact Feature Distribution Matching for Arbitrary Style Transfer and Domain Generalization [J].
Zhang, Yabin ;
Li, Minghan ;
Li, Ruihuang ;
Jia, Kui ;
Zhang, Lei .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, :8025-8035
[58]   Domain Generalization: A Survey [J].
Zhou, Kaiyang ;
Liu, Ziwei ;
Qiao, Yu ;
Xiang, Tao ;
Loy, Chen Change .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (04) :4396-4415