A Causality-Aware Perspective on Domain Generalization via Domain Intervention

被引:1
作者
Shao, Youjia [1 ]
Wang, Shaohui [1 ]
Zhao, Wencang [1 ,2 ,3 ]
机构
[1] Qingdao Univ Sci & Technol, Coll Automat & Elect Engn, Qingdao 266061, Peoples R China
[2] Qingdao Inst Intelligent Nav & Control, Qingdao 266071, Peoples R China
[3] Shandong Key Lab Autonomous Landing Deep Space Exp, Qingdao 266061, Peoples R China
关键词
domain generalization; causal inference; counterfactual representation; domain intervention;
D O I
10.3390/electronics13101891
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Most mainstream statistical models will achieve poor performance in Out-Of-Distribution (OOD) generalization. This is because these models tend to learn the spurious correlation between data and will collapse when the domain shift exists. If we want artificial intelligence (AI) to make great strides in real life, the current focus needs to be shifted to the OOD problem of deep learning models to explore the generalization ability under unknown environments. Domain generalization (DG) focusing on OOD generalization is proposed, which is able to transfer the knowledge extracted from multiple source domains to the unseen target domain. We are inspired by intuitive thinking about human intelligence relying on causality. Unlike relying on plain probability correlations, we apply a novel causal perspective to DG, which can improve the OOD generalization ability of the trained model by mining the invariant causal mechanism. Firstly, we construct the inclusive causal graph for most DG tasks through stepwise causal analysis based on the data generation process in the natural environment and introduce the reasonable Structural Causal Model (SCM). Secondly, based on counterfactual inference, causal semantic representation learning with domain intervention (CSRDN) is proposed to train a robust model. In this regard, we generate counterfactual representations for different domain interventions, which can help the model learn causal semantics and develop generalization capacity. At the same time, we seek the Pareto optimal solution in the optimization process based on the loss function to obtain a more advanced training model. Extensive experimental results of Rotated MNIST and PACS as well as VLCS datasets verify the effectiveness of the proposed CSRDN. The proposed method can integrate causal inference into domain generalization by enhancing interpretability and applicability and brings a boost to challenging OOD generalization problems.
引用
收藏
页数:18
相关论文
共 58 条
[1]  
Arjovsky M, 2020, Arxiv, DOI arXiv:1907.02893
[2]  
Arjovsky M, 2017, PR MACH LEARN RES, V70
[3]  
Blanchard G, 2021, J MACH LEARN RES, V22
[4]  
Bottou L, 2013, J MACH LEARN RES, V14, P3207
[5]   Hallucinating Agnostic Images to Generalize Across Domains [J].
Carlucci, Fabio M. ;
Russo, Paolo ;
Tommasi, Tatiana ;
Caputo, Barbara .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, :3227-3234
[6]   A Style and Semantic Memory Mechanism for Domain Generalization [J].
Chen, Yang ;
Wang, Yu ;
Pan, Yingwei ;
Yao, Ting ;
Tian, Xinmei ;
Mei, Tao .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9144-9153
[7]   A Causal Framework for Distribution Generalization [J].
Christiansen, Rune ;
Pfister, Niklas ;
Jakobsen, Martin Emil ;
Gnecco, Nicola ;
Peters, Jonas .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) :6614-6630
[8]  
Dou Q, 2019, ADV NEUR IN, V32
[9]   Unbiased Metric Learning: On the Utilization of Multiple Datasets and Web Images for Softening Bias [J].
Fang, Chen ;
Xu, Ye ;
Rockmore, Daniel N. .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :1657-1664
[10]  
Finn C, 2017, PR MACH LEARN RES, V70