Learning Causal Representation for Training Cross-Domain Pose Estimator via Generative Interventions

被引:15
作者
Zhang, Xiheng [1 ]
Wong, Yongkang [2 ]
Wu, Xiaofei [3 ]
Lu, Juwei [3 ]
Kankanhalli, Mohan [2 ]
Li, Xiangdong [1 ]
Geng, Weidong [1 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, State Key Lab CAD CG, Hangzhou, Peoples R China
[2] Natl Univ Singapore, Sch Comp, Singapore, Singapore
[3] Huawei Noahs Ark Lab, Xian, Peoples R China
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
关键词
D O I
10.1109/ICCV48922.2021.01108
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D pose estimation has attracted increasing attention with the availability of high-quality benchmark datasets. However, prior works show that deep learning models tend to learn spurious correlations, which fail to generalize beyond the specific dataset they are trained on. In this work, we take a step towards training robust models for crossdomain pose estimation task, which brings together ideas from causal representation learning and generative adversarial networks. Specifically, this paper introduces a novel framework for causal representation learning which explicitly exploits the causal structure of the task. We consider changing domain as interventions on images under the data-generation process and steer the generative model to produce counterfactual features. This help the model learn transferable and causal relations across different domains. Our framework is able to learn with various types of unlabeled datasets. We demonstrate the efficacy of our proposed method on both human and hand pose estimation task. The experiment results show the proposed approach achieves state-of-the-art performance on most datasets for both domain adaptation and domain generalization settings.
引用
收藏
页码:11250 / 11260
页数:11
相关论文
共 77 条
[1]  
[Anonymous], 2016, ICML
[2]  
[Anonymous], 2016, ICML
[3]  
[Anonymous], 2018, Advances in Neural Information Processing Systems
[4]  
[Anonymous], 2017, ICML
[5]  
[Anonymous], 2010, International journal of computer vision, DOI DOI 10.1007/s11263-009-0275-4
[6]  
[Anonymous], 2021, CVPR, DOI DOI 10.1109/TSMC.2019.2958072
[7]  
Arjovsky Martin, 2021, ICLR, P2021
[8]   Weakly-supervised Domain Adaptation via GAN and Mesh Model for Estimating 3D Hand Poses Interacting Objects [J].
Baek, Seungryul ;
Kim, Kwang In ;
Kim, Tae-Kyun .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, :6120-6130
[9]   Learning to Detect Human-Object Interactions with Knowledge [J].
Xu, Bingjie ;
Wong, Yongkang ;
Li, Junnan ;
Zhao, Qi ;
Kankanhalli, Mohan S. .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :2019-2028
[10]  
Chalupka K, 2015, UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, P181