PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation

被引:8
|
作者
Liu, Hanbing [1 ]
He, Jun-Yan [2 ]
Cheng, Zhi-Qi [3 ]
Xiang, Wangmeng [2 ]
Yang, Qize [2 ]
Chai, Wenhao [4 ]
Wang, Gaoang [5 ]
Bao, Xu [2 ]
Luo, Bin [2 ]
Geng, Yifeng [2 ]
Xie, Xuansong [2 ]
机构
[1] Tsinghua Univ, Beijing, Peoples R China
[2] Alibaba Grp, DAMO Acad, Hangzhou, Peoples R China
[3] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
[4] Univ Washington, Seattle, WA 98195 USA
[5] Zhejiang Univ, Hangzhou, Peoples R China
来源
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023 | 2023年
关键词
3D human pose estimation; diffusion model; domain-adaptation; multi-hypothesis; Low-Rank adaptation;
D O I
10.1145/3581783.3612368
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The current 3D human pose estimators face challenges in adapting to new datasets due to the scarcity of 2D-3D pose pairs in target domain training sets. We present the Multi-Hypothesis Pose Synthesis Domain Adaptation (PoSynDA) framework to overcome this issue without extensive target domain annotation. Utilizing a diffusion-centric structure, PoSynDA simulates the 3D pose distribution in the target domain, filling the data diversity gap. By incorporating a multi-hypothesis network, it creates diverse pose hypotheses and aligns them with the target domain. Target-specific source augmentation obtains the target domain distribution data from the source domain by decoupling the scale and position parameters. The teacher-student paradigm and low-rank adaptation further refine the process. PoSynDA demonstrates competitive performance on benchmarks, such as Human3.6M, MPI-INF-3DHP, and 3DPW, even comparable with the target-trained MixSTE model [66]. This work paves the way for the practical application of 3D human pose estimation.(1)
引用
收藏
页码:5542 / 5551
页数:10
相关论文
共 29 条
  • [21] SILHOUETTE-BASED SYNTHETIC DATA GENERATION FOR 3D HUMAN POSE ESTIMATION WITH A SINGLE WRIST-MOUNTED 360° CAMERA
    Hori, Ryosuke
    Hachiuma, Ryo
    Saito, Hideo
    Isogawa, Mariko
    Mikami, Dan
    2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 1304 - 1308
  • [22] Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency
    Zhou, Xingyi
    Karpur, Arjun
    Gan, Chuang
    Luo, Linjie
    Huang, Qixing
    COMPUTER VISION - ECCV 2018, PT XII, 2018, 11216 : 141 - 157
  • [23] CAD-to-real: enabling deep neural networks for 3D pose estimation of electronic control units
    Baeuerle, Simon
    Boehland, Moritz
    Barth, Jonas
    Reischl, Markus
    Steimer, Andreas
    Mikut, Ralf
    AT-AUTOMATISIERUNGSTECHNIK, 2021, 69 (10) : 880 - 891
  • [24] Multi-task Domain Adaptation for Language Grounding with 3D Objects
    Sun, Penglei
    Song, Yaoxian
    Pan, Xinglin
    Dong, Peijie
    Yang, Xiaofei
    Wang, Qiang
    Li, Zhixu
    Li, Tiefeng
    Chu, Xiaowen
    COMPUTER VISION - ECCV 2024, PT XXXIV, 2025, 15092 : 387 - 404
  • [25] Pose-Invariant Facial Expression Recognition Based on 3D Face Morphable Model and Domain Adversarial Learning
    Ma, Xiao
    Zhang, Kaige
    Yang, Xuan
    IMAGE AND GRAPHICS, ICIG 2019, PT III, 2019, 11903 : 491 - 502
  • [26] Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions
    Ai, Yihao
    Qi, Yifei
    Wang, Bo
    Cheng, Yu
    Wang, Xinchao
    Tan, Robby T.
    COMPUTER VISION - ECCV 2024, PT XLVII, 2025, 15105 : 221 - 239
  • [27] Adversarial unsupervised domain adaptation for 3D semantic segmentation with multi-modal learning
    Liu, Wei
    Luo, Zhiming
    Cai, Yuanzheng
    Yu, Ying
    Ke, Yang
    Marcato Junior, Jose
    Goncalves, Wesley Nunes
    Li, Jonathan
    ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2021, 176 : 211 - 221
  • [28] Multi-Scale Part-Based Feature Representation for 3D Domain Generalization and Adaptation
    Wei, Xin
    Gu, Xiang
    Sun, Jian
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 47 (03) : 1414 - 1430
  • [29] Diffusion model with temporal constraint for 3D human pose estimationDiffusion model with temporal constraint...Z. Chen et al.
    Zhangmeng Chen
    Ju Dai
    Junjun Pan
    Feng Zhou
    The Visual Computer, 2025, 41 (8) : 5961 - 5977