3d human pose estimation based on conditional dual-branch diffusion

被引:0
作者
Li, Jinghua [1 ]
Bai, Zhuowei [1 ]
Kong, Dehui [1 ]
Chen, Dongpan [1 ]
Li, Qianxing [1 ]
Yin, Baocai [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Human pose estimation; Diffusion model; Dual-branch; Joint semantics;
D O I
10.1007/s00530-024-01569-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Thanks to the development of 2D keypoint detectors, monocular 3D human pose estimation (HPE) via 2D-to-3D lifting approaches have achieved remarkable improvements. However, monocular 3D HPE is still a challenging problem due to the inherent depth ambiguities and occlusions. Recently, diffusion models have achieved great success in the field of image generation. Inspired by this, we transform 3D human pose estimation problem into a reverse diffusion process, and propose a dual-branch diffusion model so as to handle the indeterminacy and uncertainty of 3D pose and fully explore the global and local correlations between joints. Furthermore, we propose conditional dual-branch diffusion model to enhance the performance of 3D human pose estimation, in which the joint-level semantic information are regarded as the condition of the diffusion model, and integrated into the joint-level representations of 2D pose to enhance the expression of joints. The proposed method is verified on two widely used datasets and the experimental results have demonstrated the superiority.
引用
收藏
页数:11
相关论文
共 34 条
  • [1] Ailing Zeng, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12359), P507, DOI 10.1007/978-3-030-58568-6_30
  • [2] Cai Jialun, 2023, IEEE INT C AC SPEECH, P1
  • [3] Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks
    Cai, Yujun
    Ge, Liuhao
    Liu, Jun
    Cai, Jianfei
    Cham, Tat-Jen
    Yuan, Junsong
    Thalmann, Nadia Magnenat
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2272 - 2281
  • [4] Cascaded Pyramid Network for Multi-Person Pose Estimation
    Chen, Yilun
    Wang, Zhicheng
    Peng, Yuxiang
    Zhang, Zhiqiang
    Yu, Gang
    Sun, Jian
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
  • [5] DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model
    Choi, Jeongjun
    Shim, Dongseok
    Kim, H. Jin
    [J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 3773 - 3780
  • [6] Optimizing Network Structure for 3D Human Pose Estimation
    Ci, Hai
    Wang, Chunyu
    Ma, Xiaoxuan
    Wang, Yizhou
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2262 - 2271
  • [7] Dhariwal P, 2021, ADV NEUR IN, V34
  • [8] Motion Adaptive Pose Estimation from Compressed Videos
    Fan, Zhipeng
    Liu, Jun
    Wang, Yao
    [J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11699 - 11708
  • [9] System-status-aware Adaptive Network for Online Streaming Video Understanding
    Foo, Lin Geng
    Gong, Jia
    Fan, Zhipeng
    Liu, Jun
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10514 - 10523
  • [10] DiffPose: Toward More Reliable 3D Pose Estimation
    Gong, Jia
    Foo, Lin Geng
    Fan, Zhipeng
    Ke, Qiuhong
    Rahmani, Hossein
    Liu, Jun
    [J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13041 - 13051