3d human pose estimation based on conditional dual-branch diffusion

被引：0

作者：

Li, Jinghua ^{[1
]}

Bai, Zhuowei ^{[1
]}

Kong, Dehui ^{[1
]}

Chen, Dongpan ^{[1
]}

Li, Qianxing ^{[1
]}

Yin, Baocai ^{[1
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China

来源：

MULTIMEDIA SYSTEMS | 2025年 / 31卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Human pose estimation; Diffusion model; Dual-branch; Joint semantics;

D O I：

10.1007/s00530-024-01569-5

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Thanks to the development of 2D keypoint detectors, monocular 3D human pose estimation (HPE) via 2D-to-3D lifting approaches have achieved remarkable improvements. However, monocular 3D HPE is still a challenging problem due to the inherent depth ambiguities and occlusions. Recently, diffusion models have achieved great success in the field of image generation. Inspired by this, we transform 3D human pose estimation problem into a reverse diffusion process, and propose a dual-branch diffusion model so as to handle the indeterminacy and uncertainty of 3D pose and fully explore the global and local correlations between joints. Furthermore, we propose conditional dual-branch diffusion model to enhance the performance of 3D human pose estimation, in which the joint-level semantic information are regarded as the condition of the diffusion model, and integrated into the joint-level representations of 2D pose to enhance the expression of joints. The proposed method is verified on two widely used datasets and the experimental results have demonstrated the superiority.

引用

页数：11

共 34 条

[1] Ailing Zeng, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12359), P507, DOI 10.1007/978-3-030-58568-6_30
[2] Cai Jialun, 2023, IEEE INT C AC SPEECH, P1
[3] Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks
Cai, Yujun
Ge, Liuhao
Liu, Jun
Cai, Jianfei
Cham, Tat-Jen
Yuan, Junsong
Thalmann, Nadia Magnenat
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2272 - 2281
[4] Cascaded Pyramid Network for Multi-Person Pose Estimation
Chen, Yilun
Wang, Zhicheng
Peng, Yuxiang
Zhang, Zhiqiang
Yu, Gang
Sun, Jian
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
[5] DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model
Choi, Jeongjun
Shim, Dongseok
Kim, H. Jin
[J]. 2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023, : 3773 - 3780
[6] Optimizing Network Structure for 3D Human Pose Estimation
Ci, Hai
Wang, Chunyu
Ma, Xiaoxuan
Wang, Yizhou
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2262 - 2271
[7] Dhariwal P, 2021, ADV NEUR IN, V34
[8] Motion Adaptive Pose Estimation from Compressed Videos
Fan, Zhipeng
Liu, Jun
Wang, Yao
[J]. 2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 11699 - 11708
[9] System-status-aware Adaptive Network for Online Streaming Video Understanding
Foo, Lin Geng
Gong, Jia
Fan, Zhipeng
Liu, Jun
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 10514 - 10523
[10] DiffPose: Toward More Reliable 3D Pose Estimation
Gong, Jia
Foo, Lin Geng
Fan, Zhipeng
Ke, Qiuhong
Rahmani, Hossein
Liu, Jun
[J]. 2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 13041 - 13051

← 1 2 3 4 →