LEARNING MONOCULAR 3D HUMAN POSE ESTIMATION WITH SKELETAL INTERPOLATION

Cited by: 2

Authors:

Chen, Ziyi [1]

Sugimoto, Akihiro [2]

Lai, Shang-Hong [1]

Affiliations:

[1] National Tsing Hua University, Hsinchu, Taiwan

[2] National Institute of Informatics, Tokyo, Japan

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022

Keywords:

Data augmentation; skeletal interpolation; transformer; 3D human pose estimation

DOI:

10.1109/ICASSP43922.2022.9746410

Chinese Library Classification:

O42 [Acoustics]

Subject classification codes:

070206; 082403

Abstract:

Deep learning has achieved unprecedented accuracy for monocular 3D human pose estimation. However, current learning-based 3D human pose estimation still suffers from poor generalization. Inspired by skeletal animation, which is popular in game development and animation production, we put forward a simple, intuitive, yet effective interpolation-based data augmentation approach that synthesizes continuous and diverse 3D human body sequences to enhance model generalization. The Transformer-based lifting network, trained with the augmented data, uses the self-attention mechanism to perform 2D-to-3D lifting and infers high-quality predictions in the qualitative experiment. The quantitative results of the cross-dataset experiment demonstrate that the resulting model achieves superior generalization accuracy on a publicly available dataset.
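
The abstract does not say how the interpolation is performed, so the following is only a minimal sketch of the general idea borrowed from skeletal animation: blending two keyframe poses joint by joint with spherical linear interpolation (slerp) of unit quaternions. The pose representation (a list of per-joint `(w, x, y, z)` quaternions) and the names `slerp` and `interpolate_poses` are assumptions made for illustration and are not taken from the paper.

```python
import math


def slerp(q0, q1, t):
    """Spherical linear interpolation between two unit quaternions (w, x, y, z)."""
    dot = sum(a * b for a, b in zip(q0, q1))
    if dot < 0.0:
        # q and -q encode the same rotation; flip one to follow the shorter arc.
        q1 = tuple(-c for c in q1)
        dot = -dot
    if dot > 0.9995:
        # Nearly identical rotations: plain linear blend avoids dividing by ~0.
        q = tuple(a + t * (b - a) for a, b in zip(q0, q1))
    else:
        theta = math.acos(dot)
        s0 = math.sin((1.0 - t) * theta) / math.sin(theta)
        s1 = math.sin(t * theta) / math.sin(theta)
        q = tuple(s0 * a + s1 * b for a, b in zip(q0, q1))
    norm = math.sqrt(sum(c * c for c in q))
    return tuple(c / norm for c in q)


def interpolate_poses(pose_a, pose_b, num_frames):
    """Blend pose_a into pose_b over num_frames frames, endpoints included.

    Each pose is a list with one unit quaternion per joint (hypothetical layout).
    """
    if num_frames < 2:
        raise ValueError("num_frames must be at least 2")
    frames = []
    for i in range(num_frames):
        t = i / (num_frames - 1)
        frames.append([slerp(qa, qb, t) for qa, qb in zip(pose_a, pose_b)])
    return frames


# Example: one joint rotating from identity to 90 degrees about the z-axis.
identity = (1.0, 0.0, 0.0, 0.0)
quarter_turn_z = (math.cos(math.pi / 4), 0.0, 0.0, math.sin(math.pi / 4))
sequence = interpolate_poses([identity], [quarter_turn_z], num_frames=3)
```

In this sketch the middle frame of `sequence` is a 45-degree rotation about the z-axis. Turning such interpolated joint rotations into 3D joint positions (forward kinematics) and into 2D inputs for a lifting network (camera projection) would be separate steps that are not shown here.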

Citation

Pages: 4218-4222

Page count: 5
