LEARNING MONOCULAR 3D HUMAN POSE ESTIMATION WITH SKELETAL INTERPOLATION

Cited by: 2
Authors
Chen, Ziyi [1]
Sugimoto, Akihiro [2]
Lai, Shang-Hong [1]
Affiliations
[1] Natl Tsing Hua Univ, Hsinchu, Taiwan
[2] Natl Inst Informat, Tokyo, Japan
Source
2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2022
Keywords
Data augmentation; skeletal interpolation; transformer; 3D human pose estimation;
DOI
10.1109/ICASSP43922.2022.9746410
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206; 082403;
Abstract
Deep learning has achieved unprecedented accuracy for monocular 3D human pose estimation. However, current learning-based 3D human pose estimation still suffers from poor generalization. Inspired by skeletal animation, which is popular in game development and animation production, we put forward a simple, intuitive, yet effective interpolation-based data augmentation approach that synthesizes continuous and diverse 3D human body sequences to enhance model generalization. The Transformer-based lifting network, trained with the augmented data, uses the self-attention mechanism to perform 2D-to-3D lifting and infers high-quality predictions in the qualitative experiment. The quantitative results of the cross-dataset experiment demonstrate that our resulting model achieves superior generalization accuracy on the publicly available dataset.
Pages: 4218-4222
Number of pages: 5