LEARNING MONOCULAR 3D HUMAN POSE ESTIMATION WITH SKELETAL INTERPOLATION

Cited by: 2
Authors
Chen, Ziyi [1]
Sugimoto, Akihiro [2]
Lai, Shang-Hong [1]
Affiliations
[1] Natl Tsing Hua Univ, Hsinchu, Taiwan
[2] Natl Inst Informat, Tokyo, Japan
Keywords
Data augmentation; skeletal interpolation; transformer; 3D human pose estimation;
DOI
10.1109/ICASSP43922.2022.9746410
Chinese Library Classification (CLC) number
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
Deep learning has achieved unprecedented accuracy for monocular 3D human pose estimation. However, current learning-based 3D human pose estimation still suffers from poor generalization. Inspired by skeletal animation, which is popular in game development and animation production, we put forward a simple, intuitive, yet effective interpolation-based data augmentation approach that synthesizes continuous and diverse 3D human body sequences to enhance model generalization. The Transformer-based lifting network, trained with the augmented data, uses the self-attention mechanism to perform 2D-to-3D lifting and successfully infers high-quality predictions in the qualitative experiment. The quantitative results of the cross-dataset experiment demonstrate that our resulting model achieves superior generalization accuracy on the publicly available dataset.
Pages: 4218 - 4222
Number of pages: 5
Related papers
50 in total
  • [31] Real-Time Reinforcement Learning for Optimal Viewpoint Selection in Monocular 3D Human Pose Estimation
    Lee, Sanghyeon
    Hwang, Yoonho
    Lee, Jong Taek
    IEEE ACCESS, 2024, 12 : 191020 - 191029
  • [32] 3D pose estimation based on multiple monocular cues
    Barrois, Bjoern
    Woehler, Christian
    2007 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOLS 1-8, 2007, : 2724 - +
  • [33] Monocular vehicle pose estimation based on 3D model
    Xu L.-Z.
    Fu Q.-W.
    Tao W.
    Zhao H.
    Guangxue Jingmi Gongcheng/Optics and Precision Engineering, 2021, 29 (06): 1346 - 1355
  • [34] Efficient Monocular Pose Estimation for Complex 3D Models
    Rubio, A.
    Villamizar, M.
    Ferraz, L.
    Penate-Sanchez, A.
    Ramisa, A.
    Simo-Serra, E.
    Sanfeliu, A.
    Moreno-Noguer, F.
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 1397 - 1402
  • [35] 3D Human Pose Estimation in the Wild by Adversarial Learning
    Yang, Wei
    Ouyang, Wanli
    Wang, Xiaolong
    Ren, Jimmy
    Li, Hongsheng
    Wang, Xiaogang
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 5255 - 5264
  • [36] SPATIO-TEMPORAL ATTENTION GRAPH FOR MONOCULAR 3D HUMAN POSE ESTIMATION
    Zhang, Lijun
    Shao, Xiaohu
    Li, Zhenghao
    Zhou, Xiang-Dong
    Shi, Yu
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1231 - 1235
  • [37] Augmented Reality with Human Body Interaction Based on Monocular 3D Pose Estimation
    Lin, Huei-Yung
    Chen, Ting-Wen
    ADVANCED CONCEPTS FOR INTELLIGENT VISION SYSTEMS, PT I, 2010, 6474 : 321 - 331
  • [38] Multi-Person 3D Human Pose Estimation from Monocular Images
    Dabral, Rishabh
    Gundavarapu, Nitesh B.
    Mitra, Rahul
    Sharma, Abhishek
    Ramakrishnan, Ganesh
    Jain, Arjun
    2019 INTERNATIONAL CONFERENCE ON 3D VISION (3DV 2019), 2019, : 405 - 414
  • [39] Sparseness Meets Deepness: 3D Human Pose Estimation from Monocular Video
    Zhou, Xiaowei
    Zhu, Menglong
    Leonardos, Spyridon
    Derpanis, Konstantinos G.
    Daniilidis, Kostas
    2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 4966 - 4975
  • [40] Towards Alleviating the Modeling Ambiguity of Unsupervised Monocular 3D Human Pose Estimation
    Yu, Zhenbo
    Ni, Bingbing
    Xu, Jingwei
    Wang, Junjie
    Zhao, Chenglong
    Zhang, Wenjun
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 8631 - 8640