HiMoReNet: A Hierarchical Model for Human Motion Refinement

被引:3
作者
Wang, Zhiming [1 ,2 ]
Wang, Juan [3 ]
Ge, Ning [1 ,2 ]
Lu, Jianhua [1 ,2 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
关键词
3D human pose estimation; motion refinement; hierarchical architecture; jitter removal; HUMAN POSE; SHAPE; NETWORK;
D O I
10.1109/LSP.2023.3295756
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
3D human pose estimation has a broad range of applications, including anomaly detection and animation creation. Despite that significant progress on relative research has been made during the past decades, producing precise and smooth estimations for input videos still remains challenging mainly because of its ill-posed attributes. In this letter, we propose HiMoReNet, a post-processing motion refinement neural network based on an elaborate hierarchical architecture. Firstly, we distinguish characteristic motion patterns of joints at different locations by grouping the joints and employing respective spatiotemporal processing modules for each group. In addition, by mimicking interactions among multiple body parts, global context information is leveraged to further guide the motion refinement. Quantitative and qualitative results on the 3DPW dataset demonstrate that our proposed HiMoReNet achieves the state-of-the-art performance, and excels in jitter removal and precise pose estimation.
引用
收藏
页码:868 / 872
页数:5
相关论文
共 42 条
  • [1] Ailing Zeng, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12359), P507, DOI 10.1007/978-3-030-58568-6_30
  • [2] SCAPE: Shape Completion and Animation of People
    Anguelov, D
    Srinivasan, P
    Koller, D
    Thrun, S
    Rodgers, J
    Davis, J
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03): : 408 - 416
  • [3] UniPose plus : A Unified Framework for 2D and 3D Human Pose Estimation in Images and Videos
    Artacho, Bruno
    Savakis, Andreas
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9641 - 9653
  • [4] A Bayesian Framework for Sparse Representation-Based 3-D Human Pose Estimation
    Babagholami-Mohamadabadi, Behnam
    Jourabloo, Amin
    Zarghami, Ali
    Kasaei, Shohreh
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (03) : 297 - 300
  • [5] Bai SJ, 2018, Arxiv, DOI [arXiv:1803.01271, 10.48550/arXiv.1803.01271]
  • [6] Casiez G., 2012, P SIGCHI C HUM FACT, P2527
  • [7] Chang-Hung Lee, 2001, Proceedings of the 2001 ACM CIKM. Tenth International Conference on Information and Knowledge Management, P263
  • [8] Cascaded Pyramid Network for Multi-Person Pose Estimation
    Chen, Yilun
    Wang, Zhicheng
    Peng, Yuxiang
    Zhang, Zhiqiang
    Yu, Gang
    Sun, Jian
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
  • [9] Locally Connected Network for Monocular 3D Human Pose Estimation
    Ci, Hai
    Ma, Xiaoxuan
    Wang, Chunyu
    Wang, Yizhou
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1429 - 1442
  • [10] Relation-Based Associative Joint Location for Human Pose Estimation in Videos
    Dang, Yonghao
    Yin, Jianqin
    Zhang, Shaojie
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3973 - 3986