HiMoReNet: A Hierarchical Model for Human Motion Refinement

被引：3

作者：

Wang, Zhiming ^{[1
,2
]}

Wang, Juan ^{[3
]}

Ge, Ning ^{[1
,2
]}

Lu, Jianhua ^{[1
,2
]}

机构：

[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China

[2] Beijing Natl Res Ctr Informat Sci & Technol, Beijing, Peoples R China

[3] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China

来源：

IEEE SIGNAL PROCESSING LETTERS | 2023年 / 30卷

关键词：

3D human pose estimation; motion refinement; hierarchical architecture; jitter removal; HUMAN POSE; SHAPE; NETWORK;

D O I：

10.1109/LSP.2023.3295756

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

3D human pose estimation has a broad range of applications, including anomaly detection and animation creation. Despite that significant progress on relative research has been made during the past decades, producing precise and smooth estimations for input videos still remains challenging mainly because of its ill-posed attributes. In this letter, we propose HiMoReNet, a post-processing motion refinement neural network based on an elaborate hierarchical architecture. Firstly, we distinguish characteristic motion patterns of joints at different locations by grouping the joints and employing respective spatiotemporal processing modules for each group. In addition, by mimicking interactions among multiple body parts, global context information is leveraged to further guide the motion refinement. Quantitative and qualitative results on the 3DPW dataset demonstrate that our proposed HiMoReNet achieves the state-of-the-art performance, and excels in jitter removal and precise pose estimation.

引用

页码：868 / 872

页数：5

共 42 条

[1] Ailing Zeng, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12359), P507, DOI 10.1007/978-3-030-58568-6_30
[2] SCAPE: Shape Completion and Animation of People
Anguelov, D
Srinivasan, P
Koller, D
Thrun, S
Rodgers, J
Davis, J
[J]. ACM TRANSACTIONS ON GRAPHICS, 2005, 24 (03): : 408 - 416
[3] UniPose plus : A Unified Framework for 2D and 3D Human Pose Estimation in Images and Videos
Artacho, Bruno
Savakis, Andreas
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) : 9641 - 9653
[4] A Bayesian Framework for Sparse Representation-Based 3-D Human Pose Estimation
Babagholami-Mohamadabadi, Behnam
Jourabloo, Amin
Zarghami, Ali
Kasaei, Shohreh
[J]. IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (03) : 297 - 300
[5] Bai SJ, 2018, Arxiv, DOI [arXiv:1803.01271, 10.48550/arXiv.1803.01271]
[6] Casiez G., 2012, P SIGCHI C HUM FACT, P2527
[7] Chang-Hung Lee, 2001, Proceedings of the 2001 ACM CIKM. Tenth International Conference on Information and Knowledge Management, P263
[8] Cascaded Pyramid Network for Multi-Person Pose Estimation
Chen, Yilun
Wang, Zhicheng
Peng, Yuxiang
Zhang, Zhiqiang
Yu, Gang
Sun, Jian
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
[9] Locally Connected Network for Monocular 3D Human Pose Estimation
Ci, Hai
Ma, Xiaoxuan
Wang, Chunyu
Wang, Yizhou
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (03) : 1429 - 1442
[10] Relation-Based Associative Joint Location for Human Pose Estimation in Videos
Dang, Yonghao
Yin, Jianqin
Zhang, Shaojie
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 31 : 3973 - 3986

← 1 2 3 4 5 →