Video-based body geometric aware network for 3D man pose estimation

被引:2
|
作者
Li Chaonan [1 ]
Liu Sheng [1 ]
Yao Lu [1 ]
Zou Siyu [1 ]
机构
[1] Zhejiang Univ Technol, Coll Comp Sci & Technol, Hangzhou 310023, Peoples R China
基金
国家重点研发计划;
关键词
A;
D O I
10.1007/s11801-022-2015-8
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
Three-dimensional human pose estimation (3D HPE) has broad application prospects in the fields of trajectory prediction, posture tracking and action analysis. However, the frequent self-occlusions and the substantial depth ambiguity in two-dimensional (2D) representations hinder the further improvement of accuracy. In this paper, we propose a novel video-based human body geometric aware network to mitigate the above problems. Our network can implicitly be aware of the geometric constraints of the human body by capturing spatial and temporal context information from 2D skeleton data. Specifically, a novel skeleton attention (SA) mechanism is proposed to model geometric context dependencies among different body joints, thereby improving the spatial feature representation ability of the network. To enhance the temporal consistency, a novel multilayer perceptron (MLP)-Mixer based structure is exploited to comprehensively learn temporal context information from input sequences. We conduct experiments on publicly available challenging datasets to evaluate the proposed approach. The results outperform the previous best approach by 0.5 mm in the Human3.6m dataset. It also demonstrates significant improvements in HumanEva-I dataset.
引用
收藏
页码:313 / 320
页数:8
相关论文
共 50 条
  • [1] Video-based body geometric aware network for 3D human pose estimation
    Chaonan Li
    Sheng Liu
    Lu Yao
    Siyu Zou
    Optoelectronics Letters, 2022, 18 : 313 - 320
  • [2] Video-based body geometric aware network for 3D human pose estimation
    LI Chaonan
    LIU Sheng
    YAO Lu
    ZOU Siyu
    OptoelectronicsLetters, 2022, 18 (05) : 313 - 320
  • [3] Video-Based 3D pose estimation for residential roofing
    Wang, Ruochen
    Zheng, Liying
    Hawke, Ashley L.
    Carey, Robert E.
    Breloff, Scott P.
    Li, Kang
    Peng, Xi
    COMPUTER METHODS IN BIOMECHANICS AND BIOMEDICAL ENGINEERING-IMAGING AND VISUALIZATION, 2023, 11 (03): : 369 - 377
  • [4] Video-Based 3D Human Pose Estimation Research
    Tao, Siting
    Zhang, Zhi
    2022 IEEE 17TH CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2022, : 485 - 490
  • [5] Spatiotemporal Neural Network for Video-Based Pose Estimation
    Ji, Bin
    Pan, Ye
    Jin, Xiaogang
    Yang, Xubo
    Jisuanji Fuzhu Sheji Yu Tuxingxue Xuebao/Journal of Computer-Aided Design and Computer Graphics, 2022, 34 (02): : 189 - 197
  • [6] Multiview Video-Based 3-D Hand Pose Estimation
    Khaleghi L.
    Sepas-Moghaddam A.
    Marshall J.
    Etemad A.
    IEEE Transactions on Artificial Intelligence, 2023, 4 (04): : 896 - 909
  • [7] A self-supervised spatio-temporal attention network for video-based 3D infant pose estimation
    Yin, Wang
    Chen, Linxi
    Huang, Xinrui
    Huang, Chunling
    Wang, Zhaohong
    Bian, Yang
    Wan, You
    Zhou, Yuan
    Han, Tongyan
    Yi, Ming
    MEDICAL IMAGE ANALYSIS, 2024, 96
  • [8] Global-to-Local Modeling for Video-based 3D Human Pose and Shape Estimation
    Shen, Xiaolong
    Yang, Zongxin
    Wang, Xiaohan
    Ma, Jianxin
    Zhou, Chang
    Yang, Yi
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 8887 - 8896
  • [9] Kinematics modeling network for video-based human pose estimation
    Dang, Yonghao
    Yin, Jianqin
    Zhang, Shaojie
    Liu, Jiping
    Hu, Yanzhu
    PATTERN RECOGNITION, 2024, 150
  • [10] Occlusion-Aware Networks for 3D Human Pose Estimation in Video
    Cheng, Yu
    Yang, Bo
    Wang, Bo
    Yan, Wending
    Tan, Robby T.
    2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 723 - 732