Dual-view 3D human pose estimation without camera parameters for action recognition

Cited by: 8
Authors
Liu, Long [1]
Yang, Le [1]
Chen, Wanjun [2]
Gao, Xin [1]
Affiliations
[1] Xian Univ Technol, Sch Automat & Informat Engn, 5 Jinhua South Rd, Xian, Shaanxi, Peoples R China
[2] Xian Univ Technol, Dept Informat Sci, Xian, Shaanxi, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
Cameras - Virtual reality;
DOI
10.1049/ipr2.12277
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
3D human pose estimation aims to estimate the 3D coordinates of human body key points directly from images. Multi-view methods estimate coordinates more accurately than single-view methods, but they require known camera parameters. To remove this constraint and improve the generalizability of the model, a dual-view, single-person 3D pose estimation method that needs no camera parameters is proposed. The method first uses the 2D pose estimation network HR-net to estimate the 2D joint coordinates from two images taken from different views, and then feeds them into a 3D regression network to generate the final 3D joint coordinates. So that the 3D regression network fully learns the spatial structure of the human body and the projection relationship between different views, a self-supervised training method is designed that generates virtual views from an orthogonal projection model of the 3D human pose. In pose estimation experiments on the Human3.6M dataset, the method achieves an estimation error of 34.5 mm, reported as a significant improvement. Furthermore, action recognition performed on the human poses extracted by the proposed method reaches an accuracy of 83.19%.
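The virtual-view idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes the virtual camera is obtained by rotating the pose about the vertical (y) axis and that the orthogonal projection simply drops the depth coordinate with unit scale. The function names and the toy three-joint pose are hypothetical.

```python
import math


def rotate_about_vertical(pose_3d, angle_rad):
    """Rotate every joint (x, y, z) about the vertical y-axis by angle_rad."""
    c, s = math.cos(angle_rad), math.sin(angle_rad)
    return [(c * x + s * z, y, -s * x + c * z) for x, y, z in pose_3d]


def orthographic_project(pose_3d):
    """Orthogonal projection: drop the depth coordinate z, keep (x, y)."""
    return [(x, y) for x, y, _ in pose_3d]


def virtual_view(pose_3d, angle_rad):
    """2D joints as seen by a virtual orthographic camera rotated by angle_rad."""
    return orthographic_project(rotate_about_vertical(pose_3d, angle_rad))


# Toy three-joint "pose" in metres (illustrative values, not dataset data).
pose = [(0.0, 1.7, 0.0), (0.2, 1.4, 0.1), (-0.2, 1.4, -0.1)]
front = virtual_view(pose, 0.0)          # original viewing direction
side = virtual_view(pose, math.pi / 2)   # virtual view rotated by 90 degrees
```

In a self-supervised setup of this kind, pairs such as `front` and `side` could serve as synthetic dual-view 2D inputs whose underlying 3D pose is known, without any real camera parameters being involved.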
Pages: 3433-3440
Page count: 8
Related papers
50 in total
  • [31] Multi-view Pictorial Structures for 3D Human Pose Estimation
    Amin, Sikandar
    Andriluka, Mykhaylo
    Rohrbach, Marcus
    Schiele, Bernt
    PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2013, 2013,
  • [32] Multi-view 3D Human Pose Estimation in Complex Environment
    Hofmann, M.
    Gavrila, D. M.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (01) : 103 - 124
  • [33] PROGRESSIVE MULTI-VIEW FUSION FOR 3D HUMAN POSE ESTIMATION
    Zhang, Lijun
    Zhou, Kangkang
    Liu, Liangchen
    Li, Zhenghao
    Zhao, Xunyi
    Zhou, Xiang-Dong
    Shi, Yu
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1600 - 1604
  • [34] Multi-view 3D Human Pose Estimation in Complex Environment
    M. Hofmann
    D. M. Gavrila
    International Journal of Computer Vision, 2012, 96 : 103 - 124
  • [35] Generative Multi-View Based 3D Human Pose Estimation
    Sabri, Motaz
    PROCEEDINGS OF 2021 INTERNATIONAL CONFERENCE ON SUSTAINABLE INFORMATION ENGINEERING AND TECHNOLOGY, SIET 2021, 2021, : 2 - 9
  • [36] View consistency aware holistic triangulation for 3D human pose estimation
    Wan, Xiaoyue
    Chen, Zhuo
    Zhao, Xu
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2023, 236
  • [37] Markerless multi-view 3D human pose estimation: A survey
    Nogueira, Ana Filipa Rodrigues
    Oliveira, Helder P.
    Teixeira, Luis F.
    IMAGE AND VISION COMPUTING, 2025, 155
  • [38] 3d human pose estimation based on multi view information fusion
    Zhang, Shuo
    Liu, Ming
    Zhao, Yuejin
    Dong, Liquan
    Kong, Lingqin
    OPTICAL METROLOGY AND INSPECTION FOR INDUSTRIAL APPLICATIONS IX, 2022, 12319
  • [39] ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting
    Zheng, Hongwei
    Li, Han
    Shi, Bowen
    Dai, Wenrui
    Wang, Botao
    Sun, Yu
    Guo, Min
    Xiong, Hongkai
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2657 - 2662
  • [40] 3D Human Pose Estimation from Deep Multi-View 2D Pose
    Schwarcz, Steven
    Pollard, Thomas
    2018 24TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2018, : 2326 - 2331