Semantic-Structural Graph Convolutional Networks for Whole-Body Human Pose Estimation

被引:0
作者
Li, Weiwei [1 ,2 ]
Du, Rong [1 ,2 ]
Chen, Shudong [1 ,2 ]
机构
[1] Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China
[2] Univ Chinese Acad Sci, Beijing 100864, Peoples R China
关键词
human pose estimation; graph convolutional networks; non-local mechanics; feature embedding; FACE ALIGNMENT;
D O I
10.3390/info13030109
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Existing whole-body human pose estimation methods mostly segment the parts of the body's hands and feet for specific processing, which not only splits the overall semantics of the body, but also increases the amount of calculation and the complexity of the model. To address these drawbacks, we designed a novel semantic-structural graph convolutional network (SSGCN) for whole-body human pose estimation tasks, which leverages the whole-body graph structure to analyze the semantics of the whole-body keypoints through a graph convolutional network and improves the accuracy of pose estimation. Firstly, we introduced a novel heat-map-based keypoint embedding, which encodes the position information and feature information of the keypoints of the human body. Secondly, we propose a novel semantic-structural graph convolutional network consisting of several sets of cascaded structure-based graph layers and data-dependent whole-body non-local layers. Specifically, the proposed method extracts groups of keypoints and constructs a high-level abstract body graph to process the high-level semantic information of the whole-body keypoints. The experimental results showed that our method achieved very promising results on the challenging COCO whole-body dataset.
引用
收藏
页数:14
相关论文
共 44 条
  • [1] UniPose: Unified Human Pose Estimation in Single Images and Videos
    Artacho, Bruno
    Savakis, Andreas
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7033 - 7042
  • [2] Athitsos V, 2003, PROC CVPR IEEE, P432
  • [3] A non-local algorithm for image denoising
    Buades, A
    Coll, B
    Morel, JM
    [J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 60 - 65
  • [4] Face Alignment by Explicit Shape Regression
    Cao, Xudong
    Wei, Yichen
    Wen, Fang
    Sun, Jian
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 107 (02) : 177 - 190
  • [5] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
    Cao, Zhe
    Hidalgo, Gines
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
  • [6] Choi Hongsuk, 2020, P EUR C COMP VIS
  • [7] Cimen G., 2018, P 12 INT C COMP GRAP
  • [8] Model-Based 3D Hand Pose Estimation from Monocular Video
    de La Gorce, Martin
    Fleet, David J.
    Paragios, Nikos
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (09) : 1793 - 1805
  • [9] RetinaFace: Single-shot Multi-level Face Localisation in the Wild
    Deng, Jiankang
    Guo, Jia
    Ververas, Evangelos
    Kotsia, Irene
    Zafeiriou, Stefanos
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5202 - 5211
  • [10] Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714