Semantic-Structural Graph Convolutional Networks for Whole-Body Human Pose Estimation

被引：0

作者：

Li, Weiwei ^{[1
,2
]}

Du, Rong ^{[1
,2
]}

Chen, Shudong ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Inst Microelect, Beijing 100029, Peoples R China

[2] Univ Chinese Acad Sci, Beijing 100864, Peoples R China

来源：

INFORMATION | 2022年 / 13卷 / 03期

关键词：

human pose estimation; graph convolutional networks; non-local mechanics; feature embedding; FACE ALIGNMENT;

D O I：

10.3390/info13030109

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Existing whole-body human pose estimation methods mostly segment the parts of the body's hands and feet for specific processing, which not only splits the overall semantics of the body, but also increases the amount of calculation and the complexity of the model. To address these drawbacks, we designed a novel semantic-structural graph convolutional network (SSGCN) for whole-body human pose estimation tasks, which leverages the whole-body graph structure to analyze the semantics of the whole-body keypoints through a graph convolutional network and improves the accuracy of pose estimation. Firstly, we introduced a novel heat-map-based keypoint embedding, which encodes the position information and feature information of the keypoints of the human body. Secondly, we propose a novel semantic-structural graph convolutional network consisting of several sets of cascaded structure-based graph layers and data-dependent whole-body non-local layers. Specifically, the proposed method extracts groups of keypoints and constructs a high-level abstract body graph to process the high-level semantic information of the whole-body keypoints. The experimental results showed that our method achieved very promising results on the challenging COCO whole-body dataset.

引用

页数：14

共 44 条

[1] UniPose: Unified Human Pose Estimation in Single Images and Videos
Artacho, Bruno
Savakis, Andreas
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 7033 - 7042
[2] Athitsos V, 2003, PROC CVPR IEEE, P432
[3] A non-local algorithm for image denoising
Buades, A
Coll, B
Morel, JM
[J]. 2005 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, PROCEEDINGS, 2005, : 60 - 65
[4] Face Alignment by Explicit Shape Regression
Cao, Xudong
Wei, Yichen
Wen, Fang
Sun, Jian
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 107 (02) : 177 - 190
[5] OpenPose: Realtime Multi-Person 2D Pose Estimation Using Part Affinity Fields
Cao, Zhe
Hidalgo, Gines
Simon, Tomas
Wei, Shih-En
Sheikh, Yaser
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (01) : 172 - 186
[6] Choi Hongsuk, 2020, P EUR C COMP VIS
[7] Cimen G., 2018, P 12 INT C COMP GRAP
[8] Model-Based 3D Hand Pose Estimation from Monocular Video
de La Gorce, Martin
Fleet, David J.
Paragios, Nikos
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (09) : 1793 - 1805
[9] RetinaFace: Single-shot Multi-level Face Localisation in the Wild
Deng, Jiankang
Guo, Jia
Ververas, Evangelos
Kotsia, Irene
Zafeiriou, Stefanos
[J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5202 - 5211
[10] Du Y, 2015, PROC CVPR IEEE, P1110, DOI 10.1109/CVPR.2015.7298714

← 1 2 3 4 5 →