HKE-GCN: Heatmaps-guided Keypoints Encoder and Graph Convolutional Network for Human Pose Estimation

Cited by: 4

Authors:

Xia, Han [1]

Wang, Yiran [2]

Wang, Xiaoru [1]

Xiong, Songkai [1]

Yu, Zhihong [3]

Affiliations:

[1] Beijing Univ Posts & Telecommun, Beijing, Peoples R China

[2] Beijing Forestry Univ, Beijing, Peoples R China

[3] Intel China Res Ctr, Beijing, Peoples R China

Source:

2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN) | 2022

Funding:

National Natural Science Foundation of China

Keywords:

Human Pose Estimation; Heatmaps-guided Keypoints Encoder; Graph Convolutional Network;

DOI:

10.1109/IJCNN55064.2022.9892251

Chinese Library Classification:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Multi-person pose estimation is a challenging task that aims to locate keypoints for multiple persons. A graph convolutional network (GCN) can effectively capture the semantic relationships among keypoints according to the kinematic structure of the human body, which helps locate keypoints and is a capability that most CNN-based models lack. However, existing GCN-based methods mostly flatten the 2D features directly to obtain 1D embeddings, which leads to redundant information in the keypoint embeddings, large embedding sizes, and high computation cost. To address these problems, we propose a two-stage framework based on a Heatmaps-guided Keypoints Encoder and a graph convolutional network, called HKE-GCN. The first stage uses a heatmap-based network to predict keypoint heatmaps; the second stage refines the first stage's predictions. The second stage consists of two modules: the Heatmaps-guided Keypoints Encoder (HKE), which generates keypoint embeddings under the guidance of the heatmaps, and the Graph-based Refinement Module (GRM), which explicitly learns the relationships among keypoints using a GCN. Experiments show that our framework is model-agnostic and that the proposed modules are effective and lightweight. Our best model achieves a state-of-the-art 76.4 AP on COCO test-dev.
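
The second-stage idea described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not specify how HKE or GRM are built, so the softmax-weighted pooling, the single graph-convolution layer, the simplified edge list, the tensor sizes, and all function names below are assumptions chosen for illustration. The sketch only shows why heatmap-guided pooling gives much smaller embeddings than flattening (here 32 values per keypoint instead of 64 × 48 = 3072) and how a GCN layer mixes information along skeleton edges.

```python
import numpy as np

# Simplified skeleton over the 17 COCO keypoints (0-indexed, COCO order).
# Assumed edge list for illustration; the paper's graph may differ.
COCO_EDGES = [
    (0, 1), (0, 2), (1, 3), (2, 4),           # nose, eyes, ears
    (5, 6), (5, 7), (7, 9), (6, 8), (8, 10),  # shoulders and arms
    (5, 11), (6, 12), (11, 12),               # torso
    (11, 13), (13, 15), (12, 14), (14, 16),   # legs
]

def heatmap_guided_embeddings(features, heatmaps):
    """Pool a (C, H, W) feature map into (K, C) embeddings, one per keypoint.

    Each of the K heatmaps becomes a set of spatial weights via a softmax,
    and the weights average the features. Assumed pooling rule, not
    necessarily the paper's HKE.
    """
    C, H, W = features.shape
    K = heatmaps.shape[0]
    logits = heatmaps.reshape(K, H * W)
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(logits)
    weights /= weights.sum(axis=1, keepdims=True)        # rows sum to 1
    return weights @ features.reshape(C, H * W).T        # (K, C)

def normalized_adjacency(num_nodes, edges):
    """Symmetric normalization D^{-1/2} (A + I) D^{-1/2} of the skeleton graph."""
    A = np.eye(num_nodes)
    for i, j in edges:
        A[i, j] = A[j, i] = 1.0
    d_inv_sqrt = 1.0 / np.sqrt(A.sum(axis=1))
    return A * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]

def gcn_layer(X, A_hat, W):
    """One graph-convolution layer with ReLU: relu(A_hat @ X @ W)."""
    return np.maximum(A_hat @ X @ W, 0.0)

rng = np.random.default_rng(0)
features = rng.standard_normal((32, 64, 48))   # hypothetical backbone features
heatmaps = rng.standard_normal((17, 64, 48))   # hypothetical stage-one heatmaps

emb = heatmap_guided_embeddings(features, heatmaps)
A_hat = normalized_adjacency(17, COCO_EDGES)
refined = gcn_layer(emb, A_hat, rng.standard_normal((32, 16)) * 0.1)
print(emb.shape, refined.shape)
```

In a real system the layer weights would be learned and the refined embeddings would be decoded back into keypoint coordinates or heatmap corrections; that decoding step is omitted here because the abstract does not describe it.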


Pages: 8

