Iterative graph filtering network for 3D human pose estimation

被引：2

作者：

Islam, Zaedul ^{[1
]}

Ben Hamza, A. ^{[1
]}

机构：

[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2023年 / 95卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

Human pose estimation; Graph regularization; Gauss-Seidel; Modulation; Skip connection; NEURAL-NETWORKS;

D O I：

10.1016/j.jvcir.2023.103908

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Graph convolutional networks (GCNs) have proven to be an effective approach for 3D human pose estimation. By naturally modeling the skeleton structure of the human body as a graph, GCNs are able to capture the spatial relationships between joints and learn an efficient representation of the underlying pose. However, most GCN-based methods use a shared weight matrix, making it challenging to accurately capture the different and complex relationships between joints. In this paper, we introduce an iterative graph filtering framework for 3D human pose estimation, which aims to predict the 3D joint positions given a set of 2D joint locations in images. Our approach builds upon the idea of iteratively solving graph filtering with Laplacian regularization via the Gauss-Seidel iterative method. Motivated by this iterative solution, we design a Gauss-Seidel network (GS-Net) architecture, which makes use of weight and adjacency modulation, skip connection, and a pure convolutional block with layer normalization. Adjacency modulation facilitates the learning of edges that go beyond the inherent connections of body joints, resulting in an adjusted graph structure that reflects the human skeleton, while skip connections help maintain crucial information from the input layer's initial features as the network depth increases. We evaluate our proposed model on two standard benchmark datasets, and compare it with a comprehensive set of strong baseline methods for 3D human pose estimation. Our experimental results demonstrate that our approach outperforms the baseline methods on both datasets, achieving state-of-the-art performance. Furthermore, we conduct ablation studies to analyze the contributions of different components of our model architecture and show that the skip connection and adjacency modulation help improve the model performance.

引用

页数：12

共 72 条

[1] Ailing Zeng, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12359), P507, DOI 10.1007/978-3-030-58568-6_30
[2] 3D HUMAN POSE REGRESSION USING GRAPH CONVOLUTIONAL NETWORK
Banik, Soubarna
GarcIa, Alejandro Mendoza
Knoll, Alois
[J]. 2021 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2021, : 924 - 928
[3] Exploiting Spatial-temporal Relationships for 3D Pose Estimation via Graph Convolutional Networks
Cai, Yujun
Ge, Liuhao
Liu, Jun
Cai, Jianfei
Cham, Tat-Jen
Yuan, Junsong
Thalmann, Nadia Magnenat
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2272 - 2281
[4] 3D Human Pose Estimation=2D Pose Estimation plus Matching
Chen, Ching-Hang
Ramanan, Deva
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5759 - 5767
[5] Chen M, 2020, PR MACH LEARN RES, V119
[6] Bag of Tricks for Training Deeper Graph Neural Networks: A Comprehensive Benchmark Study
Chen, Tianlong
Zhou, Kaixiong
Duan, Keyu
Zheng, Wenqing
Wang, Peihao
Hu, Xia
Wang, Zhangyang
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (03) : 2769 - 2781
[7] Cascaded Pyramid Network for Multi-Person Pose Estimation
Chen, Yilun
Wang, Zhicheng
Peng, Yuxiang
Zhang, Zhiqiang
Yu, Gang
Sun, Jian
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7103 - 7112
[8] Choi Hongsuk, 2020, EUROPEAN C COMPUTER, DOI DOI 10.1007/978-3-030-58571-6_45
[9] Optimizing Network Structure for 3D Human Pose Estimation
Ci, Hai
Wang, Chunyu
Ma, Xiaoxuan
Wang, Yizhou
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 2262 - 2271
[10] Fang HS, 2018, AAAI CONF ARTIF INTE, P6821

← 1 2 3 4 5 6 7 8 →