Multi-hop graph transformer network for 3D human pose estimation

被引：4

作者：

Islam, Zaedul ^{[1
]}

Ben Hamza, A. ^{[1
]}

机构：

[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada

来源：

JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION | 2024年 / 101卷

基金：

加拿大自然科学与工程研究理事会;

关键词：

3D human pose estimation; Graph convolutional network; Transformer; Multi-hop; Dilated convolution;

D O I：

10.1016/j.jvcir.2024.104174

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Accurate 3D human pose estimation is a challenging task due to occlusion and depth ambiguity. In this paper, we introduce a multi -hop graph transformer network designed for 2D -to -3D human pose estimation in videos by leveraging the strengths of multi-head self-attention and multi -hop graph convolutional networks with disentangled neighborhoods to capture spatio-temporal dependencies and handle long-range interactions. The proposed network architecture consists of a graph attention block composed of stacked layers of multi-head self-attention and graph convolution with learnable adjacency matrix, and a multi -hop graph convolutional block comprised of multi -hop convolutional and dilated convolutional layers. The combination of multi-head self-attention and multi -hop graph convolutional layers enables the model to capture both local and global dependencies, while the integration of dilated convolutional layers enhances the model's ability to handle spatial details required for accurate localization of the human body joints. Extensive experiments demonstrate the effectiveness and generalization ability of our model, achieving competitive performance on benchmark datasets.

引用

页数：12

共 50 条

[41] Group Spatial Attention for 3D Human Pose Estimation
Tran, Tien-Dat
Cao, Ge
Ashraf, Russo
Jo, Kang-Hyun
2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024, 2024,
[42] MixPose: 3D Human Pose Estimation with Mixed Encoder
Cheng, Jisheng
Cheng, Qin
Yang, Mengjie
Liu, Zhen
Zhang, Qieshi
Cheng, Jun
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 353 - 364
[43] 3D Hand Pose Estimation via Graph-Based Reasoning
Song, Jae-Hun
Kang, Suk-Ju
IEEE ACCESS, 2021, 9 : 35824 - 35833
[44] Study of Adaptive Routing with Multi-Hop Diversity in Multi-Hop Virtual Cellular Network
Wang Jinpeng
Zhang Shufang
Zhang Jingbo
2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 238 - 241
[45] GMDN: A lightweight graph-based mixture density network for 3D human pose regression
Zou, Lu
Huang, Zhangjin
Gu, Naijie
Wang, Fangjun
Yang, Zhouwang
Wang, Guoping
COMPUTERS & GRAPHICS-UK, 2021, 95 : 115 - 122
[46] Multi-scale Feature Injection for Occluded 3D Human Pose and Shape Estimation
Shi, Yunhui
Ge, Yangyang
Wang, Jin
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4881 - 4886
[47] Shift Pose: A Lightweight Transformer-like Neural Network for Human Pose Estimation
Chen, Haijian
Jiang, Xinyun
Dai, Yonghui
SENSORS, 2022, 22 (19)
[48] Graph U-Shaped Network with Mapping-Aware Local Enhancement for Single-Frame 3D Human Pose Estimation
Yu, Bing
Huang, Yan
Cheng, Guang
Huang, Dongjin
Ding, Youdong
ELECTRONICS, 2023, 12 (19)
[49] Towards Accurate Microstructure Estimation via 3D Hybrid Graph Transformer
Yang, Junqing
Jiang, Haotian
Tassew, Tewodros
Sun, Peng
Ma, Jiquan
Xia, Yong
Yap, Pew-Thian
Chen, Geng
MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VIII, 2023, 14227 : 25 - 34
[50] EMHIFormer: An Enhanced Multi-Hypothesis Interaction Transformer for 3D human estimation in video✩
Xiang, Xuezhi
Zhang, Kaixu
Qiao, Yulong
El Saddik, Abdulmotaleb
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95

← 1 2 3 4 5 →