Multi-hop graph transformer network for 3D human pose estimation

Cited by: 4
Authors
Islam, Zaedul [1]
Ben Hamza, A. [1]
Affiliations
[1] Concordia Univ, Concordia Inst Informat Syst Engn, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
3D human pose estimation; Graph convolutional network; Transformer; Multi-hop; Dilated convolution
DOI
10.1016/j.jvcir.2024.104174
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Accurate 3D human pose estimation is a challenging task due to occlusion and depth ambiguity. In this paper, we introduce a multi-hop graph transformer network designed for 2D-to-3D human pose estimation in videos by leveraging the strengths of multi-head self-attention and multi-hop graph convolutional networks with disentangled neighborhoods to capture spatio-temporal dependencies and handle long-range interactions. The proposed network architecture consists of a graph attention block composed of stacked layers of multi-head self-attention and graph convolution with a learnable adjacency matrix, and a multi-hop graph convolutional block composed of multi-hop convolutional and dilated convolutional layers. The combination of multi-head self-attention and multi-hop graph convolutional layers enables the model to capture both local and global dependencies, while the integration of dilated convolutional layers enhances the model's ability to handle spatial details required for accurate localization of the human body joints. Extensive experiments demonstrate the effectiveness and generalization ability of our model, achieving competitive performance on benchmark datasets.
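To make the phrase "multi-hop graph convolution with disentangled neighborhoods" concrete, the following is a minimal sketch of one common reading of that idea, not the authors' implementation. It assumes that "disentangled" means each hop k uses only the joints at exactly k hops (shortest-path distance) from the target joint, so that near neighbors are not counted again at larger hops. The 5-joint skeleton, the fixed scalar `HOP_WEIGHTS` (standing in for learnable weight matrices), and all function names are hypothetical and chosen for illustration only.

```python
from collections import deque


def hop_distances(num_nodes, edges):
    """Shortest-path hop count between every pair of nodes, via BFS."""
    neighbors = [[] for _ in range(num_nodes)]
    for a, b in edges:
        neighbors[a].append(b)
        neighbors[b].append(a)
    dist = [[-1] * num_nodes for _ in range(num_nodes)]
    for src in range(num_nodes):
        dist[src][src] = 0
        queue = deque([src])
        while queue:
            u = queue.popleft()
            for v in neighbors[u]:
                if dist[src][v] < 0:
                    dist[src][v] = dist[src][u] + 1
                    queue.append(v)
    return dist


def disentangled_adjacency(dist, k):
    """A_k[i][j] = 1 only when joint j is exactly k hops from joint i."""
    n = len(dist)
    return [[1.0 if dist[i][j] == k else 0.0 for j in range(n)]
            for i in range(n)]


def multi_hop_aggregate(features, dist, hop_weights):
    """Sum over hops k of hop_weights[k] * (row-normalized A_k @ features)."""
    n, d = len(features), len(features[0])
    out = [[0.0] * d for _ in range(n)]
    for k, weight in enumerate(hop_weights):
        a_k = disentangled_adjacency(dist, k)
        for i in range(n):
            degree = sum(a_k[i])
            if degree == 0:
                continue  # no joint lies exactly k hops away from joint i
            for j in range(n):
                if a_k[i][j]:
                    for c in range(d):
                        out[i][c] += weight * features[j][c] / degree
    return out


# Hypothetical toy skeleton: chain 0-1-2 with a branch 1-3-4.
EDGES = [(0, 1), (1, 2), (1, 3), (3, 4)]
DIST = hop_distances(5, EDGES)

# Assumed 2D joint coordinates used as per-joint input features.
POSE_2D = [[0.0, 0.0], [0.0, 1.0], [0.0, 2.0], [1.0, 1.0], [2.0, 1.0]]

# Assumed fixed scalar weights for hops 0, 1, 2 (learnable in a real model).
HOP_WEIGHTS = [0.5, 0.3, 0.2]

MIXED = multi_hop_aggregate(POSE_2D, DIST, HOP_WEIGHTS)
```

In this sketch, `disentangled_adjacency(DIST, 2)[0]` selects only joints 2 and 3 for joint 0, so the 2-hop term carries information that the 1-hop term does not. A trained network would replace the scalar hop weights with learned projections and add the attention and dilated convolution layers mentioned in the abstract.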
Pages: 12
Related papers
50 items in total
  • [41] Group Spatial Attention for 3D Human Pose Estimation
    Tran, Tien-Dat
    Cao, Ge
    Ashraf, Russo
    Jo, Kang-Hyun
    2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024, 2024,
  • [42] MixPose: 3D Human Pose Estimation with Mixed Encoder
    Cheng, Jisheng
    Cheng, Qin
    Yang, Mengjie
    Liu, Zhen
    Zhang, Qieshi
    Cheng, Jun
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT VIII, 2024, 14432 : 353 - 364
  • [43] 3D Hand Pose Estimation via Graph-Based Reasoning
    Song, Jae-Hun
    Kang, Suk-Ju
    IEEE ACCESS, 2021, 9 : 35824 - 35833
  • [44] Study of Adaptive Routing with Multi-Hop Diversity in Multi-Hop Virtual Cellular Network
    Wang Jinpeng
    Zhang Shufang
    Zhang Jingbo
    2009 5TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-8, 2009, : 238 - 241
  • [45] GMDN: A lightweight graph-based mixture density network for 3D human pose regression
    Zou, Lu
    Huang, Zhangjin
    Gu, Naijie
    Wang, Fangjun
    Yang, Zhouwang
    Wang, Guoping
    COMPUTERS & GRAPHICS-UK, 2021, 95 : 115 - 122
  • [46] Multi-scale Feature Injection for Occluded 3D Human Pose and Shape Estimation
    Shi, Yunhui
    Ge, Yangyang
    Wang, Jin
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 4881 - 4886
  • [47] Shift Pose: A Lightweight Transformer-like Neural Network for Human Pose Estimation
    Chen, Haijian
    Jiang, Xinyun
    Dai, Yonghui
    SENSORS, 2022, 22 (19)
  • [48] Graph U-Shaped Network with Mapping-Aware Local Enhancement for Single-Frame 3D Human Pose Estimation
    Yu, Bing
    Huang, Yan
    Cheng, Guang
    Huang, Dongjin
    Ding, Youdong
    ELECTRONICS, 2023, 12 (19)
  • [49] Towards Accurate Microstructure Estimation via 3D Hybrid Graph Transformer
    Yang, Junqing
    Jiang, Haotian
    Tassew, Tewodros
    Sun, Peng
    Ma, Jiquan
    Xia, Yong
    Yap, Pew-Thian
    Chen, Geng
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2023, PT VIII, 2023, 14227 : 25 - 34
  • [50] EMHIFormer: An Enhanced Multi-Hypothesis Interaction Transformer for 3D human estimation in video
    Xiang, Xuezhi
    Zhang, Kaixu
    Qiao, Yulong
    El Saddik, Abdulmotaleb
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2023, 95