Group Spatial Attention for 3D Human Pose Estimation

被引:0
作者
Tran, Tien-Dat [1 ]
Cao, Ge [1 ]
Ashraf, Russo [1 ]
Jo, Kang-Hyun [1 ]
机构
[1] Univ Ulsan, Sch Elect Engn, Ulsan 44610, South Korea
来源
2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024 | 2024年
基金
新加坡国家研究基金会;
关键词
3D Human pose estimation; efficient attention module; transformer;
D O I
10.1109/ISIE54533.2024.10595678
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This paper introduces a novel Group Spatial Attention Module (GSAM) for enhancing 3D Human Pose Estimation (3DHPE) accuracy in complex scenes. Traditional 3DHPE approaches often struggle with occlusions and varied human poses, leading to decreased precision. GSAM addresses these challenges by leveraging group spatial attention mechanisms that dynamically focus on relevant spatial features and interactions among multiple figures within a scene. Our method incorporates a deep learning architecture that integrates GSAM with a state-of-the-art 3DHPE framework, facilitating the extraction of rich, contextual spatial information. We evaluate our approach on standard benchmarks, including Human3.6M and MPI-INF-3DHP, demonstrating significant improvements over existing methods in terms of accuracy and robustness against occlusions and pose variations. GSAM sets a new standard for 3DHPE, offering substantial advancements for applications in augmented reality, surveillance, and interactive systems.
引用
收藏
页数:7
相关论文
共 19 条
  • [1] [Anonymous], 2018, A guide to convolution arithmetic for deep learning
  • [2] Human Pose Estimation via Convolutional Part Heatmap Regression
    Bulat, Adrian
    Tzimiropoulos, Georgios
    [J]. COMPUTER VISION - ECCV 2016, PT VII, 2016, 9911 : 717 - 732
  • [3] Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields
    Cao, Zhe
    Simon, Tomas
    Wei, Shih-En
    Sheikh, Yaser
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1302 - 1310
  • [4] 3D Human Pose Estimation=2D Pose Estimation plus Matching
    Chen, Ching-Hang
    Ramanan, Deva
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 5759 - 5767
  • [5] Hussain Z., 2019, Different approaches for human activity recognition: A survey
  • [6] DeeperCut: A Deeper, Stronger, and Faster Multi-person Pose Estimation Model
    Insafutdinov, Eldar
    Pishchulin, Leonid
    Andres, Bjoern
    Andriluka, Mykhaylo
    Schiele, Bernt
    [J]. COMPUTER VISION - ECCV 2016, PT VI, 2016, 9910 : 34 - 50
  • [7] Ioffe S., 2015, Proceedings of Machine Learning Research, P448
  • [8] Human Activity Recognition and Pattern Discovery
    Kim, Eunju
    Helal, Sumi
    Cook, Diane
    [J]. IEEE PERVASIVE COMPUTING, 2010, 9 (01) : 48 - 53
  • [9] Li W., 2012, Computer Vision-ACCV 2012, P31
  • [10] Microsoft COCO: Common Objects in Context
    Lin, Tsung-Yi
    Maire, Michael
    Belongie, Serge
    Hays, James
    Perona, Pietro
    Ramanan, Deva
    Dollar, Piotr
    Zitnick, C. Lawrence
    [J]. COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 : 740 - 755