Group Spatial Attention for 3D Human Pose Estimation

Cited by: 0

Authors:

Tran, Tien-Dat [1]

Cao, Ge [1]

Ashraf, Russo [1]

Jo, Kang-Hyun [1]

Affiliations:

[1] Univ Ulsan, Sch Elect Engn, Ulsan 44610, South Korea

Source:

2024 33RD INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS, ISIE 2024 | 2024

Funding:

National Research Foundation, Singapore;

Keywords:

3D Human pose estimation; efficient attention module; transformer;

DOI:

10.1109/ISIE54533.2024.10595678

Chinese Library Classification (CLC) number:

TP39 [Computer applications];

Subject classification codes:

081203; 0835;

Abstract:

This paper introduces a novel Group Spatial Attention Module (GSAM) for enhancing 3D Human Pose Estimation (3DHPE) accuracy in complex scenes. Traditional 3DHPE approaches often struggle with occlusions and varied human poses, leading to decreased precision. GSAM addresses these challenges by leveraging group spatial attention mechanisms that dynamically focus on relevant spatial features and interactions among multiple figures within a scene. Our method incorporates a deep learning architecture that integrates GSAM with a state-of-the-art 3DHPE framework, facilitating the extraction of rich, contextual spatial information. We evaluate our approach on standard benchmarks, including Human3.6M and MPI-INF-3DHP, demonstrating significant improvements over existing methods in terms of accuracy and robustness against occlusions and pose variations. GSAM sets a new standard for 3DHPE, offering substantial advancements for applications in augmented reality, surveillance, and interactive systems.


Pages: 7
