Learning Sequential Contexts using Transformer for 3D Hand Pose Estimation

Cited by: 1

Authors:

Khaleghi, Leyla [1,2]

Marshall, Joshua [1,2]

Etemad, Ali [1,2]

Affiliations:

[1] Queens Univ Kingston, Dept ECE, Kingston, ON, Canada

[2] Queens Univ Kingston, Ingenu Labs, Res Inst, Kingston, ON, Canada

Source:

2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR) | 2022

DOI:

10.1109/ICPR56361.2022.9955633

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

3D hand pose estimation (HPE) is the process of locating the joints of the hand in 3D from any visual input. HPE has recently received an increased amount of attention due to its key role in a variety of human-computer interaction applications. Recent HPE methods have demonstrated the advantages of employing videos or multi-view images, allowing for more robust HPE systems. Accordingly, in this study, we propose a new method to perform Sequential learning with Transformer for Hand Pose (SeTHPose) estimation. Our SeTHPose pipeline begins by extracting visual embeddings from individual hand images. We then use a transformer encoder to learn the sequential context along time or viewing angles and generate accurate 2D hand joint locations. Then, a graph convolutional neural network with a U-Net configuration is used to convert the 2D hand joint locations to 3D poses. Our experiments show that SeTHPose performs well on both hand sequence varieties, temporal and angular. Also, SeTHPose outperforms other methods in the field to achieve new state-of-the-art results on two publicly available sequential datasets, STB and MuViHand.
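
The middle stage of the pipeline described in the abstract (per-frame embeddings, then attention across the sequence, then 2D joint locations) can be illustrated with a small NumPy sketch. This is not the authors' implementation: it uses one single-head self-attention layer in place of a full transformer encoder, random weights instead of trained ones, and omits the visual feature extractor and the graph convolutional 2D-to-3D stage. The names `self_attention`, `predict_2d_joints`, and `init_params`, the embedding size, and the choice of 21 joints per hand are all assumptions made for illustration.

```python
import numpy as np

NUM_JOINTS = 21  # assumed joint count per hand; not stated in the abstract


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def self_attention(seq, w_q, w_k, w_v):
    """Single-head scaled dot-product attention over a (T, D) sequence.

    Each row of `seq` is the embedding of one frame (or one camera view).
    Returns the context-mixed sequence and the (T, T) attention weights.
    """
    q, k, v = seq @ w_q, seq @ w_k, seq @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    weights = softmax(scores, axis=-1)
    return weights @ v, weights


def init_params(dim, seed=0):
    """Random (untrained) weights; hypothetical stand-ins for learned ones."""
    rng = np.random.default_rng(seed)
    scale = 1.0 / np.sqrt(dim)
    return {
        "w_q": rng.standard_normal((dim, dim)) * scale,
        "w_k": rng.standard_normal((dim, dim)) * scale,
        "w_v": rng.standard_normal((dim, dim)) * scale,
        "w_out": rng.standard_normal((dim, NUM_JOINTS * 2)) * scale,
    }


def predict_2d_joints(frame_embeddings, params):
    """Map a (T, D) sequence of embeddings to (T, NUM_JOINTS, 2) coordinates."""
    context, weights = self_attention(
        frame_embeddings, params["w_q"], params["w_k"], params["w_v"]
    )
    context = context + frame_embeddings  # residual connection
    joints = context @ params["w_out"]    # linear regression head
    return joints.reshape(len(frame_embeddings), NUM_JOINTS, 2), weights


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    embeddings = rng.standard_normal((8, 32))  # 8 frames, 32-dim embeddings
    joints_2d, attn = predict_2d_joints(embeddings, init_params(32))
    print(joints_2d.shape, attn.shape)
```

Because attention is computed across the sequence axis, the same sketch applies whether the rows are consecutive video frames or different viewing angles of the same hand, which mirrors the temporal and angular settings mentioned in the abstract.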

Pages: 535-541

Page count: 7

