Hand Pose Estimation in the Task of Egocentric Actions

被引:1
作者
Hruz, Marek [1 ,2 ]
Kanis, Jakub [2 ]
Krnoul, Zdenek [1 ]
机构
[1] Univ West Bohemia Pilsen, Fac Appl Sci, Dept Cybernet, Plzen 30614, Czech Republic
[2] Univ West Bohemia Pilsen, Fac Appl Sci, New Technol Informat Soc, Plzen 30100, Czech Republic
关键词
Three-dimensional displays; Pose estimation; Task analysis; Two dimensional displays; Solid modeling; Prediction algorithms; Location awareness; 3D convolutional neural network; egocentric; hand pose; TruncatedSVD; volumetric data; REGRESSION;
D O I
10.1109/ACCESS.2021.3050624
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article we tackle the problem of hand pose estimation when the hand is interacting with various objects from egocentric viewpoint. This entails a frequent occlusion of parts of the hand by the object and also self-occlusions of the hand. We use a Voxel-to-Voxel approach to obtain hypotheses of the hand joint locations, ensemble the hypotheses and use several post-processing strategies to improve on the results. We utilize models of prior hand pose in the form of Truncated Singular Value Decomposition (SVD) and the temporal context to produce refined hand joint locations. We present an ablation study of the methods to show the influence of individual features of the post-processing. With our method we were able to constitute state-of-the-art results on the HANDS19 Challenge: Task 2 - Depth-Based 3D Hand Pose Estimation while Interacting with Objects, with precision on unseen test data of 33.09 mm.
引用
收藏
页码:10533 / 10547
页数:15
相关论文
共 67 条
[41]  
Ryumin D, 2019, INT CONF PERVAS COMP, P949, DOI [10.1109/percomw.2019.8730886, 10.1109/PERCOMW.2019.8730886]
[42]   Accurate, Robust, and Flexible Real-time Hand Tracking [J].
Sharp, Toby ;
Keskin, Cem ;
Robertson, Duncan ;
Taylor, Jonathan ;
Shotton, Jamie ;
Kim, David ;
Rhemann, Christoph ;
Leichter, Ido ;
Vinnikov, Alon ;
Wei, Yichen ;
Freedman, Daniel ;
Kohli, Pushmeet ;
Krupka, Eyal ;
Fitzgibbon, Andrew ;
Izadi, Shahram .
CHI 2015: PROCEEDINGS OF THE 33RD ANNUAL CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, 2015, :3633-3642
[43]   DeepHand: Robust Hand Pose Estimation by Completing a Matrix Imputed with Deep Features [J].
Sinha, Ayan ;
Choi, Chiho ;
Ramani, Karthik .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :4150-4158
[44]   Real-Time Joint Tracking of a Hand Manipulating an Object from RGB-D Input [J].
Sridhar, Srinath ;
Mueller, Franziska ;
Zollhoefer, Michael ;
Casas, Dan ;
Oulasvirta, Antti ;
Theobalt, Christian .
COMPUTER VISION - ECCV 2016, PT II, 2016, 9906 :294-310
[45]  
Sun X, 2015, PROC CVPR IEEE, P824, DOI 10.1109/CVPR.2015.7298683
[46]   Opening the Black Box: Hierarchical Sampling Optimization for Estimating Human Hand Pose [J].
Tang, Danhang ;
Taylor, Jonathan ;
Kohli, Pushmeet ;
Keskin, Cem ;
Kim, Tae-Kyun ;
Shotton, Jamie .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :3325-3333
[47]   Latent Regression Forest: Structured Estimation of 3D Articulated Hand Posture [J].
Tang, Danhang ;
Chang, Hyung Jin ;
Tejani, Alykhan ;
Kim, Tae-Kyun .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :3786-3793
[48]   Real-time Articulated Hand Pose Estimation using Semi-supervised Transductive Regression Forests [J].
Tang, Danhang ;
Yu, Tsz-Ho ;
Kim, Tae-Kyun .
2013 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2013, :3224-3231
[49]   Efficient and Precise Interactive Hand Tracking Through Joint, Continuous Optimization of Pose and Correspondences [J].
Taylor, Jonathan ;
Bordeaux, Lucas ;
Cashman, Thomas ;
Corish, Bob ;
Keskin, Cem ;
Sharp, Toby ;
Soto, Eduardo ;
Sweeney, David ;
Valentin, Julien ;
Luff, Benjamin ;
Topalian, Arran ;
Wood, Erroll ;
Khamis, Sameh ;
Kohli, Pushmeet ;
Izadi, Shahram ;
Banks, Richard ;
Fitzgibbon, Andrew ;
Shotton, Jamie .
ACM TRANSACTIONS ON GRAPHICS, 2016, 35 (04)
[50]  
Tekin B., 2016, P BRIT MACH VIS C 20, P1