A survey on 3D hand pose estimation: Cameras, methods, and datasets

被引:70
作者
Li, Rui [1 ]
Liu, Zhenyu [1 ]
Tan, Jianrong [1 ]
机构
[1] Zhejiang Univ, State Key Lab CAD&CG, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Hand pose estimation; Hand tracking; Depth camera; Human-computer interaction; REGRESSION FORESTS; KINECT; SYSTEM; SENSOR; ACCURACY; MOTION; MODEL;
D O I
10.1016/j.patcog.2019.04.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
3D Hand pose estimation has received an increasing amount of attention, especially since consumer depth cameras came onto the market in 2010. Although substantial progress has occurred recently, no overview has kept up with the latest developments. To bridge the gap, we provide a comprehensive survey, including depth cameras, hand pose estimation methods, and public benchmark datasets. First, a markerless approach is proposed to evaluate the tracking accuracy of depth cameras with the aid of a numerical control linear motion guide. Traditional approaches focus only on static characteristics. The evaluation of dynamic tracking capability has been long neglected. Second, we summarize the state-of-the-art methods and analyze the lines of research. Third, existing benchmark datasets and evaluation criteria are identified to provide further insight into the field of hand pose estimation. In addition, realistic challenges, recent trends, dataset creation and annotation, and open problems for future research directions are also discussed. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:251 / 272
页数:22
相关论文
共 184 条
[71]   Analysis of Human Grasping Behavior: Object Characteristics and Grasp Type [J].
Feix, Thomas ;
Bullock, Ian M. ;
Dollar, Aaron M. .
IEEE TRANSACTIONS ON HAPTICS, 2014, 7 (03) :311-323
[72]  
Fleishman Shachar, 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), P28, DOI 10.1109/CVPRW.2015.7301345
[73]   Multi-task, multi-domain learning: Application to semantic segmentation and pose regression [J].
Fourure, Damien ;
Emonet, Remi ;
Fromont, Elisa ;
Muselet, Damien ;
Neverova, Natalia ;
Tremeau, Alain ;
Wolf, Christian .
NEUROCOMPUTING, 2017, 251 :68-80
[74]   Real-Time 3D Hand Pose Estimation with 3D Convolutional Neural Networks [J].
Ge, Liuhao ;
Liang, Hui ;
Yuan, Junsong ;
Thalmann, Daniel .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2019, 41 (04) :956-970
[75]   Robust 3D Hand Pose Estimation From Single Depth Images Using Multi-View CNNs [J].
Ge, Liuhao ;
Liang, Hui ;
Yuan, Junsong ;
Thalmann, Daniel .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2018, 27 (09) :4422-4436
[76]   3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images [J].
Ge, Liuhao ;
Liang, Hui ;
Yuan, Junsong ;
Thalmann, Daniel .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :5679-5688
[77]   Robust 3D Hand Pose Estimation in Single Depth Images: from Single-View CNN to Multi-View CNNs [J].
Ge, Liuhao ;
Liang, Hui ;
Yuan, Junsong ;
Thalmann, Daniel .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3593-3601
[78]   Metrological comparison between Kinect I and Kinect II sensors [J].
Gonzalez-Jorge, H. ;
Rodriguez-Gonzalvez, P. ;
Martinez-Sanchez, J. ;
Gonzalez-Aguilera, D. ;
Arias, P. ;
Gesto, M. ;
Diaz-Vilarino, L. .
MEASUREMENT, 2015, 70 :21-26
[79]   3D Hand-Object Pose Estimation from Depth with Convolutional Neural Networks [J].
Goudie, Duncan ;
Galata, Aphrodite .
2017 12TH IEEE INTERNATIONAL CONFERENCE ON AUTOMATIC FACE AND GESTURE RECOGNITION (FG 2017), 2017, :406-413
[80]   An Analysis of the Precision and Reliability of the Leap Motion Sensor and Its Suitability for Static and Dynamic Tracking [J].
Guna, Joze ;
Jakus, Grega ;
Pogacnik, Matevz ;
Tomazic, Saso ;
Sodnik, Jaka .
SENSORS, 2014, 14 (02) :3702-3720