Fingertip in the Eye: An Attention-Based Method for Real-Time Hand Tracking and Fingertip Detection in Egocentric Videos

被引:3
作者
Liu, Xiaorui [1 ]
Huang, Yichao [1 ]
Zhang, Xin [1 ]
Jin, Lianwen [1 ]
机构
[1] South China Univ Technol, Sch Elect & Informat Engn, Guangzhou, Guangdong, Peoples R China
来源
PATTERN RECOGNITION (CCPR 2016), PT I | 2016年 / 662卷
关键词
Attention-based hand tracking; Multiple points fingertip detection; Large scale ego-finger dataset; VISION;
D O I
10.1007/978-981-10-3002-4_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The hand and fingertip tracking is the crucial part in the egocentric vision interaction, and it remains a challenging problem due to various factors like dynamic environment and hand deformation. We propose a convolutional neural network (CNN) based method for the real-time and accurate hand tracking and fingertip detection in RGB sequences captured by an egocentric mobile camera. Firstly, we build a large scale dataset, Ego-Finger, containing plenty of scenarios and human labeled ground truth. Secondly, we propose a two stage CNN pipeline, i.e., the human vision inspired Attention-based Hand Tracker (AHT) and the hand physical constrained Multi-Points Fingertip Detector (MFD). Comparing with state-of-the-art methods, the proposed method achieves very promising results in the real-time fashion.
引用
收藏
页码:145 / 154
页数:10
相关论文
共 20 条
[1]  
[Anonymous], 2015, IEEE INT C COMP VIS
[2]   Lending A Hand: Detecting Hands and Recognizing Activities in Complex Egocentric Interactions [J].
Bambach, Sven ;
Lee, Stefan ;
Crandall, David J. ;
Yu, Chen .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1949-1957
[3]   Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation [J].
Baraldi, Lorenzo ;
Paci, Francesco ;
Serra, Giuseppe ;
Benini, Luca ;
Cucchiara, Rita .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2014, :702-+
[4]  
Betancourt A, 2015, IEEE IMAGE PROC, P2552, DOI 10.1109/ICIP.2015.7351263
[5]   The Evolution of First Person Vision Methods: A Survey [J].
Betancourt, Alejandro ;
Morerio, Pietro ;
Regazzoni, Carlo S. ;
Rauterberg, Matthias .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (05) :744-760
[6]   Scene and screen center bias early eye movements in scene viewing [J].
Bindemann, Markus .
VISION RESEARCH, 2010, 50 (23) :2577-2587
[7]   Global Contrast based Salient Region Detection [J].
Cheng, Ming-Ming ;
Zhang, Guo-Xin ;
Mitra, Niloy J. ;
Huang, Xiaolei ;
Hu, Shi-Min .
2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, :409-416
[8]   Context-Aware Saliency Detection [J].
Goferman, Stas ;
Zelnik-Manor, Lihi ;
Tal, Ayellet .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2012, 34 (10) :1915-1926
[9]   High-Speed Tracking with Kernelized Correlation Filters [J].
Henriques, Joao F. ;
Caseiro, Rui ;
Martins, Pedro ;
Batista, Jorge .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2015, 37 (03) :583-596
[10]   DeepFinger: A Cascade Convolutional Neuron Network Approach to Finger Key Point Detection in Egocentric Vision with Mobile Camera [J].
Huang, Yichao ;
Liu, Xiaorui ;
Jin, Lianwen ;
Zhang, Xin .
2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, :2944-2949