Unified learning approach for egocentric hand gesture recognition and fingertip detection

被引:25
作者
Alam, Mohammad Mahmudul [1 ]
Islam, Mohammad Tariqul [2 ]
Rahman, S. M. Mahbubur [3 ]
机构
[1] Univ Maryland Baltimore Cty, Dept Comp Sci & Elect Engn, Baltimore, MD 21250 USA
[2] Princeton Univ, Dept Elect Engn, Princeton, NJ 08544 USA
[3] Bangladesh Univ Engn & Technol, Dept Elect & Elect Engn, Dhaka 1000, Bangladesh
关键词
Convolutional neural network; Fingertip detection; Gesture recognition; Human-computer interaction; Unified detection; VISION;
D O I
10.1016/j.patcog.2021.108200
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Head-mounted device-based human-computer interaction often requires egocentric recognition of hand gestures and fingertips detection. In this paper, a unified approach of egocentric hand gesture recognition and fingertip detection is introduced. The proposed algorithm uses a single convolutional neural network to predict the probabilities of finger class and positions of fingertips in one forward propagation. Instead of directly regressing the positions of fingertips from the fully connected layer, the ensemble of the posi-tion of fingertips is regressed from the fully convolutional network. Subsequently, the ensemble average is taken to regress the final position of fingertips. Since the whole pipeline uses a single network, it is sig-nificantly fast in computation. Experimental results show that the proposed method outperforms the ex -isting fingertip detection approaches including the Direct Regression and the Heatmap-based framework. The effectiveness of the proposed method is also shown in-the-wild scenario as well as in a use-case of virtual reality. (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 30 条
[1]  
Alam MM., 2020, Virtual Real Intell Hardware, V2, P534, DOI DOI 10.1016/J.VRIH.2020.10.001
[2]  
Alamsyah Derry, 2019, Journal of Physics: Conference Series, V1196, DOI 10.1088/1742-6596/1196/1/012017
[3]  
[Anonymous], 2015, 3 INT C LEARN REPR I
[4]  
[Anonymous], 2015, P BRIT MACHINE VISIO
[5]  
Aowal M.A., 2014, P IEEE TENSON IEEE R, V10, P1
[6]   Analysis of the Hands in Egocentric Vision: A Survey [J].
Bandini, Andrea ;
Zariffa, Jose .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (06) :6846-6866
[7]  
Gurav RM, 2015, 2015 INTERNATIONAL CONFERENCE ON INDUSTRIAL INSTRUMENTATION AND CONTROL (ICIC), P974, DOI 10.1109/IIC.2015.7150886
[8]  
Nguyen HD, 2017, 2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), P783, DOI 10.1109/FSKD.2017.8393373
[9]  
Hsien-I Lin, 2014, 2014 IEEE International Conference on Automation Science and Engineering (CASE), P1038, DOI 10.1109/CoASE.2014.6899454
[10]   DeepFinger: A Cascade Convolutional Neuron Network Approach to Finger Key Point Detection in Egocentric Vision with Mobile Camera [J].
Huang, Yichao ;
Liu, Xiaorui ;
Jin, Lianwen ;
Zhang, Xin .
2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, :2944-2949