Deep Transfer Learning Approach for Robust Hand Detection

Cited by: 0
Authors
Cvetkovic, Stevica [1]
Savic, Nemanja [1]
Ciric, Ivan [2]
Affiliations
[1] Univ Nis, Fac Elect Engn, Nish 18000, Serbia
[2] Univ Nis, Fac Mech Engn, Nish 18000, Serbia
Keywords
Deep learning model; object detection; hand detection; transfer learning; data augmentation
DOI
10.32604/iasc.2023.032526
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Human hand detection in uncontrolled environments is a challenging visual recognition task due to numerous variations in hand pose and background clutter. To achieve high accuracy while running in real time, we propose a deep transfer learning approach built on a state-of-the-art deep learning object detector. Our method, denoted YOLOHANDS, is built on top of the You Only Look Once (YOLO) deep learning architecture, which is modified for the single-class hand detection task. The model transfer is performed by modifying the higher convolutional layers, including the last fully connected layer, while initializing the lower, unmodified layers with generic pretrained weights. To improve robustness, we introduce a comprehensive augmentation procedure over the training image dataset, specifically adapted to the hand detection problem. Experimental evaluation on a challenging public dataset demonstrates highly accurate results, comparable to state-of-the-art methods.
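The two ideas in the abstract, layer-wise weight transfer and detection-aware augmentation, can be illustrated with a minimal sketch. It is not the authors' implementation: the layer names, the weight dictionaries, the helper `init_transferred_model`, and the choice of a horizontal flip as the augmentation are all assumptions made for illustration. The only outside fact relied on is the common YOLO label convention of normalized `(class_id, x_center, y_center, width, height)` boxes.

```python
import random


def init_transferred_model(pretrained, new_layer_sizes, seed=0):
    """Build a weight dict: reuse pretrained layers, re-initialize modified ones.

    pretrained      -- {layer_name: list of floats}, generic pretrained weights
    new_layer_sizes -- {layer_name: int}, hypothetical sizes of the modified layers
    """
    rng = random.Random(seed)
    model = {}
    for name, weights in pretrained.items():
        if name in new_layer_sizes:
            # Modified higher layer: discard old weights, draw small random ones.
            size = new_layer_sizes[name]
            model[name] = [rng.gauss(0.0, 0.01) for _ in range(size)]
        else:
            # Unmodified lower layer: copy the pretrained weights as-is.
            model[name] = list(weights)
    return model


def hflip_yolo_box(box):
    """Mirror a normalized YOLO label (class_id, x_center, y_center, width, height).

    A horizontal image flip moves the box center to 1 - x_center and leaves
    the other fields unchanged.
    """
    class_id, x_center, y_center, width, height = box
    return (class_id, 1.0 - x_center, y_center, width, height)


# Toy stand-in for a pretrained network (hypothetical layer names and values).
pretrained = {"conv1": [0.1, 0.2], "conv2": [0.3, 0.4], "head": [0.5] * 8}
model = init_transferred_model(pretrained, {"head": 6})
flipped = hflip_yolo_box((0, 0.25, 0.5, 0.2, 0.3))
```

In a real detector the same pattern would be applied to framework tensors rather than Python lists, and every geometric augmentation would need a matching label transform like the one shown for the flip.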
Pages: 967-979
Page count: 13