An Efficient RGB-D Hand Gesture Detection Framework for Dexterous Robot Hand-Arm Teleoperation System

被引:13
作者
Gao, Qing [1 ,2 ]
Ju, Zhaojie [3 ]
Chen, Yongquan [1 ,2 ]
Wang, Qiwen [1 ,2 ]
Chi, Chuliang [1 ,2 ]
机构
[1] Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China
[2] Chinese Univ Hong Kong, Inst Robot & Intelligent Mfg, Shenzhen 518172, Peoples R China
[3] Univ Portsmouth, Sch Comp, Portsmouth PO1 3HE, Hants, England
基金
中国国家自然科学基金;
关键词
Robots; Location awareness; Visualization; Robot kinematics; Interference; Robot vision systems; Data integration; Dexterous robot; hand gesture detection; RGB-D; teleoperation; RECOGNITION; NETWORK;
D O I
10.1109/THMS.2022.3206663
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Aiming at the problems of accurate and fast hand gesture detection and teleoperation mapping in the hand-based visual teleoperation of dexterous robots, an efficient hand gesture detection framework based on deep learning is proposed in this article. It can achieve an accurate and fast hand gesture detection and teleoperation of dexterous robots based on an anchor-free network architecture by using an RGB-D camera. First, an RGB-D early-fusion method based on the HSV space is proposed, effectively reducing background interference and enhancing hand information. Second, a hand gesture classification network (HandClasNet) is proposed to realize hand detection and localization by detecting the center and corner points of hands, and a HandClasNet is proposed to realize gesture recognition by using a parallel EfficientNet structure. Then, a dexterous robot hand-arm teleoperation system based on the hand gesture detection framework is designed to realize the hand-based teleoperation of a dexterous robot. Our method achieves high accuracy with fast speed on public and custom hand datasets and outperforms some state-of-the-art methods. In addition, the application of the proposed method in the hand-based teleoperation system can control the grasping of various objects by a dexterous hand-arm system in real time and accurately, which verifies the efficiency of our method.
引用
收藏
页码:13 / 23
页数:11
相关论文
共 38 条
  • [21] Focal Loss for Dense Object Detection
    Lin, Tsung-Yi
    Goyal, Priya
    Girshick, Ross
    He, Kaiming
    Dollar, Piotr
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
  • [22] SSD: Single Shot MultiBox Detector
    Liu, Wei
    Anguelov, Dragomir
    Erhan, Dumitru
    Szegedy, Christian
    Reed, Scott
    Fu, Cheng-Yang
    Berg, Alexander C.
    [J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37
  • [23] Multimodal Gesture Recognition Based on the ResC3D Network
    Miao, Qiguang
    Li, Yunan
    Ouyang, Wanli
    Ma, Zhenxin
    Xu, Xin
    Shi, Weikang
    Cao, Xiaochun
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 3047 - 3055
  • [24] Hand detection using multiple proposals
    Mittal, Arpit
    Zisserman, Andrew
    Torr, Philip H. S.
    [J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
  • [25] Qingrui Zhang, 2018, IAENG International Journal of Computer Science, V45, P435
  • [26] Redmon J, 2018, Arxiv, DOI arXiv:1804.02767
  • [27] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
    Ren, Shaoqing
    He, Kaiming
    Girshick, Ross
    Sun, Jian
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
  • [28] Sivakumar A, 2022, Arxiv, DOI arXiv:2202.10448
  • [29] Tan MX, 2019, PR MACH LEARN RES, V97
  • [30] American Sign Language alphabet recognition using Convolutional Neural Networks with multiview augmentation and inference fusion
    Tao, Wenjin
    Leu, Ming C.
    Yin, Zhaozheng
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 76 : 202 - 213