An Efficient RGB-D Hand Gesture Detection Framework for Dexterous Robot Hand-Arm Teleoperation System

被引：13

作者：

Gao, Qing ^{[1
,2
]}

Ju, Zhaojie ^{[3
]}

Chen, Yongquan ^{[1
,2
]}

Wang, Qiwen ^{[1
,2
]}

Chi, Chuliang ^{[1
,2
]}

机构：

[1] Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China

[2] Chinese Univ Hong Kong, Inst Robot & Intelligent Mfg, Shenzhen 518172, Peoples R China

[3] Univ Portsmouth, Sch Comp, Portsmouth PO1 3HE, Hants, England

来源：

IEEE TRANSACTIONS ON HUMAN-MACHINE SYSTEMS | 2023年 / 53卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Robots; Location awareness; Visualization; Robot kinematics; Interference; Robot vision systems; Data integration; Dexterous robot; hand gesture detection; RGB-D; teleoperation; RECOGNITION; NETWORK;

D O I：

10.1109/THMS.2022.3206663

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Aiming at the problems of accurate and fast hand gesture detection and teleoperation mapping in the hand-based visual teleoperation of dexterous robots, an efficient hand gesture detection framework based on deep learning is proposed in this article. It can achieve an accurate and fast hand gesture detection and teleoperation of dexterous robots based on an anchor-free network architecture by using an RGB-D camera. First, an RGB-D early-fusion method based on the HSV space is proposed, effectively reducing background interference and enhancing hand information. Second, a hand gesture classification network (HandClasNet) is proposed to realize hand detection and localization by detecting the center and corner points of hands, and a HandClasNet is proposed to realize gesture recognition by using a parallel EfficientNet structure. Then, a dexterous robot hand-arm teleoperation system based on the hand gesture detection framework is designed to realize the hand-based teleoperation of a dexterous robot. Our method achieves high accuracy with fast speed on public and custom hand datasets and outperforms some state-of-the-art methods. In addition, the application of the proposed method in the hand-based teleoperation system can control the grasping of various objects by a dexterous hand-arm system in real time and accurately, which verifies the efficiency of our method.

引用

页码：13 / 23

页数：11

共 38 条

[21] Focal Loss for Dense Object Detection
Lin, Tsung-Yi
Goyal, Priya
Girshick, Ross
He, Kaiming
Dollar, Piotr
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, : 2999 - 3007
[22] SSD: Single Shot MultiBox Detector
Liu, Wei
Anguelov, Dragomir
Erhan, Dumitru
Szegedy, Christian
Reed, Scott
Fu, Cheng-Yang
Berg, Alexander C.
[J]. COMPUTER VISION - ECCV 2016, PT I, 2016, 9905 : 21 - 37
[23] Multimodal Gesture Recognition Based on the ResC3D Network
Miao, Qiguang
Li, Yunan
Ouyang, Wanli
Ma, Zhenxin
Xu, Xin
Shi, Weikang
Cao, Xiaochun
[J]. 2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017, : 3047 - 3055
[24] Hand detection using multiple proposals
Mittal, Arpit
Zisserman, Andrew
Torr, Philip H. S.
[J]. PROCEEDINGS OF THE BRITISH MACHINE VISION CONFERENCE 2011, 2011,
[25] Qingrui Zhang, 2018, IAENG International Journal of Computer Science, V45, P435
[26] Redmon J, 2018, Arxiv, DOI arXiv:1804.02767
[27] Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
Ren, Shaoqing
He, Kaiming
Girshick, Ross
Sun, Jian
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (06) : 1137 - 1149
[28] Sivakumar A, 2022, Arxiv, DOI arXiv:2202.10448
[29] Tan MX, 2019, PR MACH LEARN RES, V97
[30] American Sign Language alphabet recognition using Convolutional Neural Networks with multiview augmentation and inference fusion
Tao, Wenjin
Leu, Ming C.
Yin, Zhaozheng
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2018, 76 : 202 - 213

← 1 2 3 4 →