Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments

被引:6
作者
Xu, Chi [1 ,2 ,3 ]
Zhou, Jun [1 ,2 ]
Cai, Wendi [1 ,2 ]
Jiang, Yunkai [1 ,2 ]
Li, Yongbo [1 ,2 ]
Liu, Yi [4 ,5 ]
机构
[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China
[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China
[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan 430074, Peoples R China
[4] CRRC Zhuzhou Elect Locomot Co Ltd, Zhuzhou 412000, Peoples R China
[5] Natl Innovat Ctr Adv Rail Transit Equipment, Zhuzhou 412000, Peoples R China
基金
中国国家自然科学基金;
关键词
3D hand detection; RGB-D sensor; human– computer interaction; unseen lighting condition; adaptive RGB-D fusion; OBJECT DETECTION; RECOGNITION; NETWORK;
D O I
10.3390/s20216360
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such as light changes. To tackle this problem, we propose a 3D hand detection approach which improves the robustness and accuracy by adaptively fusing the complementary features extracted from the RGB-D channels. Using the fused RGB-D feature, the 2D bounding boxes of hands are detected first, and then the 3D locations along the z-axis are estimated through a cascaded network. Furthermore, we represent a challenging RGB-D hand detection dataset collected in unconstrained environments. Different from previous works which primarily rely on either the RGB or D channel, we adaptively fuse the RGB-D channels for hand detection. Specifically, evaluation results show that the D-channel is crucial for hand detection in unconstrained environments. Our RGB-D fusion-based approach significantly improves the hand detection accuracy from 69.1 to 74.1 comparing to one of the most state-of-the-art RGB-based hand detectors. The existing RGB- or D-based methods are unstable in unseen lighting conditions: in dark conditions, the accuracy of the RGB-based method significantly drops to 48.9, and in back-light conditions, the accuracy of the D-based method dramatically drops to 28.3. Compared with these methods, our RGB-D fusion based approach is much more robust without accuracy degrading, and our detection results are 62.5 and 65.9, respectively, in these two extreme lighting conditions for accuracy.
引用
收藏
页码:1 / 22
页数:22
相关论文
共 50 条
[21]   RGB-D Camera based 3D Object Pose Estimation and Grasping [J].
Liang, Xiaoxiao ;
Cheng, Hongtai .
2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, :1279-1284
[22]   3D Pose Estimation of Daily Objects Using an RGB-D Camera [J].
Choi, Changhyun ;
Christensen, Henrik I. .
2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, :3342-3349
[23]   Integrating 3D structure into traffic scene understanding with RGB-D data [J].
Xia, Yingjie ;
Xu, Weiwei ;
Zhang, Luming ;
Shi, Xingmin ;
Mao, Kuang .
NEUROCOMPUTING, 2015, 151 :700-709
[24]   Large Scale Indoor 3D Mapping Using RGB-D Sensor [J].
Zhu, Xiaoxiao ;
Cao, Qixin ;
Yokoi, Hiroshi ;
Jiang, Yinlai .
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2016, PT I, 2016, 9834 :313-321
[25]   3D-Scene-Former: 3D scene generation from a single RGB image using Transformers [J].
Chatterjee, Jit ;
Vega, Maria Torres .
VISUAL COMPUTER, 2025, 41 (04) :2875-2889
[26]   RGB-D Saliency Detection with 3D Cross-modal Fusion and Mid-level Integration [J].
Liu, Taoqi ;
Li, Bo .
2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, :1328-1335
[27]   RGB-D object pose estimation in unstructured environments [J].
Choi, Changhyun ;
Christensen, Henrik I. .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 :595-613
[28]   Semantic Labeling of Indoor Environments from 3D RGB Maps [J].
Brucker, Manuel ;
Durner, Maximilian ;
Ambrus, Rares ;
Marton, Zoltan Csaba ;
Wendt, Axel ;
Jensfelt, Patric ;
Arras, Kai O. ;
Triebel, Rudolph .
2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, :1871-1878
[29]   RangeLVDet: Boosting 3D Object Detection in LIDAR With Range Image and RGB Image [J].
Zhang, Zehan ;
Liang, Zhidong ;
Zhang, Ming ;
Zhao, Xian ;
Li, Hao ;
Yang, Ming ;
Tan, Wenming ;
Pu, Shiliang .
IEEE SENSORS JOURNAL, 2022, 22 (02) :1391-1403
[30]   RGB-D salient object detection: A survey [J].
Zhou, Tao ;
Fan, Deng-Ping ;
Cheng, Ming-Ming ;
Shen, Jianbing ;
Shao, Ling .
COMPUTATIONAL VISUAL MEDIA, 2021, 7 (01) :37-69