Robust 3D Hand Detection from a Single RGB-D Image in Unconstrained Environments

被引：6

作者：

Xu, Chi ^{[1
,2
,3
]}

Zhou, Jun ^{[1
,2
]}

Cai, Wendi ^{[1
,2
]}

Jiang, Yunkai ^{[1
,2
]}

Li, Yongbo ^{[1
,2
]}

Liu, Yi ^{[4
,5
]}

机构：

[1] China Univ Geosci, Sch Automat, Wuhan 430074, Peoples R China

[2] Hubei Key Lab Adv Control & Intelligent Automat C, Wuhan 430074, Peoples R China

[3] Minist Educ, Engn Res Ctr Intelligent Technol Geoexplorat, Wuhan 430074, Peoples R China

[4] CRRC Zhuzhou Elect Locomot Co Ltd, Zhuzhou 412000, Peoples R China

[5] Natl Innovat Ctr Adv Rail Transit Equipment, Zhuzhou 412000, Peoples R China

来源：

SENSORS | 2020年 / 20卷 / 21期

基金：

中国国家自然科学基金;

关键词：

3D hand detection; RGB-D sensor; human– computer interaction; unseen lighting condition; adaptive RGB-D fusion; OBJECT DETECTION; RECOGNITION; NETWORK;

D O I：

10.3390/s20216360

中图分类号：

O65 [分析化学];

学科分类号：

070302 ; 081704 ;

摘要：

Three-dimensional hand detection from a single RGB-D image is an important technology which supports many useful applications. Practically, it is challenging to robustly detect human hands in unconstrained environments because the RGB-D channels can be affected by many uncontrollable factors, such as light changes. To tackle this problem, we propose a 3D hand detection approach which improves the robustness and accuracy by adaptively fusing the complementary features extracted from the RGB-D channels. Using the fused RGB-D feature, the 2D bounding boxes of hands are detected first, and then the 3D locations along the z-axis are estimated through a cascaded network. Furthermore, we represent a challenging RGB-D hand detection dataset collected in unconstrained environments. Different from previous works which primarily rely on either the RGB or D channel, we adaptively fuse the RGB-D channels for hand detection. Specifically, evaluation results show that the D-channel is crucial for hand detection in unconstrained environments. Our RGB-D fusion-based approach significantly improves the hand detection accuracy from 69.1 to 74.1 comparing to one of the most state-of-the-art RGB-based hand detectors. The existing RGB- or D-based methods are unstable in unseen lighting conditions: in dark conditions, the accuracy of the RGB-based method significantly drops to 48.9, and in back-light conditions, the accuracy of the D-based method dramatically drops to 28.3. Compared with these methods, our RGB-D fusion based approach is much more robust without accuracy degrading, and our detection results are 62.5 and 65.9, respectively, in these two extreme lighting conditions for accuracy.

引用

页码：1 / 22

页数：22

共 50 条

[21] RGB-D Camera based 3D Object Pose Estimation and Grasping [J].

Liang, Xiaoxiao ;

Cheng, Hongtai .

2019 9TH IEEE ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER 2019), 2019, :1279-1284

[22] 3D Pose Estimation of Daily Objects Using an RGB-D Camera [J].

Choi, Changhyun ;

Christensen, Henrik I. .

2012 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2012, :3342-3349

[23] Integrating 3D structure into traffic scene understanding with RGB-D data [J].

Xia, Yingjie ;

Xu, Weiwei ;

Zhang, Luming ;

Shi, Xingmin ;

Mao, Kuang .

NEUROCOMPUTING, 2015, 151 :700-709

[24] Large Scale Indoor 3D Mapping Using RGB-D Sensor [J].

Zhu, Xiaoxiao ;

Cao, Qixin ;

Yokoi, Hiroshi ;

Jiang, Yinlai .

INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2016, PT I, 2016, 9834 :313-321

[25] 3D-Scene-Former: 3D scene generation from a single RGB image using Transformers [J].

Chatterjee, Jit ;

Vega, Maria Torres .

VISUAL COMPUTER, 2025, 41 (04) :2875-2889

[26] RGB-D Saliency Detection with 3D Cross-modal Fusion and Mid-level Integration [J].

Liu, Taoqi ;

Li, Bo .

2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, :1328-1335

[27] RGB-D object pose estimation in unstructured environments [J].

Choi, Changhyun ;

Christensen, Henrik I. .

ROBOTICS AND AUTONOMOUS SYSTEMS, 2016, 75 :595-613

[28] Semantic Labeling of Indoor Environments from 3D RGB Maps [J].

Brucker, Manuel ;

Durner, Maximilian ;

Ambrus, Rares ;

Marton, Zoltan Csaba ;

Wendt, Axel ;

Jensfelt, Patric ;

Arras, Kai O. ;

Triebel, Rudolph .

2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, :1871-1878

[29] RangeLVDet: Boosting 3D Object Detection in LIDAR With Range Image and RGB Image [J].

Zhang, Zehan ;

Liang, Zhidong ;

Zhang, Ming ;

Zhao, Xian ;

Li, Hao ;

Yang, Ming ;

Tan, Wenming ;

Pu, Shiliang .

IEEE SENSORS JOURNAL, 2022, 22 (02) :1391-1403

[30] RGB-D salient object detection: A survey [J].

Zhou, Tao ;

Fan, Deng-Ping ;

Cheng, Ming-Ming ;

Shen, Jianbing ;

Shao, Ling .

COMPUTATIONAL VISUAL MEDIA, 2021, 7 (01) :37-69

← 1 2 3 4 5 →