Frustum PointNets for 3D Object Detection from RGB-D Data

Cited by: 1823
Authors
Qi, Charles R. [1 ]
Liu, Wei [2 ]
Wu, Chenxia [2 ]
Su, Hao [3 ]
Guibas, Leonidas J. [1 ]
Affiliations
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Nuro Inc, Mountain View, CA USA
[3] Univ Calif San Diego, La Jolla, CA USA
Source
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018
Funding
US National Science Foundation
DOI
10.1109/CVPR.2018.00102
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this work, we study 3D object detection from RGB-D data in both indoor and outdoor scenes. While previous methods focus on images or 3D voxels, often obscuring natural 3D patterns and invariances of 3D data, we directly operate on raw point clouds by popping up RGB-D scans. However, a key challenge of this approach is how to efficiently localize objects in point clouds of large-scale scenes (region proposal). Instead of solely relying on 3D proposals, our method leverages both mature 2D object detectors and advanced 3D deep learning for object localization, achieving efficiency as well as high recall for even small objects. Benefiting from learning directly in raw point clouds, our method is also able to precisely estimate 3D bounding boxes even under strong occlusion or with very sparse points. Evaluated on KITTI and SUN RGB-D 3D detection benchmarks, our method outperforms the state of the art by remarkable margins while having real-time capability.
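The abstract's idea of using a 2D detection to narrow the 3D search can be illustrated with a minimal sketch. It assumes a pinhole camera model with points already expressed in the camera frame (z pointing forward); the function name `points_in_frustum`, the intrinsics `fx, fy, cx, cy`, and the depth limits are hypothetical choices for illustration and are not taken from the paper. The sketch keeps only the points whose image projection falls inside a 2D box; it does not cover the learned segmentation or box estimation stages.

```python
from typing import List, Tuple

Point = Tuple[float, float, float]         # (x, y, z) in the camera frame, z forward
Box2D = Tuple[float, float, float, float]  # (u_min, v_min, u_max, v_max) in pixels


def points_in_frustum(
    points: List[Point],
    box: Box2D,
    fx: float,
    fy: float,
    cx: float,
    cy: float,
    z_near: float = 0.1,    # assumed near plane, in the same units as z
    z_far: float = 100.0,   # assumed far plane
) -> List[Point]:
    """Return the points whose pinhole projection lies inside the 2D box."""
    u_min, v_min, u_max, v_max = box
    kept = []
    for x, y, z in points:
        # Discard points behind the camera or outside the assumed depth range.
        if not (z_near <= z <= z_far):
            continue
        # Pinhole projection of the 3D point onto the image plane.
        u = fx * x / z + cx
        v = fy * y / z + cy
        if u_min <= u <= u_max and v_min <= v <= v_max:
            kept.append((x, y, z))
    return kept


if __name__ == "__main__":
    cloud = [(0.0, 0.0, 10.0), (5.0, 0.0, 10.0), (0.0, 0.0, -1.0), (0.2, -0.2, 5.0)]
    box = (40.0, 40.0, 60.0, 60.0)
    print(points_in_frustum(cloud, box, fx=100.0, fy=100.0, cx=50.0, cy=50.0))
```

A per-point loop keeps the sketch dependency-free; for real scans a vectorized array implementation would be the more practical choice.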
Pages: 918-927
Page count: 10