Frustum PointNets for 3D Object Detection from RGB-D Data

被引:1637
作者
Qi, Charles R. [1 ]
Liu, Wei [2 ]
Wu, Chenxia [2 ]
Su, Hao [3 ]
Guibas, Leonidas J. [1 ]
机构
[1] Stanford Univ, Stanford, CA 94305 USA
[2] Nuro Inc, Mountain View, CA USA
[3] Univ Calif San Diego, La Jolla, CA USA
来源
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2018年
基金
美国国家科学基金会;
关键词
D O I
10.1109/CVPR.2018.00102
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this work, we study 3D object detection from RGBD data in both indoor and outdoor scenes. While previous methods focus on images or 3D voxels, often obscuring natural 3D patterns and invariances of 3D data, we directly operate on raw point clouds by popping up RGB-D scans. However, a key challenge of this approach is how to efficiently localize objects in point clouds of large-scale scenes ( region proposal). Instead of solely relying on 3D proposals, our method leverages both mature 2D object detectors and advanced 3D deep learning for object localization, achieving efficiency as well as high recall for even small objects. Benefited from learning directly in raw point clouds, our method is also able to precisely estimate 3D bounding boxes even under strong occlusion or with very sparse points. Evaluated on KITTI and SUN RGB-D 3D detection benchmarks, our method outperforms the state of the art by remarkable margins while having real-time capability.
引用
收藏
页码:918 / 927
页数:10
相关论文
共 50 条
  • [21] Interactive Mapping in 3D Using RGB-D Data
    Vieira, Pedro
    Ventura, Rodrigo
    2012 IEEE INTERNATIONAL SYMPOSIUM ON SAFETY, SECURITY, AND RESCUE ROBOTICS (SSRR), 2012,
  • [22] Visual Robotic Object Grasping Through Combining RGB-D Data and 3D Meshes
    Zhou, Yiyang
    Wang, Wenhai
    Guan, Wenjie
    Wu, Yirui
    Lai, Heng
    Lu, Tong
    Cai, Min
    MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 404 - 415
  • [23] Reconstructing 3D Human Pose from RGB-D Data with Occlusions
    Dang, Bowen
    Zhao, Xi
    Zhang, Bowen
    Wang, He
    COMPUTER GRAPHICS FORUM, 2023, 42 (07)
  • [24] 3D indoor scene modeling from RGB-D data: A survey
    Chen K.
    Lai Y.-K.
    Hu S.-M.
    Computational Visual Media, 2015, 1 (4) : 267 - 278
  • [25] 3D indoor scene modeling from RGB-D data:a survey
    Kang Chen
    Yu-Kun Lai
    Shi-Min Hu
    ComputationalVisualMedia, 2015, 1 (04) : 267 - 278
  • [26] An End-to-End Deep Learning Network for 3D Object Detection From RGB-D Data Based on Hough Voting
    Yan, Ming
    Li, Zhongtong
    Yu, Xinyan
    Jin, Cong
    IEEE ACCESS, 2020, 8 : 138810 - 138822
  • [27] Articulated clinician detection using 3D pictorial structures on RGB-D data
    Kadkhodamohammadi, Abdolrahim
    Gangi, Afshin
    de Mathelin, Michel
    Padoy, Nicolas
    MEDICAL IMAGE ANALYSIS, 2017, 35 : 215 - 224
  • [28] Multi-target 3D Reconstruction from RGB-D Data
    Gao, Yang
    Yao, Yuan
    Jiang, Yunliang
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND SOFTWARE ENGINEERING (CSSE 2019), 2019,
  • [29] Yolo+FPN: 2D and 3D Fused Object Detection With an RGB-D Camera
    Wang, Ya
    Zell, Andreas
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 4657 - 4664
  • [30] Salient Object Detection in RGB-D Videos
    Mou, Ao
    Lu, Yukang
    He, Jiahao
    Min, Dingyao
    Fu, Keren
    Zhao, Qijun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6660 - 6675