RSN: Range Sparse Net for Efficient, Accurate LiDAR 3D Object Detection

被引:110
作者
Sun, Pei [1 ]
Wang, Weiyue [1 ]
Chai, Yuning [1 ]
Elsayed, Gamaleldin [2 ]
Bewley, Alex [2 ]
Zhang, Xiao [1 ]
Sminchisescu, Cristian [2 ]
Anguelov, Dragomir [1 ]
机构
[1] Waymo LLC, Mountain View, CA 94043 USA
[2] Google, Mountain View, CA 94043 USA
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.00567
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The detection of 3D objects from LiDAR data is a critical component in most autonomous driving systems. Safe, high speed driving needs larger detection ranges, which are enabled by new LiDARs. These larger detection ranges require more efficient and accurate detection models. Towards this goal, we propose Range Sparse Net (RSN) - a simple, efficient, and accurate 3D object detector - in order to tackle real time 3D object detection in this extended detection regime. RSN predicts foreground points from range images and applies sparse convolutions on the selected foreground points to detect objects. The lightweight 2D convolutions on dense range images results in significantly fewer selected foreground points, thus enabling the later sparse convolutions in RSN to efficiently operate. Combining features from the range image further enhance detection accuracy. RSN runs at more than 60 frames per second on a 150m x 150m detection region on Waymo Open Dataset (WOD) while being more accurate than previously published detectors. As of 11/2020, RSN is ranked first in the WOD leaderboard based on the APH/LEVEL_1 metrics for LiDAR-based pedestrian and vehicle detection, while being several times faster than alternatives.
引用
收藏
页码:5721 / 5730
页数:10
相关论文
共 43 条
  • [1] Abadi M, 2016, ACM SIGPLAN NOTICES, V51, P1, DOI [10.1145/2951913.2976746, 10.1145/3022670.2976746]
  • [2] Ba J., 2016, ARXIV160706450, V1050, P21
  • [3] Bewley A., 2020, ARXIV200509927
  • [4] Chen X., 2017, P IEEE CVF C COMP VI, P1907, DOI [DOI 10.1109/CVPR.2017.691, 10.1109/CVPR.2017.691]
  • [5] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [6] Dai J., 2016, P NIPS16 30 INT C NE, P379, DOI DOI 10.1109/CVPR.2017.690
  • [7] Engelcke Martin, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P1355, DOI 10.1109/ICRA.2017.7989161
  • [8] Ge R., 2020, ARXIV200612671
  • [9] Girshick R., 2014, P IEEE C COMPUTER VI, DOI [10.1109/CVPR.2014.81, DOI 10.1109/CVPR.2014.81]
  • [10] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448