RSN: Range Sparse Net for Efficient, Accurate LiDAR 3D Object Detection

被引:110
作者
Sun, Pei [1 ]
Wang, Weiyue [1 ]
Chai, Yuning [1 ]
Elsayed, Gamaleldin [2 ]
Bewley, Alex [2 ]
Zhang, Xiao [1 ]
Sminchisescu, Cristian [2 ]
Anguelov, Dragomir [1 ]
机构
[1] Waymo LLC, Mountain View, CA 94043 USA
[2] Google, Mountain View, CA 94043 USA
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.00567
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The detection of 3D objects from LiDAR data is a critical component in most autonomous driving systems. Safe, high speed driving needs larger detection ranges, which are enabled by new LiDARs. These larger detection ranges require more efficient and accurate detection models. Towards this goal, we propose Range Sparse Net (RSN) - a simple, efficient, and accurate 3D object detector - in order to tackle real time 3D object detection in this extended detection regime. RSN predicts foreground points from range images and applies sparse convolutions on the selected foreground points to detect objects. The lightweight 2D convolutions on dense range images results in significantly fewer selected foreground points, thus enabling the later sparse convolutions in RSN to efficiently operate. Combining features from the range image further enhance detection accuracy. RSN runs at more than 60 frames per second on a 150m x 150m detection region on Waymo Open Dataset (WOD) while being more accurate than previously published detectors. As of 11/2020, RSN is ranked first in the WOD leaderboard based on the APH/LEVEL_1 metrics for LiDAR-based pedestrian and vehicle detection, while being several times faster than alternatives.
引用
收藏
页码:5721 / 5730
页数:10
相关论文
共 43 条
  • [31] Shuyang Cheng, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12366), P279, DOI 10.1007/978-3-030-58589-1_17
  • [32] Simon M., 2018, ARXIV180306199
  • [33] Sun Pei, 2020, CVPR
  • [34] van der Maaten L., 2017, Submanifold sparse convolutional networks
  • [35] Conceiving resilience: Lexical shifts and proximal meanings in the human-centered natural and built environment literature from 1990 to 2018
    Wang, Yan
    Hulse, David
    Von Meding, Jason
    Brown, Madeline
    Dedenbach, Laura
    [J]. DEVELOPMENTS IN THE BUILT ENVIRONMENT, 2020, 1
  • [36] Waymo, WAYM 5 GEN DRIV
  • [37] Yang Bin, 2018, PROC CVPR IEEE, P7652, DOI DOI 10.1109/CVPR.2018.00798
  • [38] STD: Sparse-to-Dense 3D Object Detector for Point Cloud
    Yang, Zetong
    Sun, Yanan
    Liu, Shu
    Shen, Xiaoyong
    Jia, Jiaya
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1951 - 1960
  • [39] GeoNet: Unsupervised Learning of Dense Depth, Optical Flow and Camera Pose
    Yin, Zhichao
    Shi, Jianping
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 1983 - 1992
  • [40] Headgesture: Hands-free input approach leveraging head movements for HMD devices
    Yan, Yukang
    Yu, Chun
    Yi, Xin
    Shi, Yuanchun
    [J]. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 2018, 2 (04)