Offboard 3D Object Detection from Point Cloud Sequences

被引:85
作者
Qi, Charles R. [1 ]
Zhou, Yin [1 ]
Najibi, Mahyar [1 ]
Sun, Pei [1 ]
Khoa Vo [1 ]
Deng, Boyang [1 ]
Anguelov, Dragomir [1 ]
机构
[1] Waymo LLC, Mountain View, CA 94043 USA
来源
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年
关键词
D O I
10.1109/CVPR46437.2021.00607
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While current 3D object recognition research mostly focuses on the real-time, onboard scenario, there are many offboard use cases of perception that are largely under-explored, such as using machines to automatically generate high-quality 3D labels. Existing 3D object detectors fail to satisfy the high-quality requirement for offboard uses due to the limited input and speed constraints. In this paper, we propose a novel offboard 3D object detection pipeline using point cloud sequence data. Observing that different frames capture complementary views of objects, we design the offboard detector to make use of the temporal points through both multi frame object detection and novel object-centric refinement models. Evaluated on the Waymo Open Dataset, our pipeline named 3D Auto Labeling shows significant gains compared to the state-of-the-art onboard detectors and our offboard baselines. Its performance is even on par with human labels verified through a human label study. Further experiments demonstrate the application of auto labels for semi-supervised learning and provide extensive analysis to validate various design choices.
引用
收藏
页码:6130 / 6140
页数:11
相关论文
共 72 条
  • [1] [Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
  • [2] [Anonymous], 2018, ARXIV181204244
  • [3] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
    Behley, Jens
    Garbade, Martin
    Milioto, Andres
    Quenzel, Jan
    Behnke, Sven
    Stachniss, Cyrill
    Gall, Juergen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
  • [4] Bewley A, 2020, RANGE CONDITIONED DI
  • [5] Annotating Object Instances with a Polygon-RNN
    Castrejon, Lluis
    Kundu, Kaustav
    Urtasun, Raquel
    Fidler, Sanja
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4485 - 4493
  • [6] Chen X., 2017, P IEEE CVF C COMP VI, P1907, DOI [DOI 10.1109/CVPR.2017.691, 10.1109/CVPR.2017.691]
  • [7] Chen YL, 2019, IEEE I CONF COMP VIS, P9774, DOI [10.1109/ICCV.2019.00987, 10.1109/iccv.2019.00987]
  • [8] Chenhang He, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P11870, DOI 10.1109/CVPR42600.2020.01189
  • [9] Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN plus
    Acuna, David
    Ling, Huan
    Kar, Amlan
    Fidler, Sanja
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 859 - 868
  • [10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848