Offboard 3D Object Detection from Point Cloud Sequences

被引：85

作者：

Qi, Charles R. ^{[1
]}

Zhou, Yin ^{[1
]}

Najibi, Mahyar ^{[1
]}

Sun, Pei ^{[1
]}

Khoa Vo ^{[1
]}

Deng, Boyang ^{[1
]}

Anguelov, Dragomir ^{[1
]}

机构：

[1] Waymo LLC, Mountain View, CA 94043 USA

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

关键词：

D O I：

10.1109/CVPR46437.2021.00607

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While current 3D object recognition research mostly focuses on the real-time, onboard scenario, there are many offboard use cases of perception that are largely under-explored, such as using machines to automatically generate high-quality 3D labels. Existing 3D object detectors fail to satisfy the high-quality requirement for offboard uses due to the limited input and speed constraints. In this paper, we propose a novel offboard 3D object detection pipeline using point cloud sequence data. Observing that different frames capture complementary views of objects, we design the offboard detector to make use of the temporal points through both multi frame object detection and novel object-centric refinement models. Evaluated on the Waymo Open Dataset, our pipeline named 3D Auto Labeling shows significant gains compared to the state-of-the-art onboard detectors and our offboard baselines. Its performance is even on par with human labels verified through a human label study. Further experiments demonstrate the application of auto labels for semi-supervised learning and provide extensive analysis to validate various design choices.

引用

页码：6130 / 6140

页数：11

共 72 条

[1] [Anonymous], 2017, COMMUN ACM, DOI DOI 10.1145/3065386
[2] [Anonymous], 2018, ARXIV181204244
[3] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
Behley, Jens
Garbade, Martin
Milioto, Andres
Quenzel, Jan
Behnke, Sven
Stachniss, Cyrill
Gall, Juergen
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
[4] Bewley A, 2020, RANGE CONDITIONED DI
[5] Annotating Object Instances with a Polygon-RNN
Castrejon, Lluis
Kundu, Kaustav
Urtasun, Raquel
Fidler, Sanja
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 4485 - 4493
[6] Chen X., 2017, P IEEE CVF C COMP VI, P1907, DOI [DOI 10.1109/CVPR.2017.691, 10.1109/CVPR.2017.691]
[7] Chen YL, 2019, IEEE I CONF COMP VIS, P9774, DOI [10.1109/ICCV.2019.00987, 10.1109/iccv.2019.00987]
[8] Chenhang He, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P11870, DOI 10.1109/CVPR42600.2020.01189
[9] Efficient Interactive Annotation of Segmentation Datasets with Polygon-RNN plus
Acuna, David
Ling, Huan
Kar, Amlan
Fidler, Sanja
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 859 - 868
[10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 6 7 8 →