RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection

被引：129

作者：

Fan, Lue ^{[1
,3
,4
,6
]}

Xiong, Xuan ^{[2
]}

Wang, Feng ^{[2
]}

Wang, Naiyan ^{[2
]}

Zhang, Zhaoxiang ^{[1
,3
,4
,5
]}

机构：

[1] Chinese Acad Sci CASIA, Inst Automat, Beijing, Peoples R China

[2] TuSimple, Beijing, Peoples R China

[3] Univ Chinese Acad Sci UCAS, Beijing, Peoples R China

[4] Natl Lab Pattern Recognit NLPR, Beijing, Peoples R China

[5] HKISI CAS, Ctr Artificial Intelligence & Robot, Hong Kong, Peoples R China

[6] UCAS, Sch Future Technol, Beijing, Peoples R China

来源：

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/ICCV48922.2021.00291

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we propose an anchor-free single-stage LiDAR-based 3D object detector - RangeDet. The most notable difference with previous works is that our method is purely based on the range view representation. Compared with the commonly used voxelized or Bird's Eye View (BEV) representations, the range view representation is more compact and without quantization error. Although there are works adopting it for semantic segmentation, its performance in object detection is largely behind voxelized or BEV counterparts. We first analyze the existing range-view-based methods and find two issues overlooked by previous works: 1) the scale variation between nearby and far away objects; 2) the inconsistency between the 2D range image coordinates used in feature extraction and the 3D Cartesian coordinates used in output. Then we deliberately design three components to address these issues in our RangeDet. We test our RangeDet in the large-scale Waymo Open Dataset (WOD). Our best model achieves 72.9/75.9/65.8 3D AP on vehicle/pedestrian/cyclist. These results outperform other range-view-based methods by a large margin, and are overall comparable with the state-of-the-art multi-view-based methods.

引用

页码：2898 / 2907

页数：10

共 45 条

[1]

[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.01298

[2]

[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00910

[3]

[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00274

[4]

[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00985

[5]

[Anonymous], 2022, ternational

[6]

[Anonymous], 2016, Vehicle Detection from 3D Lidar Using Fully Convolutional Network

[7]

Bewley Alex, 2020, C ROB LEARN CORL

[8] Multi-View 3D Object Detection Network for Autonomous Driving [J].

Chen, Xiaozhi ;

Ma, Huimin ;

Wan, Ji ;

Li, Bo ;

Xia, Tian .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534

[9] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[10]

Deng Jiajun, 2021, VOXEL RCNN HIGH PERF

← 1 2 3 4 5 →