RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection

被引:98
作者
Fan, Lue [1 ,3 ,4 ,6 ]
Xiong, Xuan [2 ]
Wang, Feng [2 ]
Wang, Naiyan [2 ]
Zhang, Zhaoxiang [1 ,3 ,4 ,5 ]
机构
[1] Chinese Acad Sci CASIA, Inst Automat, Beijing, Peoples R China
[2] TuSimple, Beijing, Peoples R China
[3] Univ Chinese Acad Sci UCAS, Beijing, Peoples R China
[4] Natl Lab Pattern Recognit NLPR, Beijing, Peoples R China
[5] HKISI CAS, Ctr Artificial Intelligence & Robot, Hong Kong, Peoples R China
[6] UCAS, Sch Future Technol, Beijing, Peoples R China
来源
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021) | 2021年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/ICCV48922.2021.00291
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose an anchor-free single-stage LiDAR-based 3D object detector - RangeDet. The most notable difference with previous works is that our method is purely based on the range view representation. Compared with the commonly used voxelized or Bird's Eye View (BEV) representations, the range view representation is more compact and without quantization error. Although there are works adopting it for semantic segmentation, its performance in object detection is largely behind voxelized or BEV counterparts. We first analyze the existing range-view-based methods and find two issues overlooked by previous works: 1) the scale variation between nearby and far away objects; 2) the inconsistency between the 2D range image coordinates used in feature extraction and the 3D Cartesian coordinates used in output. Then we deliberately design three components to address these issues in our RangeDet. We test our RangeDet in the large-scale Waymo Open Dataset (WOD). Our best model achieves 72.9/75.9/65.8 3D AP on vehicle/pedestrian/cyclist. These results outperform other range-view-based methods by a large margin, and are overall comparable with the state-of-the-art multi-view-based methods.
引用
收藏
页码:2898 / 2907
页数:10
相关论文
共 45 条
[1]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.01298
[2]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00910
[3]  
[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00274
[4]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00985
[5]  
[Anonymous], 2018, SALT LAKE UT US
[6]  
Bewley Alex, 2020, C ROB LEARN CORL
[7]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[8]  
Chenfeng Xu, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12373), P1, DOI 10.1007/978-3-030-58604-1_1
[9]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[10]  
Deng Jiajun, 2021, VOXEL RCNN HIGH PERF