Out-of-distribution- and location-aware PointNets for real-time 3D road user detection without a GPU

Cited by: 0
Authors
Seppanen, Alvari [1, 2]
Alamikkotervo, Eerik [1]
Ojala, Risto [1]
Dario, Giacomo [1]
Tammi, Kari [1, 2]
Affiliations
[1] Aalto Univ, Espoo, Finland
[2] Helsinki Inst Phys, Helsinki, Finland
Keywords
Perception; Deep learning; Object detection; Limited computational resources
DOI
10.1186/s40537-023-00859-5
Chinese Library Classification (CLC) number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
3D road user detection is an essential task for autonomous vehicles and mobile robots, and it plays a key role in, for instance, obstacle avoidance and route planning. Existing detection solutions require expensive GPUs to run in real time. This paper presents a lightweight algorithm that runs in real time without a GPU. The algorithm combines a classical point cloud proposal generator with a modern deep learning technique to achieve a small computational requirement and accuracy comparable to the state of the art. Typical downsides of this approach, such as many out-of-distribution proposals and loss of location information, are examined, and solutions are proposed. We have evaluated the performance of the method on the KITTI dataset and on our own annotated dataset, collected with a compact mobile robot platform equipped with a low-resolution (16-channel) LiDAR. Our approach reaches real-time inference on a standard CPU, unlike other solutions in the literature. Furthermore, we achieve superior speed on a GPU, which indicates that our method has a high degree of parallelism. Our method enables low-cost mobile robots to detect road users in real time.
Pages: 19
Related papers
48 in total
[1]  
[Anonymous], 2006, Predicting Structured Data
[2]   BirdNet+: Two-Stage 3D Object Detection in LiDAR Through a Sparsity-Invariant Bird's Eye View [J].
Barrera, Alejandro ;
Beltran, Jorge ;
Guindel, Carlos ;
Iglesias, Jose Antonio ;
Garcia, Fernando .
IEEE ACCESS, 2021, 9 :160299-160316
[3]   Towards 3D LiDAR-based semantic scene understanding of 3D point cloud sequences: The SemanticKITTI Dataset [J].
Behley, Jens ;
Garbade, Martin ;
Milioto, Andres ;
Quenzel, Jan ;
Behnke, Sven ;
Gall, Juergen ;
Stachniss, Cyrill .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2021, 40 (8-9) :959-967
[4]  
Beltrán J, 2018, IEEE INT C INTELL TR, P3517, DOI 10.1109/ITSC.2018.8569311
[5]  
Bogoslavskyi I, 2016, 2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), P163, DOI 10.1109/IROS.2016.7759050
[6]   To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels [J].
Chai, Yuning ;
Sun, Pei ;
Ngiam, Jiquan ;
Wang, Weiyue ;
Caine, Benjamin ;
Vasudevan, Vijay ;
Zhang, Xiao ;
Anguelov, Dragomir .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :15995-16004
[7]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[8]   Neural Mean Discrepancy for Efficient Out-of-Distribution Detection [J].
Dong, Xin ;
Guo, Junfeng ;
Li, Ang ;
Ting, Wei-Te ;
Liu, Cong ;
Kung, H. T. .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :19195-19205
[9]   RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection [J].
Fan, Lue ;
Xiong, Xuan ;
Wang, Feng ;
Wang, Naiyan ;
Zhang, Zhaoxiang .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :2898-2907
[10]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074