MDT3D: Multi-Dataset Training for LiDAR 3D Object Detection Generalization

被引:4
作者
Soum-Fontez, Louis [1 ]
Deschaud, Jean-Emmanuel [1 ]
Goulette, Francois [1 ,2 ]
机构
[1] PSL Univ, Mines Paris PSL, Ctr Robot, F-75006 Paris, France
[2] Inst Polytech Paris, U2IS, ENSTA Paris, F-91120 Palaiseau, France
来源
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023年
基金
欧盟地平线“2020”;
关键词
D O I
10.1109/IROS55552.2023.10341614
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Supervised 3D Object Detection models have been displaying increasingly better performance in single-domain cases where the training data comes from the same environment and sensor as the testing data. However, in real-world scenarios data from the target domain may not be available for finetuning or for domain adaptation methods. Indeed, 3D object detection models trained on a source dataset with a specific point distribution have shown difficulties in generalizing to unseen datasets. Therefore, we decided to leverage the information available from several annotated source datasets with our Multi-Dataset Training for 3D Object Detection (MDT3D) method to increase the robustness of 3D object detection models when tested in a new environment with a different sensor configuration. To tackle the labelling gap between datasets, we used a new label mapping based on coarse labels. Furthermore, we show how we managed the mix of datasets during training and finally introduce a new cross-dataset augmentation method: cross-dataset object injection. We demonstrate that this training paradigm shows improvements for different types of 3D object detection models. The source code and additional results for this research project will be publicly available on GitHub for interested parties to access and utilize: https://github.com/LouisSF/MDT3D
引用
收藏
页码:5765 / 5772
页数:8
相关论文
共 25 条
[1]   nuScenes: A multimodal dataset for autonomous driving [J].
Caesar, Holger ;
Bankiti, Varun ;
Lang, Alex H. ;
Vora, Sourabh ;
Liong, Venice Erin ;
Xu, Qiang ;
Krishnan, Anush ;
Pan, Yu ;
Baldan, Giancarlo ;
Beijbom, Oscar .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628
[2]  
Ganin Y, 2015, PR MACH LEARN RES, V37, P1180
[3]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[4]   3D-VField: Adversarial Augmentation of Point Clouds for Domain Generalization in 3D Object Detection [J].
Lehner, Alexander ;
Gasperini, Stefano ;
Marcos-Ramiro, Alvaro ;
Schmidt, Michael ;
Mahani, Mohammad-Ali Nikouei ;
Navab, Nassir ;
Busam, Benjamin ;
Tombari, Federico .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :17274-17283
[5]  
Mao Jiageng, 2021, One million scenes for autonomous driving: ONCE dataset, V1, P3
[6]  
O. D. Team, 2020, Openpcdet: An open-source toolbox for 3D object detection from point clouds
[7]  
Qi CR, 2017, ADV NEUR IN, V30
[8]  
Richter J., 2022, Understanding the domain gap in lidar object detection networks
[9]  
Sanchez J., 2022, Cola: Coarse label pretraining for 3d semantic segmentation of sparse lidar datasets
[10]  
Shi S., 2020, P IEEECVF C COMPUTER, p10 529