Unsupervised Domain Adaptation for 3D Object Detection via Self-Training

Cited by: 0

Authors:

Luo, Di [1]

Affiliations:

[1] Nankai Univ, Coll Comp Sci, Tianjin Key Lab Network & Data Secur Technol, Tianjin, Peoples R China

Source:

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT II | 2024 / Vol. 14426

Keywords:

Autonomous driving; Point cloud; 3D object detection; Unsupervised domain adaptation

DOI:

10.1007/978-981-99-8432-9_25

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104; 0812; 0835; 1405

Abstract:

3D object detection based on point clouds plays a crucial role in autonomous driving. High-quality detection results provide a reliable basis for subsequent stages such as trajectory prediction and path planning. Although many advanced 3D object detectors exist, applying them to a different domain often causes a large performance drop. In addition, existing domain adaptation methods for 3D object detection focus on only one or two observable causes of domain shift, such as scale mismatch and density variation, and do not take less visible factors (weather, road conditions, sensor type, etc.) into account. We therefore propose a self-training pipeline for unsupervised domain adaptation in 3D object detection. First, we pretrain the detectors with a specific data processing paradigm that includes random object scaling and random beam re-sampling, among other operations. Then, we employ a mean-teacher framework consisting of a cross-domain student model and a target-only teacher model. We apply adversarial learning in the student model, forcing it to learn domain-invariant features; this can further reduce the effect of the invisible factors that lead to domain shift. Furthermore, to obtain high-quality pseudo labels, we apply different data augmentation strategies to the student and teacher models and use mutual learning between them. In addition, we adopt domain statistics normalization to ensure stable training behavior. Extensive experiments on three different adaptation tasks demonstrate the effectiveness of our method.
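The teacher-student loop described in the abstract can be illustrated with a minimal sketch. It assumes the standard mean-teacher formulation, in which the teacher's weights are an exponential moving average (EMA) of the student's weights, and it assumes pseudo labels are selected by a simple confidence threshold. The abstract does not state the exact update rule or selection criterion, so the function names `ema_update` and `filter_pseudo_labels`, the decay of 0.999, and the threshold of 0.6 are all hypothetical choices for illustration, not the paper's settings.

```python
from typing import Dict, List, Tuple


def ema_update(
    teacher: Dict[str, float],
    student: Dict[str, float],
    decay: float = 0.999,  # assumed value, not taken from the paper
) -> Dict[str, float]:
    """Return new teacher weights: an EMA of the teacher and student weights."""
    return {
        name: decay * teacher[name] + (1.0 - decay) * student[name]
        for name in teacher
    }


def filter_pseudo_labels(
    predictions: List[Tuple[str, float]],
    threshold: float = 0.6,  # assumed value, not taken from the paper
) -> List[Tuple[str, float]]:
    """Keep only (class, confidence) predictions whose confidence reaches the threshold."""
    return [p for p in predictions if p[1] >= threshold]


if __name__ == "__main__":
    # Toy scalar "weights" stand in for real network parameters.
    teacher = {"w": 1.0}
    student = {"w": 0.0}
    teacher = ema_update(teacher, student, decay=0.9)
    print(teacher)

    # Teacher predictions on an unlabeled target-domain sample.
    preds = [("car", 0.92), ("car", 0.31), ("pedestrian", 0.75)]
    print(filter_pseudo_labels(preds))
```

In a real detector the dictionary of floats would be replaced by the network's parameter tensors, and the filtered predictions would supervise the student on target-domain point clouds.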


Pages: 307-318

Page count: 12
