PLS: UNSUPERVISED DOMAIN ADAPTATION FOR 3D OBJECT DETECTION VIA PSEUDO-LABEL SIZES

被引：0

作者：

Chen, Shijie ^{[1
]}

Wang, Rongquan ^{[1
]}

Li, Xin ^{[1
]}

Wu, Yuchen ^{[1
]}

Liu, Haizhuang ^{[1
]}

Chen, Jiansheng ^{[1
]}

Ma, Huimin ^{[1
]}

机构：

[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing 100083, Peoples R China

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024 | 2024年

基金：

中国国家自然科学基金;

关键词：

Unsupervised domain adaptation; 3D object detection; Pseudo label;

D O I：

10.1109/ICASSP48485.2024.10446579

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

3D object detection has gained increasing attention in modern autonomous driving systems. However, the performance of the detector significantly degrades during cross-domain deployment due to domain shift. The detector is inevitably biased towards its training dataset when employed on a target dataset, particularly towards object sizes. State-of-the-art unsupervised domain adaptation approaches explicitly address the variation in object sizes by appropriately scaling the source data. However, such methods require additional target domain statistics information, which contradicts the original unsupervised assumption. In this work, we present PLS, a novel unsupervised domain adaptation method for 3D object detection to overcome the object sizes bias via Pseudo-Label Sizes, which utilizes only source domain annotations. PLS alternates between generating high-quality pseudo-label sizes through the detector and model training with the pseudo-label sizes to scale and augment the source data. This iterative process enables the detector to be trained with augmented data that resembles the target domain sizes, thereby improving the performance of detector in cross-domain scenarios. Our experimental results show the outstanding performance of our PLS in various scenarios. In addition, PLS is a plug-and-play module that can be used to directly replace existing weakly-supervised scaling methods. Experimental results show that existing excellent architectures with PLS are able to achieve better performance, and making them completely unsupervised.

引用

页码：6370 / 6374

页数：5

共 22 条

[1] nuScenes: A multimodal dataset for autonomous driving [J].

Caesar, Holger ;

Bankiti, Varun ;

Lang, Alex H. ;

Vora, Sourabh ;

Liong, Venice Erin ;

Xu, Qiang ;

Krishnan, Anush ;

Pan, Yu ;

Baldan, Giancarlo ;

Beijbom, Oscar .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628

[2] Domain Adaptive Faster R-CNN for Object Detection in the Wild [J].

Chen, Yuhua ;

Li, Wen ;

Sakaridis, Christos ;

Dai, Dengxin ;

Van Gool, Luc .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :3339-3348

[3] Unsupervised Domain Adaptation for Person Re-Identification through Source-Guided Pseudo-Labeling [J].

Dubourvieux, Fabian ;

Audigier, Romaric ;

Loesch, Angelique ;

Ainouz, Samia ;

Canu, Stephane .

2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, :4957-4964

[4]

Ganin Yaroslav, 2015, PR MACH LEARN RES, V37, P1180, DOI DOI 10.48550/ARXIV.1409.7495

[5]

Geiger Andreas, 2012, IEEE C COMPUTER VISI, P3354

[6] Density-Insensitive Unsupervised Domain Adaption on 3D Object Detection [J].

Hu, Qianjiang ;

Liu, Daizong ;

Hu, Wei .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :17556-17566

[7]

Hurl B, 2019, IEEE INT VEH SYM, P2522, DOI 10.1109/IVS.2019.8813809

[8]

Kingsbury D, 2015, P1, DOI [10.48550/arXiv.1412.6980, DOI 10.1021/bk-2015-1214.ch001]

[9] PointPillars: Fast Encoders for Object Detection from Point Clouds [J].

Lang, Alex H. ;

Vora, Sourabh ;

Caesar, Holger ;

Zhou, Lubing ;

Yang, Jiong ;

Beijbom, Oscar .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12689-12697

[10] 3D-SSD: Learning hierarchical features from RGB-D images for amodal 3D object detection [J].

Luo, Qianhui ;

Ma, Huifang ;

Tang, Li ;

Wang, Yue ;

Xiong, Rong .

NEUROCOMPUTING, 2020, 378 :364-374

← 1 2 3 →