LiDAR-Based All-Weather 3D Object Detection via Prompting and Distilling 4D Radar

被引：0

作者：

Chae, Yujeong ^{[1
]}

Kim, Hyeonseong ^{[1
]}

Oh, Changgyoon ^{[1
]}

Kim, Minseok ^{[1
]}

Yoon, Kuk-Jin ^{[1
]}

机构：

[1] Korea Adv Inst Sci & Technol, Visual Intelligence Lab, Daejeon, South Korea

来源：

COMPUTER VISION - ECCV 2024, PT LVI | 2025年 / 15114卷

基金：

新加坡国家研究基金会;

关键词：

LiDAR; 3D object detection; Normal/adverse; 4D radar; Knowledge distillation; 3D prompt learning; Autonomous driving;

D O I：

10.1007/978-3-031-72992-8_21

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

LiDAR-based 3D object detection models show remarkable performance, however their effectiveness diminishes in adverse weather. On the other hand, 4D radar exhibits strengths in adverse weather but faces limitations in standalone use. While fusing LiDAR and 4D radar seems to be the most intuitive approach, this method comes with limitations, including increased computational load due to radar pre-processing, situational constraints when both domain information is present, and the potential loss of sensor advantages through joint optimization. In this paper, we propose a novel LiDAR-only-based 3D object detection framework that works robustly in all-weather (normal and adverse) conditions. Specifically, we first propose 4D radar-based 3D prompt learning to inject auxiliary radar information into a LiDAR-based pre-trained 3D detection model while preserving the precise geometry capabilities of LiDAR. Subsequently, using the preceding model as a teacher, we distill weather-insensitive features and responses into a LiDAR-only student model through our four levels of inter-/intramodal knowledge distillation. Extensive experiments demonstrate that our prompt learning effectively integrates the strengths of LiDAR and 4D radar, and our LiDAR-only student model even surpasses the detection performance of teacher and state-of-the-art models under various weather conditions.

引用

页码：368 / 385

页数：18

共 56 条

[51] Conditional Prompt Learning for Vision-Language Models [J].

Zhou, Kaiyang ;

Yang, Jingkang ;

Loy, Chen Change ;

Liu, Ziwei .

2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :16795-16804

[52] Learning to Prompt for Vision-Language Models [J].

Zhou, Kaiyang ;

Yang, Jingkang ;

Loy, Chen Change ;

Liu, Ziwei .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2022, 130 (09) :2337-2348

[53] UniDistill: A Universal Cross-Modality Knowledge Distillation Framework for 3D Object Detection in Bird's-Eye View [J].

Zhou, Shengchao ;

Liu, Weizhou ;

Hu, Chen ;

Zhou, Shuchang ;

Ma, Chao .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, :5116-5125

[54] VoxelNet: End-to-End Learning for Point Cloud Based 3D Object Detection [J].

Zhou, Yin ;

Tuzel, Oncel .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4490-4499

[55] CenterFormer: Center-Based Transformer for 3D Object Detection [J].

Zhou, Zixiang ;

Zhao, Xiangchen ;

Wang, Yu ;

Wang, Panqu ;

Foroosh, Hassan .

COMPUTER VISION, ECCV 2022, PT XXXVIII, 2022, 13698 :496-513

[56] Visual Prompt Multi-Modal Tracking [J].

Zhu, Jiawen ;

Lai, Simiao ;

Chen, Xin ;

Wang, Dong ;

Lu, Huchuan .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :9516-9526

← 1 2 3 4 5 6 →