Image-to-Lidar Relational Distillation for Autonomous Driving Data

Cited by: 0

Authors:

Mahmoud, Anas [1]

Harakeh, Ali [2]

Waslander, Steven [1]

Affiliations:

[1] Univ Toronto, Toronto, ON, Canada

[2] Mila Quebec AI Inst, Montreal, PQ, Canada

Source:

COMPUTER VISION - ECCV 2024, PT LXII | 2025 / Vol. 15120

Keywords:

DOI:

10.1007/978-3-031-73033-7_26

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Pre-trained on extensive and diverse multi-modal datasets, 2D foundation models excel at addressing 2D tasks with little or no downstream supervision, owing to their robust representations. The emergence of 2D-to-3D distillation frameworks has extended these capabilities to 3D models. However, distilling 3D representations for autonomous driving datasets presents challenges like self-similarity, class imbalance, and point cloud sparsity, hindering the effectiveness of contrastive distillation, especially in zero-shot learning contexts. Whereas other methodologies, such as similarity-based distillation, enhance zero-shot performance, they tend to yield less discriminative representations, diminishing few-shot performance. We investigate the gap in structure between the 2D and the 3D representations that result from state-of-the-art distillation frameworks and reveal a significant mismatch between the two. Additionally, we demonstrate that the observed structural gap is negatively correlated with the efficacy of the distilled representations on zero-shot and few-shot 3D semantic segmentation. To bridge this gap, we propose a relational distillation framework enforcing intra-modal and cross-modal constraints, resulting in distilled 3D representations that closely capture the structure of the 2D representation. This alignment significantly enhances 3D representation performance over those learned through contrastive distillation in zero-shot segmentation tasks. Furthermore, our relational loss consistently improves the quality of 3D representations in both in-distribution and out-of-distribution few-shot segmentation tasks, outperforming approaches that rely on the similarity loss.


Pages: 459-475

Page count: 17
