Image-to-Lidar Relational Distillation for Autonomous Driving Data

Cited by: 0
Authors
Mahmoud, Anas [1]
Harakeh, Ali [2]
Waslander, Steven [1]
Affiliations
[1] Univ Toronto, Toronto, ON, Canada
[2] Mila Quebec AI Inst, Montreal, PQ, Canada
Source
COMPUTER VISION - ECCV 2024, PT LXII | 2025 / Vol. 15120
DOI
10.1007/978-3-031-73033-7_26
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Pre-trained on extensive and diverse multi-modal datasets, 2D foundation models excel at addressing 2D tasks with little or no downstream supervision, owing to their robust representations. The emergence of 2D-to-3D distillation frameworks has extended these capabilities to 3D models. However, distilling 3D representations for autonomous driving datasets presents challenges like self-similarity, class imbalance, and point cloud sparsity, hindering the effectiveness of contrastive distillation, especially in zero-shot learning contexts. Whereas other methodologies, such as similarity-based distillation, enhance zero-shot performance, they tend to yield less discriminative representations, diminishing few-shot performance. We investigate the gap in structure between the 2D and the 3D representations that result from state-of-the-art distillation frameworks and reveal a significant mismatch between the two. Additionally, we demonstrate that the observed structural gap is negatively correlated with the efficacy of the distilled representations on zero-shot and few-shot 3D semantic segmentation. To bridge this gap, we propose a relational distillation framework enforcing intra-modal and cross-modal constraints, resulting in distilled 3D representations that closely capture the structure of the 2D representation. This alignment significantly enhances 3D representation performance over those learned through contrastive distillation in zero-shot segmentation tasks. Furthermore, our relational loss consistently improves the quality of 3D representations in both in-distribution and out-of-distribution few-shot segmentation tasks, outperforming approaches that rely on the similarity loss.
Pages: 459-475
Page count: 17
Related papers
50 in total
  • [31] 2.5D Layered Sub-Image LIDAR Maps for Autonomous Driving in Multilevel Environments
    Aldibaja, Mohammad
    Suganuma, Naoki
    Yanase, Ryo
    [J]. REMOTE SENSING, 2022, 14 (22)
  • [32] Deep Learning Inspired Object Consolidation Approaches Using LiDAR Data for Autonomous Driving: A Review
    Mekala, M. S.
    Park, Woongkyu
    Dhiman, Gaurav
    Srivastava, Gautam
    Park, Ju H.
    Jung, Ho-Youl
    [J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2022, 29 (05) : 2579 - 2599
  • [33] Deep Learning Inspired Object Consolidation Approaches Using LiDAR Data for Autonomous Driving: A Review
    M. S. Mekala
    Woongkyu Park
    Gaurav Dhiman
    Gautam Srivastava
    Ju H. Park
    Ho-Youl Jung
    [J]. Archives of Computational Methods in Engineering, 2022, 29 : 2579 - 2599
  • [34] Deep Learning for LiDAR Point Clouds in Autonomous Driving: A Review
    Li, Ying
    Ma, Lingfei
    Zhong, Zilong
    Liu, Fei
    Chapman, Michael A.
    Cao, Dongpu
    Li, Jonathan
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (08) : 3412 - 3432
  • [35] LiDAR-based Drivable Region Detection for Autonomous Driving
    Xue, Hanzhang
    Fu, Hao
    Ren, Ruike
    Zhang, Jintao
    Liu, Bokai
    Fan, Yiming
    Dai, Bin
    [J]. 2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 1110 - 1116
  • [36] Autonomous Driving Control Based on the Perception of a Lidar Sensor and Odometer
    Tsai, Jichiang
    Chang, Che-Cheng
    Ou, Yu-Cheng
    Sieh, Bing-Herng
    Ooi, Yee-Ming
    [J]. APPLIED SCIENCES-BASEL, 2022, 12 (15):
  • [37] Lidar Point Cloud Compression, Processing and Learning for Autonomous Driving
    Abbasi, Rashid
    Bashir, Ali Kashif
    Alyamani, Hasan J.
    Amin, Farhan
    Doh, Jaehyeok
    Chen, Jianwen
    [J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (01) : 962 - 979
  • [38] A LiDAR Multi-Object Detection Algorithm for Autonomous Driving
    Wang, Shuqi
    Chen, Meng
    [J]. APPLIED SCIENCES-BASEL, 2023, 13 (23):
  • [39] Exploring Adversarial Robustness of LiDAR Semantic Segmentation in Autonomous Driving
    Mahima, K. T. Yasas
    Perera, Asanka
    Anavatti, Sreenatha
    Garratt, Matt
    [J]. SENSORS, 2023, 23 (23)
  • [40] Future pseudo-LiDAR frame prediction for autonomous driving
    Xudong Huang
    Chunyu Lin
    Haojie Liu
    Lang Nie
    Yao Zhao
    [J]. Multimedia Systems, 2022, 28 : 1611 - 1620