Image-to-Lidar Relational Distillation for Autonomous Driving Data

被引:0
|
作者
Mahmoud, Anas [1 ]
Harakeh, Ali [2 ]
Waslander, Steven [1 ]
机构
[1] Univ Toronto, Toronto, ON, Canada
[2] Mila Quebec AI Inst, Montreal, PQ, Canada
来源
COMPUTER VISION - ECCV 2024, PT LXII | 2025年 / 15120卷
关键词
D O I
10.1007/978-3-031-73033-7_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Pre-trained on extensive and diversemulti-modal datasets, 2D foundation models excel at addressing 2D tasks with little or no downstream supervision, owing to their robust representations. The emergence of 2D-to-3D distillation frameworks has extended these capabilities to 3D models. However, distilling 3D representations for autonomous driving datasets presents challenges like self-similarity, class imbalance, and point cloud sparsity, hindering the effectiveness of contrastive distillation, especially in zero-shot learning contexts. Whereas other methodologies, such as similarity-based distillation, enhance zero-shot performance, they tend to yield less discriminative representations, diminishing few-shot performance. We investigate the gap in structure between the 2Dand the 3D representations that result from state-of-the-art distillation frameworks and reveal a significant mismatch between the two. Additionally, we demonstrate that the observed structural gap is negatively correlated with the efficacy of the distilled representations on zero-shot and few-shot 3D semantic segmentation. To bridge this gap, we propose a relational distillation framework enforcing intra-modal and cross-modal constraints, resulting in distilled 3D representations that closely capture the structure of the 2D representation. This alignment significantly enhances 3D representation performance over those learned through contrastive distillation in zero-shot segmentation tasks. Furthermore, our relational loss consistently improves the quality of 3D representations in both in-distribution and out-of-distribution few-shot segmentation tasks, outperforming approaches that rely on the similarity loss.
引用
收藏
页码:459 / 475
页数:17
相关论文
共 50 条
  • [1] Image-to-Lidar Self-Supervised Distillation for Autonomous Driving Data
    Sautier, Corentin
    Puy, Gilles
    Gidaris, Spyros
    Boulch, Alexandre
    Bursuc, Andrei
    Marlet, Renaud
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 9881 - 9891
  • [2] Self-Distillation for Robust LiDAR Semantic Segmentation in Autonomous Driving
    Li, Jiale
    Dai, Hang
    Ding, Yong
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 659 - 676
  • [3] Investigation of Lidar Data for Autonomous Driving with an Electric Bus
    Feller, Christian
    Haböck, Ulrich
    Maier, Stefan
    Schwenninger, Jochen
    ATZ worldwide, 2019, 121 (02) : 54 - 59
  • [4] Enhanced Temporal Data Organization for LiDAR Data in Autonomous Driving Environments
    Kusenbach, Michael
    Luettel, Thorsten
    Wuensche, Hans-Joachim
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 2701 - 2706
  • [5] Utilizing CNNs for Object Detection with LiDAR Data for Autonomous Driving
    Ponnaganti, Vinay
    Moh, Melody
    Moh, Teng-Sheng
    PROCEEDINGS OF THE 2021 15TH INTERNATIONAL CONFERENCE ON UBIQUITOUS INFORMATION MANAGEMENT AND COMMUNICATION (IMCOM 2021), 2021,
  • [6] How to Build a Curb Dataset with LiDAR Data for Autonomous Driving
    Bai, Dongfeng
    Cao, Tongtong
    Guo, Jingming
    Liu, Bingbing
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2022), 2022, : 2576 - 2582
  • [7] Lidar sensors for autonomous driving
    Schleuning, David
    Droz, Pierre-yves
    HIGH-POWER DIODE LASER TECHNOLOGY XVIII, 2020, 11262
  • [8] Flash LiDAR for Autonomous Driving
    Lin, Chih-Ping
    2021 INTERNATIONAL SYMPOSIUM ON VLSI DESIGN, AUTOMATION AND TEST (VLSI-DAT), 2021,
  • [9] Virtual Lidar Sensor Intensity Data Modeling for Autonomous Driving Simulators
    Lee, Dong-Ju
    Im, Jiung
    Won, Jong-Hoon
    IEEE ACCESS, 2023, 11 : 120694 - 120706
  • [10] Poses as Queries: End-to-End Image-to-LiDAR Map Localization With Transformers
    Miao, Jinyu
    Jiang, Kun
    Wang, Yunlong
    Wen, Tuopu
    Xiao, Zhongyang
    Fu, Zheng
    Yang, Mengmeng
    Liu, Maolin
    Huang, Jin
    Zhong, Zhihua
    Yang, Diange
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (01) : 803 - 810