MKD-Cooper: Cooperative 3D Object Detection for Autonomous Driving via Multi-Teacher Knowledge Distillation

被引:4
作者
Li, Zhiyuan [1 ,2 ]
Liang, Huawei [1 ,2 ,3 ,4 ]
Wang, Hanqi [1 ]
Zhao, Mingzhuo [5 ]
Wang, Jian [1 ,2 ]
Zheng, Xiaokun [1 ,2 ]
机构
[1] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Peoples R China
[2] Univ Sci & Technol China, Hefei 230026, Peoples R China
[3] Anhui Engn Lab Intelligent Driving Technol & Appl, Hefei 230031, Peoples R China
[4] Chinese Acad Sci, Innovat Res Inst Robot & Intelligent Mfg, Hefei 230031, Peoples R China
[5] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China
来源
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 01期
关键词
Three-dimensional displays; Object detection; Feature extraction; Solid modeling; Adaptation models; Point cloud compression; Aggregates; Cooperative perception; 3D object detection; autonomous driving; knowledge distillation; multiple teachers; MULTIOBJECT TRACKING;
D O I
10.1109/TIV.2023.3310580
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accurately detecting objects in 3D point clouds is critical for achieving precise scene understanding in autonomous driving systems. Cooperative perception, through information exchange among neighboring vehicles, can significantly improve object detection performance even under occlusion. This article proposes a novel cooperative perception framework based on multi-teacher knowledge distillation for 3D object detection, namely MKD-Cooper. First, we design a Collaborative Attention Fusion (CAF) module that dynamically captures inter-vehicle interactions through channel and spatial attention. By incorporating the CAF module into the CAF network, we effectively aggregate shared deep learning-based features from neighboring vehicles, resulting in a fused feature map that contains rich contextual information. Second, we propose an adaptive multi-teacher knowledge distillation method that adaptively assigns weights to different teacher models based on their current performance, effectively transferring valuable knowledge from multiple excellent teacher models to the student model. Experimental results on the OPV2V and V2XSim 2.0 datasets demonstrate that our method achieves state-of-the-art performance in detection accuracy while exhibiting excellent comprehensive performance between detection accuracy and efficiency. Moreover, field experiments in real urban environments further validate the effectiveness of our approach.
引用
收藏
页码:1490 / 1500
页数:11
相关论文
共 37 条
  • [1] Variational Information Distillation for Knowledge Transfer
    Ahn, Sungsoo
    Hu, Shell Xu
    Damianou, Andreas
    Lawrence, Neil D.
    Dai, Zhenwen
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9155 - 9163
  • [2] Sau BB, 2016, Arxiv, DOI arXiv:1610.09650
  • [3] F-Cooper: Feature based Cooperative Perception for Autonomous Vehicle Edge Computing System Using 3D Point Clouds
    Chen, Qi
    Ma, Xu
    Tang, Sihai
    Guo, Jingda
    Yang, Qing
    Fu, Song
    [J]. SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 88 - 100
  • [4] Cooper: Cooperative Perception for Connected Autonomous Vehicles based on 3D Point Clouds
    Chen, Qi
    Tang, Sihai
    Yang, Qing
    Fu, Song
    [J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 514 - 524
  • [5] Dosovitskiy A., 2017, P 1 ANN C ROB LEARN, V78, P1, DOI DOI 10.48550/ARXIV.1711.03938
  • [6] Fu H, 2021, AAAI CONF ARTIF INTE, V35, P12830
  • [7] Fast R-CNN
    Girshick, Ross
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
  • [8] Knowledge Distillation: A Survey
    Gou, Jianping
    Yu, Baosheng
    Maybank, Stephen J.
    Tao, Dacheng
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) : 1789 - 1819
  • [9] 3D Multi-Object Tracking With Adaptive Cubature Kalman Filter for Autonomous Driving
    Guo, Ge
    Zhao, Shijie
    [J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 512 - 519
  • [10] Hinton G, 2015, Arxiv, DOI arXiv:1503.02531