MKD-Cooper: Cooperative 3D Object Detection for Autonomous Driving via Multi-Teacher Knowledge Distillation

被引：4

作者：

Li, Zhiyuan ^{[1
,2
]}

Liang, Huawei ^{[1
,2
,3
,4
]}

Wang, Hanqi ^{[1
]}

Zhao, Mingzhuo ^{[5
]}

Wang, Jian ^{[1
,2
]}

Zheng, Xiaokun ^{[1
,2
]}

机构：

[1] Chinese Acad Sci, Hefei Inst Phys Sci, Hefei 230031, Peoples R China

[2] Univ Sci & Technol China, Hefei 230026, Peoples R China

[3] Anhui Engn Lab Intelligent Driving Technol & Appl, Hefei 230031, Peoples R China

[4] Chinese Acad Sci, Innovat Res Inst Robot & Intelligent Mfg, Hefei 230031, Peoples R China

[5] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024年 / 9卷 / 01期

关键词：

Three-dimensional displays; Object detection; Feature extraction; Solid modeling; Adaptation models; Point cloud compression; Aggregates; Cooperative perception; 3D object detection; autonomous driving; knowledge distillation; multiple teachers; MULTIOBJECT TRACKING;

D O I：

10.1109/TIV.2023.3310580

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Accurately detecting objects in 3D point clouds is critical for achieving precise scene understanding in autonomous driving systems. Cooperative perception, through information exchange among neighboring vehicles, can significantly improve object detection performance even under occlusion. This article proposes a novel cooperative perception framework based on multi-teacher knowledge distillation for 3D object detection, namely MKD-Cooper. First, we design a Collaborative Attention Fusion (CAF) module that dynamically captures inter-vehicle interactions through channel and spatial attention. By incorporating the CAF module into the CAF network, we effectively aggregate shared deep learning-based features from neighboring vehicles, resulting in a fused feature map that contains rich contextual information. Second, we propose an adaptive multi-teacher knowledge distillation method that adaptively assigns weights to different teacher models based on their current performance, effectively transferring valuable knowledge from multiple excellent teacher models to the student model. Experimental results on the OPV2V and V2XSim 2.0 datasets demonstrate that our method achieves state-of-the-art performance in detection accuracy while exhibiting excellent comprehensive performance between detection accuracy and efficiency. Moreover, field experiments in real urban environments further validate the effectiveness of our approach.

引用

页码：1490 / 1500

页数：11

共 37 条

[1] Variational Information Distillation for Knowledge Transfer
Ahn, Sungsoo
Hu, Shell Xu
Damianou, Andreas
Lawrence, Neil D.
Dai, Zhenwen
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 9155 - 9163
[2] Sau BB, 2016, Arxiv, DOI arXiv:1610.09650
[3] F-Cooper: Feature based Cooperative Perception for Autonomous Vehicle Edge Computing System Using 3D Point Clouds
Chen, Qi
Ma, Xu
Tang, Sihai
Guo, Jingda
Yang, Qing
Fu, Song
[J]. SEC'19: PROCEEDINGS OF THE 4TH ACM/IEEE SYMPOSIUM ON EDGE COMPUTING, 2019, : 88 - 100
[4] Cooper: Cooperative Perception for Connected Autonomous Vehicles based on 3D Point Clouds
Chen, Qi
Tang, Sihai
Yang, Qing
Fu, Song
[J]. 2019 39TH IEEE INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2019), 2019, : 514 - 524
[5] Dosovitskiy A., 2017, P 1 ANN C ROB LEARN, V78, P1, DOI DOI 10.48550/ARXIV.1711.03938
[6] Fu H, 2021, AAAI CONF ARTIF INTE, V35, P12830
[7] Fast R-CNN
Girshick, Ross
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 1440 - 1448
[8] Knowledge Distillation: A Survey
Gou, Jianping
Yu, Baosheng
Maybank, Stephen J.
Tao, Dacheng
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (06) : 1789 - 1819
[9] 3D Multi-Object Tracking With Adaptive Cubature Kalman Filter for Autonomous Driving
Guo, Ge
Zhao, Shijie
[J]. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 512 - 519
[10] Hinton G, 2015, Arxiv, DOI arXiv:1503.02531

← 1 2 3 4 →