Cross-Modality Knowledge Distillation Network for Monocular 3D Object Detection

被引:50
作者
Hong, Yu [1 ]
Dai, Hang [2 ]
Ding, Yong [1 ]
机构
[1] Zhejiang Univ, Hangzhou, Peoples R China
[2] MBZUAI, Abu Dhabi, U Arab Emirates
来源
COMPUTER VISION, ECCV 2022, PT X | 2022年 / 13670卷
关键词
POINT;
D O I
10.1007/978-3-031-20080-9_6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Leveraging LiDAR-based detectors or real LiDAR point data to guide monocular 3D detection has brought significant improvement, e.g., Pseudo-LiDAR methods. However, the existing methods usually apply non-end-to-end training strategies and insufficiently leverage the LiDAR information, where the rich potential of the LiDAR data has not been well exploited. In this paper, we propose the Cross-Modality Knowledge Distillation (CMKD) network for monocular 3D detection to efficiently and directly transfer the knowledge from LiDAR modality to image modality on both features and responses. Moreover, we further extend CMKD as a semi-supervised training framework by distilling knowledge from large-scale unlabeled data and significantly boost the performance. Until submission, CMKD ranks 1st among the monocular 3D detectors with publications on both KITTI test set and Waymo val set with significant performance gains compared to previous state-of-the-art methods.
引用
收藏
页码:87 / 104
页数:18
相关论文
共 65 条
[1]   M3D-RPN: Monocular 3D Region Proposal Network for Object Detection [J].
Brazil, Garrick ;
Liu, Xiaoming .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9286-9295
[2]   nuScenes: A multimodal dataset for autonomous driving [J].
Caesar, Holger ;
Bankiti, Varun ;
Lang, Alex H. ;
Vora, Sourabh ;
Liong, Venice Erin ;
Xu, Qiang ;
Krishnan, Anush ;
Pan, Yu ;
Baldan, Giancarlo ;
Beijbom, Oscar .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628
[3]   MonoRUn: Monocular 3D Object Detection by Reconstruction and Uncertainty Propagation [J].
Chen, Hansheng ;
Huang, Yuyao ;
Tian, Wei ;
Gao, Zhong ;
Xiong, Lu .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :10374-10383
[4]  
Chen LB, 2017, IEEE INT SYMP NANO, P1, DOI 10.1109/NANOARCH.2017.8053709
[5]  
Chen XZ, 2015, ADV NEUR IN, V28
[6]   Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving [J].
Chen, Yi-Nan ;
Dai, Hang ;
Ding, Yong .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :877-887
[7]  
Chong Z., 2022, Monodistill: Learning spatial features for monocular 3d object detection
[8]   General Instance Distillation for Object Detection [J].
Dai, Xing ;
Jiang, Zeren ;
Wu, Zhao ;
Bao, Yiping ;
Wang, Zhicheng ;
Liu, Si ;
Zhou, Erjin .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :7838-7847
[9]  
Deng JJ, 2021, AAAI CONF ARTIF INTE, V35, P1201
[10]  
Ding M., 2020, CVPR, DOI DOI 10.1109/CVPRW50498.2020.00508