Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer

被引：0

作者：

Zhang, Han ^{[1
]}

Li, Jun ^{[1
]}

Tang, Rui ^{[2
]}

Shi, Zhiping ^{[1
]}

Bu, Aojie ^{[1
]}

机构：

[1] Capital Normal Univ, Informat Engn Coll, Beijing, Peoples R China

[2] ZongMu Technol, Comp Vis Percept Dept, Shanghai, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS | 2024年

关键词：

3D object detection; Knowledge distillation; Autonomous driving;

D O I：

10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00031

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the context of the Internet of Things, where efficient and accurate perception is crucial. Monocular 3D detection has gained attention due to its cost-effectiveness. This paper introduces an efficient method for monocular 3D object detection, termed Multi-Scale Enhanced Depth Knowledge Distillation (MDKD). Our approach simplifies the teacher network, eliminating the need for extra modal data input while improving the student network's performance. Additionally, we present a Multi-Scale Depth Enhancement (MDE) module and a novel lightweight Squeeze-Excitation Former (SEFormer). Our method addresses the growing demand for precise object detection within IoT environments. Extensive experiments on the KITTI dataset validate our method's effectiveness.

引用

页码：38 / 43

页数：6

共 50 条

[21] Small Object Detection Based on Bidirectional Feature Fusion and Multi-scale Distillation
Wang, Lingyu
Zhou, Zijie
Shi, Guanqun
Guo, Junkang
Liu, Zhigang
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 200 - 214
[22] MS3D: A Multi-Scale Feature Fusion 3D Object Detection Method for Autonomous Driving Applications
Li, Ying
Zhuang, Wupeng
Yang, Guangsong
APPLIED SCIENCES-BASEL, 2024, 14 (22):
[23] Multi-Modal and Multi-Scale Fusion 3D Object Detection of 4D Radar and LiDAR for Autonomous Driving
Wang, Li
Zhang, Xinyu
Li, Jun
Xv, Baowei
Fu, Rong
Chen, Haifeng
Yang, Lei
Jin, Dafeng
Zhao, Lijun
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 5628 - 5641
[24] Shape-Aware Monocular 3D Object Detection
Chen, Wei
Zhao, Jie
Zhao, Wan-Lei
Wu, Song-Yuan
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
[25] Monocular 3D object detection for occluded targets based on spatial relationships and decoupled depth predictions
Gao, Yanfei
Miao, Xiongwei
Zhang, Guoye
FRONTIERS IN COMPUTER SCIENCE, 2025, 6
[26] Knowledge Fusion Distillation: Improving Distillation with Multi-scale Attention Mechanisms
Linfeng Li
Weixing Su
Fang Liu
Maowei He
Xiaodan Liang
Neural Processing Letters, 2023, 55 : 6165 - 6180
[27] Knowledge Fusion Distillation: Improving Distillation with Multi-scale Attention Mechanisms
Li, Linfeng
Su, Weixing
Liu, Fang
He, Maowei
Liang, Xiaodan
NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6165 - 6180
[28] Multi-Scale Distillation for Low Scanline Resolution Depth Completion
Huang, Junliang
Zheng, Haifeng
Feng, Xinxin
2024 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS, ICCCS 2024, 2024, : 854 - 859
[29] MonoSAID: Monocular 3D Object Detection based on Scene-Level Adaptive Instance Depth Estimation
Chenxing Xia
Wenjun Zhao
Huidan Han
Zhanpeng Tao
Bin Ge
Xiuju Gao
Kuan-Ching Li
Yan Zhang
Journal of Intelligent & Robotic Systems, 2024, 110
[30] MonoSAID: Monocular 3D Object Detection based on Scene-Level Adaptive Instance Depth Estimation
Xia, Chenxing
Zhao, Wenjun
Han, Huidan
Tao, Zhanpeng
Ge, Bin
Gao, Xiuju
Li, Kuan-Ching
Zhang, Yan
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)

← 1 2 3 4 5 →