Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer

被引：0

作者：

Zhang, Han ^{[1
]}

Li, Jun ^{[1
]}

Tang, Rui ^{[2
]}

Shi, Zhiping ^{[1
]}

Bu, Aojie ^{[1
]}

机构：

[1] Capital Normal Univ, Informat Engn Coll, Beijing, Peoples R China

[2] ZongMu Technol, Comp Vis Percept Dept, Shanghai, Peoples R China

来源：

2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS | 2024年

关键词：

3D object detection; Knowledge distillation; Autonomous driving;

D O I：

10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00031

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In the context of the Internet of Things, where efficient and accurate perception is crucial. Monocular 3D detection has gained attention due to its cost-effectiveness. This paper introduces an efficient method for monocular 3D object detection, termed Multi-Scale Enhanced Depth Knowledge Distillation (MDKD). Our approach simplifies the teacher network, eliminating the need for extra modal data input while improving the student network's performance. Additionally, we present a Multi-Scale Depth Enhancement (MDE) module and a novel lightweight Squeeze-Excitation Former (SEFormer). Our method addresses the growing demand for precise object detection within IoT environments. Extensive experiments on the KITTI dataset validate our method's effectiveness.

引用

页码：38 / 43

页数：6

共 50 条

[41] DANet: Multi-scale UAV Target Detection with Dynamic Feature Perception and Scale-aware Knowledge Distillation
Fang, Houzhang
Liao, Zikai
Wang, Lu
Li, Qingshan
Chang, Yi
Yan, Luxin
Wang, Xuhua
PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2121 - 2130
[42] FCOS3Dformer: enhancing monocular 3D object detection through transformer-assisted fusion of depth information
Hao, Bingsen
Deng, Zhaoxue
Liu, Mingze
Liu, Can
International Journal of Vehicle Systems Modelling and Testing, 2024, 18 (03) : 228 - 244
[43] BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection
Li, Jianing
Lu, Ming
Liu, Jiaming
Guo, Yandong
Du, Yuan
Du, Li
Zhang, Shanghang
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2489 - 2498
[44] SMS-Net: Sparse multi-scale voxel feature aggregation network for LiDAR-based 3D object detection
Liu, Sheng
Huang, Wenhao
Cao, Yifeng
Li, Dingda
Chen, Shengyong
NEUROCOMPUTING, 2022, 501 : 555 - 565
[45] Dynamic distillation based multi-scale lightweight target detection
Kai Sun
Danjing Li
Multimedia Tools and Applications, 2025, 84 (12) : 10221 - 10239
[46] BSM-NET: multi-bandwidth, multi-scale and multi-modal fusion network for 3D object detection of 4D radar and LiDAR
Jiang, Tiezhen
Kang, Runjie
Li, Qingzhu
MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (03)
[47] Diversity Knowledge Distillation for LiDAR-Based 3-D Object Detection
Ning, Kanglin
Liu, Yanfei
Su, Yanzhao
Jiang, Ke
IEEE SENSORS JOURNAL, 2023, 23 (11) : 11181 - 11193
[48] Defect Detection in Freight Trains Using a Lightweight and Effective Multi-Scale Fusion Framework with Knowledge Distillation
Ma, Ziqin
Zhou, Shijie
Lin, Chunyu
ELECTRONICS, 2025, 14 (05):
[49] Leveraging front and side cues for occlusion handling in monocular 3D object detection
Yuying Song
Zecheng Li
Jingxuan Wu
Chunyi Song
Zhiwei Xu
The Visual Computer, 2024, 40 : 1757 - 1773
[50] MonoCAPE: Monocular 3D object detection with coordinate-aware position embeddings
Chen, Wenyu
Chen, Mu
Fang, Jian
Zhao, Huaici
Wang, Guogang
COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120

← 1 2 3 4 5 →