Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer

被引:0
|
作者
Zhang, Han [1 ]
Li, Jun [1 ]
Tang, Rui [2 ]
Shi, Zhiping [1 ]
Bu, Aojie [1 ]
机构
[1] Capital Normal Univ, Informat Engn Coll, Beijing, Peoples R China
[2] ZongMu Technol, Comp Vis Percept Dept, Shanghai, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS | 2024年
关键词
3D object detection; Knowledge distillation; Autonomous driving;
D O I
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of the Internet of Things, where efficient and accurate perception is crucial. Monocular 3D detection has gained attention due to its cost-effectiveness. This paper introduces an efficient method for monocular 3D object detection, termed Multi-Scale Enhanced Depth Knowledge Distillation (MDKD). Our approach simplifies the teacher network, eliminating the need for extra modal data input while improving the student network's performance. Additionally, we present a Multi-Scale Depth Enhancement (MDE) module and a novel lightweight Squeeze-Excitation Former (SEFormer). Our method addresses the growing demand for precise object detection within IoT environments. Extensive experiments on the KITTI dataset validate our method's effectiveness.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [41] DANet: Multi-scale UAV Target Detection with Dynamic Feature Perception and Scale-aware Knowledge Distillation
    Fang, Houzhang
    Liao, Zikai
    Wang, Lu
    Li, Qingshan
    Chang, Yi
    Yan, Luxin
    Wang, Xuhua
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 2121 - 2130
  • [42] FCOS3Dformer: enhancing monocular 3D object detection through transformer-assisted fusion of depth information
    Hao, Bingsen
    Deng, Zhaoxue
    Liu, Mingze
    Liu, Can
    International Journal of Vehicle Systems Modelling and Testing, 2024, 18 (03) : 228 - 244
  • [43] BEV-LGKD: A Unified LiDAR-Guided Knowledge Distillation Framework for Multi-View BEV 3D Object Detection
    Li, Jianing
    Lu, Ming
    Liu, Jiaming
    Guo, Yandong
    Du, Yuan
    Du, Li
    Zhang, Shanghang
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9 (01): : 2489 - 2498
  • [44] SMS-Net: Sparse multi-scale voxel feature aggregation network for LiDAR-based 3D object detection
    Liu, Sheng
    Huang, Wenhao
    Cao, Yifeng
    Li, Dingda
    Chen, Shengyong
    NEUROCOMPUTING, 2022, 501 : 555 - 565
  • [45] Dynamic distillation based multi-scale lightweight target detection
    Kai Sun
    Danjing Li
    Multimedia Tools and Applications, 2025, 84 (12) : 10221 - 10239
  • [46] BSM-NET: multi-bandwidth, multi-scale and multi-modal fusion network for 3D object detection of 4D radar and LiDAR
    Jiang, Tiezhen
    Kang, Runjie
    Li, Qingzhu
    MEASUREMENT SCIENCE AND TECHNOLOGY, 2025, 36 (03)
  • [47] Diversity Knowledge Distillation for LiDAR-Based 3-D Object Detection
    Ning, Kanglin
    Liu, Yanfei
    Su, Yanzhao
    Jiang, Ke
    IEEE SENSORS JOURNAL, 2023, 23 (11) : 11181 - 11193
  • [48] Defect Detection in Freight Trains Using a Lightweight and Effective Multi-Scale Fusion Framework with Knowledge Distillation
    Ma, Ziqin
    Zhou, Shijie
    Lin, Chunyu
    ELECTRONICS, 2025, 14 (05):
  • [49] Leveraging front and side cues for occlusion handling in monocular 3D object detection
    Yuying Song
    Zecheng Li
    Jingxuan Wu
    Chunyi Song
    Zhiwei Xu
    The Visual Computer, 2024, 40 : 1757 - 1773
  • [50] MonoCAPE: Monocular 3D object detection with coordinate-aware position embeddings
    Chen, Wenyu
    Chen, Mu
    Fang, Jian
    Zhao, Huaici
    Wang, Guogang
    COMPUTERS & ELECTRICAL ENGINEERING, 2024, 120