Multi-Scale Enhanced Depth Knowledge Distillation for Monocular 3D Object Detection with SEFormer

被引:0
|
作者
Zhang, Han [1 ]
Li, Jun [1 ]
Tang, Rui [2 ]
Shi, Zhiping [1 ]
Bu, Aojie [1 ]
机构
[1] Capital Normal Univ, Informat Engn Coll, Beijing, Peoples R China
[2] ZongMu Technol, Comp Vis Percept Dept, Shanghai, Peoples R China
来源
2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS | 2024年
关键词
3D object detection; Knowledge distillation; Autonomous driving;
D O I
10.1109/iThings-GreenCom-CPSCom-SmartData-Cybermatics60724.2023.00031
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the context of the Internet of Things, where efficient and accurate perception is crucial. Monocular 3D detection has gained attention due to its cost-effectiveness. This paper introduces an efficient method for monocular 3D object detection, termed Multi-Scale Enhanced Depth Knowledge Distillation (MDKD). Our approach simplifies the teacher network, eliminating the need for extra modal data input while improving the student network's performance. Additionally, we present a Multi-Scale Depth Enhancement (MDE) module and a novel lightweight Squeeze-Excitation Former (SEFormer). Our method addresses the growing demand for precise object detection within IoT environments. Extensive experiments on the KITTI dataset validate our method's effectiveness.
引用
收藏
页码:38 / 43
页数:6
相关论文
共 50 条
  • [21] Small Object Detection Based on Bidirectional Feature Fusion and Multi-scale Distillation
    Wang, Lingyu
    Zhou, Zijie
    Shi, Guanqun
    Guo, Junkang
    Liu, Zhigang
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING-ICANN 2024, PT II, 2024, 15017 : 200 - 214
  • [22] MS3D: A Multi-Scale Feature Fusion 3D Object Detection Method for Autonomous Driving Applications
    Li, Ying
    Zhuang, Wupeng
    Yang, Guangsong
    APPLIED SCIENCES-BASEL, 2024, 14 (22):
  • [23] Multi-Modal and Multi-Scale Fusion 3D Object Detection of 4D Radar and LiDAR for Autonomous Driving
    Wang, Li
    Zhang, Xinyu
    Li, Jun
    Xv, Baowei
    Fu, Rong
    Chen, Haifeng
    Yang, Lei
    Jin, Dafeng
    Zhao, Lijun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (05) : 5628 - 5641
  • [24] Shape-Aware Monocular 3D Object Detection
    Chen, Wei
    Zhao, Jie
    Zhao, Wan-Lei
    Wu, Song-Yuan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (06) : 6416 - 6424
  • [25] Monocular 3D object detection for occluded targets based on spatial relationships and decoupled depth predictions
    Gao, Yanfei
    Miao, Xiongwei
    Zhang, Guoye
    FRONTIERS IN COMPUTER SCIENCE, 2025, 6
  • [26] Knowledge Fusion Distillation: Improving Distillation with Multi-scale Attention Mechanisms
    Linfeng Li
    Weixing Su
    Fang Liu
    Maowei He
    Xiaodan Liang
    Neural Processing Letters, 2023, 55 : 6165 - 6180
  • [27] Knowledge Fusion Distillation: Improving Distillation with Multi-scale Attention Mechanisms
    Li, Linfeng
    Su, Weixing
    Liu, Fang
    He, Maowei
    Liang, Xiaodan
    NEURAL PROCESSING LETTERS, 2023, 55 (05) : 6165 - 6180
  • [28] Multi-Scale Distillation for Low Scanline Resolution Depth Completion
    Huang, Junliang
    Zheng, Haifeng
    Feng, Xinxin
    2024 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION SYSTEMS, ICCCS 2024, 2024, : 854 - 859
  • [29] MonoSAID: Monocular 3D Object Detection based on Scene-Level Adaptive Instance Depth Estimation
    Chenxing Xia
    Wenjun Zhao
    Huidan Han
    Zhanpeng Tao
    Bin Ge
    Xiuju Gao
    Kuan-Ching Li
    Yan Zhang
    Journal of Intelligent & Robotic Systems, 2024, 110
  • [30] MonoSAID: Monocular 3D Object Detection based on Scene-Level Adaptive Instance Depth Estimation
    Xia, Chenxing
    Zhao, Wenjun
    Han, Huidan
    Tao, Zhanpeng
    Ge, Bin
    Gao, Xiuju
    Li, Kuan-Ching
    Zhang, Yan
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2024, 110 (01)