DeployFusion: A Deployable Monocular 3D Object Detection with Multi-Sensor Information Fusion in BEV for Edge Devices

被引:0
|
作者
Huang, Fei [1 ]
Liu, Shengshu [1 ]
Zhang, Guangqian [2 ]
Hao, Bingsen [3 ]
Xiang, Yangkai [3 ]
Yuan, Kun [3 ]
机构
[1] China Rd & Bridge Corp, Beijing 100010, Peoples R China
[2] Chongqing Seres Phoenix Intelligent Innovat Techno, Chongqing 400039, Peoples R China
[3] Chongqing Jiaotong Univ, Sch Mechatron & Vehicle Engn, Chongqing 400074, Peoples R China
关键词
multi-sensor information fusion; 3D object detection; BEV; feature fusion; model deployment;
D O I
10.3390/s24217007
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
To address the challenges of suboptimal remote detection and significant computational burden in existing multi-sensor information fusion 3D object detection methods, a novel approach based on Bird's-Eye View (BEV) is proposed. This method utilizes an enhanced lightweight EdgeNeXt feature extraction network, incorporating residual branches to address network degradation caused by the excessive depth of STDA encoding blocks. Meantime, deformable convolution is used to expand the receptive field and reduce computational complexity. The feature fusion module constructs a two-stage fusion network to optimize the fusion and alignment of multi-sensor features. This network aligns image features to supplement environmental information with point cloud features, thereby obtaining the final BEV features. Additionally, a Transformer decoder that emphasizes global spatial cues is employed to process the BEV feature sequence, enabling precise detection of distant small objects. Experimental results demonstrate that this method surpasses the baseline network, with improvements of 4.5% in the NuScenes detection score and 5.5% in average precision for detection objects. Finally, the model is converted and accelerated using TensorRT tools for deployment on mobile devices, achieving an inference time of 138 ms per frame on the Jetson Orin NX embedded platform, thus enabling real-time 3D object detection.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Diffusion Model for Robust Multi-sensor Fusion in 3D Object Detection and BEV Segmentation
    Le, Duy-Tho
    Shi, Hengcan
    Cai, Jianfei
    Rezatofighi, Hamid
    COMPUTER VISION - ECCV 2024, PT XXXVII, 2025, 15095 : 232 - 249
  • [2] Multi-Sensor Fusion 3D Object Detection Based on Multi-Frame Information
    Wu S.
    Geng J.
    Wu C.
    Yan Z.
    Chen K.
    Beijing Ligong Daxue Xuebao/Transaction of Beijing Institute of Technology, 2023, 43 (12): : 1282 - 1289
  • [3] Multi-Task Multi-Sensor Fusion for 3D Object Detection
    Liang, Ming
    Yang, Bin
    Chen, Yun
    Hu, Rui
    Urtasun, Raquel
    2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 7337 - 7345
  • [4] Deep Continuous Fusion for Multi-sensor 3D Object Detection
    Liang, Ming
    Yang, Bin
    Wang, Shenlong
    Urtasun, Raquel
    COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 663 - 678
  • [5] Multi-Sensor Fusion Technology for 3D Object Detection in Autonomous Driving: A Review
    Wang, Xuan
    Li, Kaiqiang
    Chehri, Abdellah
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (02) : 1148 - 1165
  • [6] MCF3D: Multi-Stage Complementary Fusion for Multi-sensor 3D Object Detection
    Wang, Jiarong
    Zhu, Ming
    Sun, Deyao
    Wang, Bo
    Gao, Wen
    Wei, Hua
    IEEE ACCESS, 2019, 7 : 90801 - 90814
  • [7] Fast All-day 3D Object Detection Based on Multi-sensor Fusion
    Xiao, Liang
    Zhu, Qi
    Chen, Tongtong
    Zhao, Dawei
    Shang, Erke
    Nie, Yiming
    2023 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI, 2023, : 71 - 73
  • [8] AFTR: A Robustness Multi-Sensor Fusion Model for 3D Object Detection Based on Adaptive Fusion Transformer
    Zhang, Yan
    Liu, Kang
    Bao, Hong
    Qian, Xu
    Wang, Zihan
    Ye, Shiqing
    Wang, Weicen
    SENSORS, 2023, 23 (20)
  • [9] MonoMPV: Monocular 3D Object Detection With Multiple Projection Views on Edge Devices
    Deng, Zhaoxue
    Hao, Bingsen
    Liu, Guofang
    Li, Xingquan
    Wei, Hanbing
    Huang, Fei
    Liu, Shengshu
    IEEE ACCESS, 2024, 12 : 136599 - 136612
  • [10] Real-time Detection of 3D Objects Based on Multi-Sensor Information Fusion
    Xie D.
    Xu Y.
    Lu F.
    Pan S.
    Qiche Gongcheng/Automotive Engineering, 2022, 44 (03): : 340 - 349