DeployFusion: A Deployable Monocular 3D Object Detection with Multi-Sensor Information Fusion in BEV for Edge Devices

被引:0
|
作者
Huang, Fei [1 ]
Liu, Shengshu [1 ]
Zhang, Guangqian [2 ]
Hao, Bingsen [3 ]
Xiang, Yangkai [3 ]
Yuan, Kun [3 ]
机构
[1] China Rd & Bridge Corp, Beijing 100010, Peoples R China
[2] Chongqing Seres Phoenix Intelligent Innovat Techno, Chongqing 400039, Peoples R China
[3] Chongqing Jiaotong Univ, Sch Mechatron & Vehicle Engn, Chongqing 400074, Peoples R China
关键词
multi-sensor information fusion; 3D object detection; BEV; feature fusion; model deployment;
D O I
10.3390/s24217007
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
To address the challenges of suboptimal remote detection and significant computational burden in existing multi-sensor information fusion 3D object detection methods, a novel approach based on Bird's-Eye View (BEV) is proposed. This method utilizes an enhanced lightweight EdgeNeXt feature extraction network, incorporating residual branches to address network degradation caused by the excessive depth of STDA encoding blocks. Meantime, deformable convolution is used to expand the receptive field and reduce computational complexity. The feature fusion module constructs a two-stage fusion network to optimize the fusion and alignment of multi-sensor features. This network aligns image features to supplement environmental information with point cloud features, thereby obtaining the final BEV features. Additionally, a Transformer decoder that emphasizes global spatial cues is employed to process the BEV feature sequence, enabling precise detection of distant small objects. Experimental results demonstrate that this method surpasses the baseline network, with improvements of 4.5% in the NuScenes detection score and 5.5% in average precision for detection objects. Finally, the model is converted and accelerated using TensorRT tools for deployment on mobile devices, achieving an inference time of 138 ms per frame on the Jetson Orin NX embedded platform, thus enabling real-time 3D object detection.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Fusion of multi-sensor passive and active 3D imagery
    Fay, DA
    Verly, JG
    Braun, MI
    Frost, C
    Racamato, JP
    Waxman, AM
    ENHANCED AND SYNTHETIC VISION 2001, 2001, 4363 : 219 - 230
  • [22] Multi-Sensor Data Fusion for Robotic 3D Target Detection in CPU Environment
    Lou, Jin
    Liu, Enbo
    Tang, Wei
    Zhang, Renyuan
    Computer Engineering and Applications, 2024, 60 (19) : 120 - 129
  • [23] Asynchronous Multi-Sensor Fusion for 3D Mapping and Localization
    Geneva, Patrick
    Eckenhoff, Kevin
    Huang, Guoquan
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018, : 5994 - 5999
  • [24] An optimal information fusion framework for multi-sensor object recognition
    van Dop, ER
    Regtien, PPL
    Korsten, MJ
    EUROSENSORS XII, VOLS 1 AND 2, 1998, : 1135 - 1138
  • [25] Image Fuzzy Edge Detection Algorithm Based on the Consideration of Multi-sensor Information Fusion
    Cai, Lili
    2019 11TH INTERNATIONAL CONFERENCE ON MEASURING TECHNOLOGY AND MECHATRONICS AUTOMATION (ICMTMA 2019), 2019, : 274 - 278
  • [26] SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection
    Xie, Yichen
    Xu, Chenfeng
    Rakotosaona, Marie-Julie
    Rim, Patrick
    Tombari, Federico
    Keutzer, Kurt
    Tomizuka, Masayoshi
    Zhan, Wei
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 17545 - 17556
  • [27] MULTI-SENSOR INFORMATION FUSION FOR STRUCTURAL DAMAGE DETECTION
    Bao, Yue-Quan
    Xia, Yong
    Li, Hui
    Xu, You-Lin
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL SYMPOSIUM ON STRUCTURAL ENGINEERING, VOL I AND II, 2010, : 1648 - 1653
  • [28] Multi-sensor fusion for robust localization with moving object segmentation in complex dynamic 3D scenes
    Li, Qipeng
    Zhuang, Yuan
    Huai, Jianzhu
    INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2023, 124
  • [29] Multi-View Joint Learning and BEV Feature-Fusion Network for 3D Object Detection
    Liu, Qunming
    Li, Xiaodong
    Zhang, Xiaofei
    Tan, Xiaojun
    Shi, Bodong
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [30] MonoDFNet: Monocular 3D Object Detection with Depth Fusion and Adaptive Optimization
    Gao, Yuhan
    Wang, Peng
    Li, Xiaoyan
    Sun, Mengyu
    Di, Ruohai
    Li, Liangliang
    Hong, Wei
    SENSORS, 2025, 25 (03)