SMURF: Spatial Multi-Representation Fusion for 3D Object Detection With 4D Imaging Radar

Cited by: 16
Authors
Liu, Jianan [2 ]
Zhao, Qiuchi [1 ]
Xiong, Weiyi [1 ]
Huang, Tao [3 ]
Han, Qing-Long [4 ]
Zhu, Bing [1 ]
Affiliations
[1] Beihang Univ, Sch Automat Sci & Elect Engn, Beijing 100191, Peoples R China
[2] Vitalent Consulting, S-41761 Gothenburg, Sweden
[3] James Cook Univ, Coll Sci & Engn, Cairns, Qld 4878, Australia
[4] Swinburne Univ Technol, Sch Sci Comp & Engn Technol, Melbourne, Vic 3122, Australia
Source
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES | 2024, Vol. 9, Issue 1
Funding
Australian Research Council; National Natural Science Foundation of China;
Keywords
Radar; Radar imaging; Point cloud compression; Radar detection; Feature extraction; Three-dimensional displays; Object detection; 4D imaging radar; radar point cloud; kernel density estimation; multi-dimensional Gaussian mixture; 3D object detection; autonomous driving; MIMO RADAR; NETWORK; CNN;
DOI
10.1109/TIV.2023.3322729
CLC Classification
TP18 [Artificial Intelligence Theory];
Subject Classification Codes
081104; 0812; 0835; 1405;
Abstract
4D millimeter-wave (mmWave) radar is a promising technology for vehicle sensing due to its cost-effectiveness and operability in adverse weather conditions. However, the adoption of this technology has been hindered by the sparsity and noise of radar point cloud data. This article introduces spatial multi-representation fusion (SMURF), a novel approach to 3D object detection using a single 4D imaging radar. SMURF leverages multiple representations of radar detection points, including pillarization and density features of a multi-dimensional Gaussian mixture distribution obtained through kernel density estimation (KDE). KDE effectively mitigates measurement inaccuracy caused by the limited angular resolution and multi-path propagation of radar signals, and it helps alleviate point cloud sparsity by capturing density features. Experimental evaluations on the View-of-Delft (VoD) and TJ4DRadSet datasets demonstrate the effectiveness and generalization ability of SMURF, which outperforms recently proposed 4D imaging radar-based single-representation models. Moreover, while using 4D imaging radar only, SMURF still achieves performance comparable to the state-of-the-art method fusing 4D imaging radar and camera, with an increase of 1.22% in bird's-eye-view mean average precision on the TJ4DRadSet dataset and of 1.32% in 3D mean average precision over the entire annotated area of the VoD dataset. The proposed method also addresses the challenge of real-time detection, with an inference time of no more than 0.05 seconds for most scans on both datasets. This research highlights the benefits of 4D mmWave radar and provides a strong benchmark for subsequent work on 3D object detection with 4D imaging radar.
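The abstract's central technical idea is to complement pillarized point features with per-point density features computed by kernel density estimation over the radar point cloud, so that dense, mutually consistent detections stand out while isolated multi-path returns do not. The sketch below is a minimal illustration of that density-feature idea only, not the authors' implementation: the SciPy-based Gaussian kernel, the bandwidth value, and the simple feature concatenation are assumptions made here for clarity.

    # Hedged sketch (assumed, not from the paper): per-point density
    # features via a Gaussian-mixture KDE over a sparse radar point cloud.
    import numpy as np
    from scipy.stats import gaussian_kde

    def kde_density_features(points, bandwidth=0.5):
        # points: (N, 3) radar detections (x, y, z).
        # gaussian_kde places one Gaussian component on every detection,
        # i.e. it fits a multi-dimensional Gaussian mixture as described
        # in the abstract; the bandwidth of 0.5 is an arbitrary choice here.
        kde = gaussian_kde(points.T, bw_method=bandwidth)
        density = kde(points.T)  # mixture density evaluated at each point
        # Append the density as an extra feature channel: (N, 4).
        return np.hstack([points, density[:, None]])

    # Toy usage: noisy detections around two scatterers.
    rng = np.random.default_rng(0)
    cloud = np.vstack([rng.normal([5.0, 0.0, 1.0], 0.3, (60, 3)),
                       rng.normal([20.0, 4.0, 1.0], 0.8, (40, 3))])
    print(kde_density_features(cloud).shape)  # (100, 4)

Points inside dense clusters receive high density values while isolated returns receive low ones, which is the property the abstract credits with mitigating both sparsity and multi-path noise.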
Pages: 799-812
Page count: 14
Related Papers
50 in total
  • [21] Liu, Zhanwen; Cheng, Juanru; Fan, Jin; Lin, Shan; Wang, Yang; Zhao, Xiangmo. Multi-Modal Fusion Based on Depth Adaptive Mechanism for 3D Object Detection. IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27: 707-717.
  • [22] Fan, Lue; Yang, Yuxue; Wang, Feng; Wang, Naiyan; Zhang, Zhaoxiang. Super Sparse 3D Object Detection. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45(10): 12490-12505.
  • [23] Li, Jun; Zhang, Han; Wu, Zizhang; Xu, Tianhao. Radar-camera fusion for 3D object detection with aggregation transformer. APPLIED INTELLIGENCE, 2024, 54(21): 10627-10639.
  • [24] Wang, Chia-Hung; Chen, Hsueh-Wei; Chen, Yi; Hsiao, Pei-Yung; Fu, Li-Chen. VoPiFNet: Voxel-Pixel Fusion Network for Multi-Class 3D Object Detection. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25(08): 8527-8537.
  • [25] Wang, Xiyang; Fu, Chunyun; He, Jiawei; Huang, Mingguang; Meng, Ting; Zhang, Siyu; Zhou, Hangning; Xu, Ziyao; Zhang, Chi. A Multi-Modal Fusion-Based 3D Multi-Object Tracking Framework With Joint Detection. IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10(01): 532-539.
  • [26] Ai, Yunfeng; Yang, Xue; Song, Ruiqi; Cui, Chenglin; Li, Xinqing; Cheng, Qi; Tian, Bin; Chen, Long. LiDAR-Camera Fusion in Perspective View for 3D Object Detection in Surface Mine. IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2024, 9(02): 3721-3730.
  • [27] Alaba, Simegnew Yihunie; Ball, John E. Transformer-Based Optimized Multimodal Fusion for 3D Object Detection in Autonomous Driving. IEEE ACCESS, 2024, 12: 50165-50176.
  • [28] Xu, Shaoqing; Li, Fang; Song, Ziying; Fang, Jin; Wang, Sifen; Yang, Zhi-Xin. Multi-Sem Fusion: Multimodal Semantic Fusion for 3-D Object Detection. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2024, 62: 1-14.
  • [29] Qi, Heng; Shi, Peicheng; Liu, Zhiqiang; Yang, Aixi. TSF: Two-Stage Sequential Fusion for 3D Object Detection. IEEE SENSORS JOURNAL, 2022, 22(12): 12163-12172.
  • [30] Tong, Guofeng; Li, Zheng; Peng, Hao; Wang, Yaqi. Multi-Source Features Fusion Single Stage 3D Object Detection With Transformer. IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8(04): 2062-2069.