Fade3D: Fast and Deployable 3D Object Detection for Autonomous Driving

被引：0

作者：

Ye, Wei ^{[1
,2
]}

Xia, Qiming ^{[1
,2
]}

Wu, Hai ^{[1
,2
]}

Dong, Zhen ^{[3
]}

Zhong, Ruofei ^{[4
]}

Wang, Cheng ^{[1
,2
]}

Wen, Chenglu ^{[1
,2
]}

机构：

[1] Xiamen Univ, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen 361005, Peoples R China

[2] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China

[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China

[4] Capital Normal Univ, Coll Resource Environm & Tourism, Beijing 100048, Peoples R China

来源：

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS | 2025年

基金：

中国国家自然科学基金;

关键词：

3D object detection; point cloud; real-time inference; outdoor scene perception; autonomous driving;

D O I：

10.1109/TITS.2025.3568418

中图分类号：

TU [建筑科学];

学科分类号：

0813 ;

摘要：

3D object detection is an essential scene perception capability for autonomous vehicles. In intelligent transportation systems, autonomous vehicles require minimal inference latency to sense their surroundings in real-time. However, advanced 3D detection methods often suffer from high inference latency. This limits the real-time deployment of 3D detection models in the real world. To address this problem, this paper proposes a fast and deployable 3D object detection method from the LiDAR point cloud for autonomous driving, named Fade3D. Firstly, we propose a Lightweight Input Encoder (LIE) to extract the most critical features from point clouds. Then, we develop a Spatial Feature Enhancement BEV backbone (SFENet) that efficiently encodes geometry features into compact representations. Additionally, we design an IoU-aware Loss Re-weighting (ILR) that enhances performance by shifting more attention to hard samples. Leveraging LIE and SFENet, our approach is independent of point cloud density and number, achieving significant speed advantages in processing large-scale point clouds and being deployment-friendly. Extensive experiments on KITTI and Waymo Open Dataset (WOD) datasets comparing various baseline detectors demonstrate its universality and superiority. Specifically, our method demonstrates impressive real-time inference capabilities, achieving 51.5 Hz on an RTX3090 GPU and 12.4 Hz on a Jetson Orin embedded development board. Code will be available at https://github.com/wayyeah/Fade3D

引用

页数：13

共 80 条

[1] RAD: Realtime and Accurate 3D Object Detection on Embedded Systems [J].

Aghdam, Hamed H. ;

Heravi, Elnaz J. ;

Demilew, Selameab S. ;

Laganiere, Robert .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021, 2021, :2869-2877

[2] YOLO3D: End-to-End Real-Time 3D Oriented Object Bounding Box Detection from LiDAR Point Cloud [J].

Ali, Waleed ;

Abdelkarim, Sherif ;

Zidan, Mahmoud ;

Zahran, Mohamed ;

El Sallab, Ahmad .

COMPUTER VISION - ECCV 2018 WORKSHOPS, PT III, 2019, 11131 :716-728

[3]

Barrera A, 2020, IEEE INT C INTELL TR

[4] nuScenes: A multimodal dataset for autonomous driving [J].

Caesar, Holger ;

Bankiti, Varun ;

Lang, Alex H. ;

Vora, Sourabh ;

Liong, Venice Erin ;

Xu, Qiang ;

Krishnan, Anush ;

Pan, Yu ;

Baldan, Giancarlo ;

Beijbom, Oscar .

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11618-11628

[5] Multi-View 3D Object Detection Network for Autonomous Driving [J].

Chen, Xiaozhi ;

Ma, Huimin ;

Wan, Ji ;

Li, Bo ;

Xia, Tian .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534

[6] LVP: Leverage Virtual Points in Multimodal Early Fusion for 3-D Object Detection [J].

Chen, Yidong ;

Cai, Guorong ;

Song, Ziying ;

Liu, Zhaoliang ;

Zeng, Binghui ;

Li, Jonathan ;

Wang, Zongyue .

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2025, 63

[7] VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking [J].

Chen, Yukang ;

Liu, Jianhui ;

Zhang, Xiangyu ;

Qi, Xiaojuan ;

Jia, Jiaya .

2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :21674-21683

[8]

Deng J., 2024, P EUR C COMP VIS SEP, P219

[9]

Deng JJ, 2021, AAAI CONF ARTIF INTE, V35, P1201

[10] RepVGG: Making VGG-style ConvNets Great Again [J].

Ding, Xiaohan ;

Zhang, Xiangyu ;

Ma, Ningning ;

Han, Jungong ;

Ding, Guiguang ;

Sun, Jian .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :13728-13737

← 1 2 3 4 5 6 7 8 →