Fade3D: Fast and Deployable 3D Object Detection for Autonomous Driving

被引:0
作者
Ye, Wei [1 ,2 ]
Xia, Qiming [1 ,2 ]
Wu, Hai [1 ,2 ]
Dong, Zhen [3 ]
Zhong, Ruofei [4 ]
Wang, Cheng [1 ,2 ]
Wen, Chenglu [1 ,2 ]
机构
[1] Xiamen Univ, Fujian Key Lab Sensing & Comp Smart Cities, Xiamen 361005, Peoples R China
[2] Xiamen Univ, Sch Informat, Xiamen 361005, Peoples R China
[3] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & Re, Wuhan 430079, Peoples R China
[4] Capital Normal Univ, Coll Resource Environm & Tourism, Beijing 100048, Peoples R China
基金
中国国家自然科学基金;
关键词
3D object detection; point cloud; real-time inference; outdoor scene perception; autonomous driving;
D O I
10.1109/TITS.2025.3568418
中图分类号
TU [建筑科学];
学科分类号
0813 ;
摘要
3D object detection is an essential scene perception capability for autonomous vehicles. In intelligent transportation systems, autonomous vehicles require minimal inference latency to sense their surroundings in real-time. However, advanced 3D detection methods often suffer from high inference latency. This limits the real-time deployment of 3D detection models in the real world. To address this problem, this paper proposes a fast and deployable 3D object detection method from the LiDAR point cloud for autonomous driving, named Fade3D. Firstly, we propose a Lightweight Input Encoder (LIE) to extract the most critical features from point clouds. Then, we develop a Spatial Feature Enhancement BEV backbone (SFENet) that efficiently encodes geometry features into compact representations. Additionally, we design an IoU-aware Loss Re-weighting (ILR) that enhances performance by shifting more attention to hard samples. Leveraging LIE and SFENet, our approach is independent of point cloud density and number, achieving significant speed advantages in processing large-scale point clouds and being deployment-friendly. Extensive experiments on KITTI and Waymo Open Dataset (WOD) datasets comparing various baseline detectors demonstrate its universality and superiority. Specifically, our method demonstrates impressive real-time inference capabilities, achieving 51.5 Hz on an RTX3090 GPU and 12.4 Hz on a Jetson Orin embedded development board. Code will be available at https://github.com/wayyeah/Fade3D
引用
收藏
页数:13
相关论文
共 80 条
[61]   CasA: A Cascade Attention Network for 3-D Object Detection From LiDAR Point Clouds [J].
Wu, Hai ;
Deng, Jinhao ;
Wen, Chenglu ;
Li, Xin ;
Wang, Cheng ;
Li, Jonathan .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[62]   HINTED: Hard Instance Enhanced Detector with Mixed-Density Feature Fusion for Sparsely-Supervised 3D Object Detection [J].
Xia, Qiming ;
Ye, Wei ;
Wu, Hai ;
Zhao, Shijia ;
Xing, Leyuan ;
Huang, Xun ;
Deng, Jinhao ;
Li, Xin ;
Wang, Chenglu ;
Wang, Cheng .
2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, :15321-15330
[63]   3-D HANet: A Flexible 3-D Heatmap Auxiliary Network for Object Detection [J].
Xia, Qiming ;
Chen, Yidong ;
Cai, Guorong ;
Chen, Guikun ;
Xie, Daoshun ;
Su, Jinhe ;
Wang, Zongyue .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
[64]   PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving [J].
Xiao, Pengchuan ;
Shao, Zhenlei ;
Hao, Steven ;
Zhang, Zishuo ;
Chai, Xiaolin ;
Jiao, Judy ;
Li, Zesong ;
Wu, Jian ;
Sun, Kai ;
Jiang, Kun ;
Wang, Yunlong ;
Yang, Diange .
2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, :3095-3101
[65]   SECOND: Sparsely Embedded Convolutional Detection [J].
Yan, Yan ;
Mao, Yuxing ;
Li, Bo .
SENSORS, 2018, 18 (10)
[66]  
Yang B, 2018, PR MACH LEARN RES, V87
[67]   PIXOR: Real-time 3D Object Detection from Point Clouds [J].
Yang, Bin ;
Luo, Wenjie ;
Urtasun, Raquel .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7652-7660
[68]  
Yang JK, 2022, ADV NEUR IN
[69]   3DSSD: Point-based 3D Single Stage Object Detector [J].
Yang, Zetong ;
Sun, Yanan ;
Liu, Shu ;
Jia, Jiaya .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :11037-11045
[70]   Center-based 3D Object Detection and Tracking [J].
Yin, Tianwei ;
Zhou, Xingyi ;
Krahenbuhl, Philipp .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11779-11788