LEF: Late-to-Early Temporal Fusion for LiDAR 3D Object Detection

Cited by: 3
Authors
He, Tong [1]
Sun, Pei
Leng, Zhaoqi
Liu, Chenxi
Anguelov, Dragomir
Tan, Mingxing [1]
Affiliation
[1] Waymo LLC, Mountain View, CA 94043 USA
Source
2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS | 2023
DOI
10.1109/IROS55552.2023.10341958
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We propose a late-to-early recurrent feature fusion scheme for 3D object detection using temporal LiDAR point clouds. Our main motivation is fusing object-aware latent embeddings into the early stages of a 3D object detector. This feature fusion strategy enables the model to better capture the shapes and poses for challenging objects, compared with learning from raw points directly. Our method conducts late-to-early feature fusion in a recurrent manner. This is achieved by enforcing window-based attention blocks upon temporally calibrated and aligned sparse pillar tokens. Leveraging bird's eye view foreground pillar segmentation, we reduce the number of sparse history features that our model needs to fuse into its current frame by 10x. We also propose a stochastic-length FrameDrop training technique, which generalizes the model to variable frame lengths at inference for improved performance without retraining. We evaluate our method on the widely adopted Waymo Open Dataset and demonstrate improvement on 3D object detection against the baseline model, especially for the challenging category of large objects.
Pages: 1637-1644
Number of pages: 8