Center-Aware 3D Object Detection with Attention Mechanism Based on Roadside LiDAR

被引：7

作者：

Shi, Haobo ^{[1
,2
,3
]}

Hou, Dezao ^{[1
,2
,3
]}

Li, Xiyao ^{[1
,2
,3
]}

机构：

[1] Minist Transport, Res Inst Highway, Beijing 100088, Peoples R China

[2] Key Lab Intelligent Transportat Technol & Transpor, Beijing 100088, Peoples R China

[3] Natl Intelligent Transport Syst Ctr Engn & Technol, Beijing 100088, Peoples R China

来源：

SUSTAINABILITY | 2023年 / 15卷 / 03期

基金：

中国国家自然科学基金;

关键词：

vehicle-infrastructure cooperative autonomous driving; roadside 3D detection; LiDAR-based detection; central point representation; deformable attention;

D O I：

10.3390/su15032628

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Infrastructure 3D Object Detection is a pivotal component of Vehicle-Infrastructure Cooperated Autonomous Driving (VICAD). As turning objects account for a high proportion of traffic at intersections, anchor-free representation in the bird's-eye view (BEV) is more suitable for roadside 3D detection. In this work, we propose CetrRoad, a simple yet effective center-aware detector with transformer-based detection head for roadside 3D object detection with single LiDAR (Light Detection and Ranging). CetrRoad firstly utilizes a voxel-based roadside LiDAR feature encoder module that voxelizes and projects the raw point cloud into BEV with dense feature representation, following a one-stage center proposal module that initializes center candidates of objects based on the top N points in the BEV target heatmap with unnormalized 2D Gaussian. Then, taking attending center proposals as query embedding, a detection head with multi-head self-attention and multi-scale multi-head deformable cross attention can refine and predict 3D bounding boxes for different classes moving/parked at the intersection. Extensive experiments and analyses demonstrate that our method achieves state-of-the-art performance on the DAIR-V2X-I benchmark with an acceptable training time cost, especially for Car and Cyclist. CetrRoad also reaches comparable results with the multi-modal fusion method for Pedestrian. An ablation study demonstrates that center-aware query as input can provide denser supervision than a purified feature map in the attention-based detection head. Moreover, we were able to intuitively observe that in complex traffic environment, our proposed model could produce more accurate 3D detection results than other compared methods with fewer false positives, which is helpful for other downstream VICAD tasks.

引用

页数：19

共 50 条

[1] 3D Object Detection with LiDAR Based on Multi-Attention Mechanism
Cao, Jie
Peng, Yiqiang
Fan, Likang
Mo, Lingfan
Wang, Longfei
LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
[2] 3D Object Detection with Fusion Point Attention Mechanism in LiDAR Point Cloud
Liu Weili
Zhu Deli
Luo Huahao
Li Yi
ACTA PHOTONICA SINICA, 2023, 52 (09)
[3] LiDAR-Based Intensity-Aware Outdoor 3D Object Detection
Naich, Ammar Yasir
Carrion, Jesus Requena
SENSORS, 2024, 24 (09)
[4] Density Awareness and Neighborhood Attention for LiDAR-Based 3D Object Detection
Qian, Hanxiang
Wu, Peng
Sun, Xiaoyong
Guo, Xiaojun
Su, Shaojing
PHOTONICS, 2022, 9 (11)
[5] BAFusion: Bidirectional Attention Fusion for 3D Object Detection Based on LiDAR and Camera
Liu, Min
Jia, Yuanjun
Lyu, Youhao
Dong, Qi
Yang, Yanyu
SENSORS, 2024, 24 (14)
[6] 3D Object Detection Based on LiDAR Data
Sahba, Ramin
Sahba, Amin
Jamshidi, Mo
Rad, Paul
2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2019, : 511 - 514
[7] MonoGAE: Roadside Monocular 3D Object Detection With Ground-Aware Embeddings
Yang, Lei
Zhang, Xinyu
Yu, Jiaxin
Li, Jun
Zhao, Tong
Wang, Li
Huang, Yi
Zhang, Chuang
Wang, Hong
Li, Yiming
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17587 - 17601
[8] Point Density-Aware Voxels for LiDAR 3D Object Detection
Hu, Jordan S. K.
Kuai, Tianshu
Waslander, Steven L.
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8459 - 8468
[9] Anytime-Lidar: Deadline-aware 3D Object Detection
Soyyigit, Ahmet
Yao, Shuochao
Yun, Heechul
2022 IEEE 28TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA 2022), 2022, : 31 - 40
[10] Pattern-Aware Data Augmentation for LiDAR 3D Object Detection
Hu, Jordan S. K.
Was, Steven L.
2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2703 - 2710

← 1 2 3 4 5 →