Center-Aware 3D Object Detection with Attention Mechanism Based on Roadside LiDAR

被引:7
|
作者
Shi, Haobo [1 ,2 ,3 ]
Hou, Dezao [1 ,2 ,3 ]
Li, Xiyao [1 ,2 ,3 ]
机构
[1] Minist Transport, Res Inst Highway, Beijing 100088, Peoples R China
[2] Key Lab Intelligent Transportat Technol & Transpor, Beijing 100088, Peoples R China
[3] Natl Intelligent Transport Syst Ctr Engn & Technol, Beijing 100088, Peoples R China
基金
中国国家自然科学基金;
关键词
vehicle-infrastructure cooperative autonomous driving; roadside 3D detection; LiDAR-based detection; central point representation; deformable attention;
D O I
10.3390/su15032628
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Infrastructure 3D Object Detection is a pivotal component of Vehicle-Infrastructure Cooperated Autonomous Driving (VICAD). As turning objects account for a high proportion of traffic at intersections, anchor-free representation in the bird's-eye view (BEV) is more suitable for roadside 3D detection. In this work, we propose CetrRoad, a simple yet effective center-aware detector with transformer-based detection head for roadside 3D object detection with single LiDAR (Light Detection and Ranging). CetrRoad firstly utilizes a voxel-based roadside LiDAR feature encoder module that voxelizes and projects the raw point cloud into BEV with dense feature representation, following a one-stage center proposal module that initializes center candidates of objects based on the top N points in the BEV target heatmap with unnormalized 2D Gaussian. Then, taking attending center proposals as query embedding, a detection head with multi-head self-attention and multi-scale multi-head deformable cross attention can refine and predict 3D bounding boxes for different classes moving/parked at the intersection. Extensive experiments and analyses demonstrate that our method achieves state-of-the-art performance on the DAIR-V2X-I benchmark with an acceptable training time cost, especially for Car and Cyclist. CetrRoad also reaches comparable results with the multi-modal fusion method for Pedestrian. An ablation study demonstrates that center-aware query as input can provide denser supervision than a purified feature map in the attention-based detection head. Moreover, we were able to intuitively observe that in complex traffic environment, our proposed model could produce more accurate 3D detection results than other compared methods with fewer false positives, which is helpful for other downstream VICAD tasks.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] 3D Object Detection with LiDAR Based on Multi-Attention Mechanism
    Cao, Jie
    Peng, Yiqiang
    Fan, Likang
    Mo, Lingfan
    Wang, Longfei
    LASER & OPTOELECTRONICS PROGRESS, 2025, 62 (04)
  • [2] 3D Object Detection with Fusion Point Attention Mechanism in LiDAR Point Cloud
    Liu Weili
    Zhu Deli
    Luo Huahao
    Li Yi
    ACTA PHOTONICA SINICA, 2023, 52 (09)
  • [3] LiDAR-Based Intensity-Aware Outdoor 3D Object Detection
    Naich, Ammar Yasir
    Carrion, Jesus Requena
    SENSORS, 2024, 24 (09)
  • [4] Density Awareness and Neighborhood Attention for LiDAR-Based 3D Object Detection
    Qian, Hanxiang
    Wu, Peng
    Sun, Xiaoyong
    Guo, Xiaojun
    Su, Shaojing
    PHOTONICS, 2022, 9 (11)
  • [5] BAFusion: Bidirectional Attention Fusion for 3D Object Detection Based on LiDAR and Camera
    Liu, Min
    Jia, Yuanjun
    Lyu, Youhao
    Dong, Qi
    Yang, Yanyu
    SENSORS, 2024, 24 (14)
  • [6] 3D Object Detection Based on LiDAR Data
    Sahba, Ramin
    Sahba, Amin
    Jamshidi, Mo
    Rad, Paul
    2019 IEEE 10TH ANNUAL UBIQUITOUS COMPUTING, ELECTRONICS & MOBILE COMMUNICATION CONFERENCE (UEMCON), 2019, : 511 - 514
  • [7] MonoGAE: Roadside Monocular 3D Object Detection With Ground-Aware Embeddings
    Yang, Lei
    Zhang, Xinyu
    Yu, Jiaxin
    Li, Jun
    Zhao, Tong
    Wang, Li
    Huang, Yi
    Zhang, Chuang
    Wang, Hong
    Li, Yiming
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 17587 - 17601
  • [8] Point Density-Aware Voxels for LiDAR 3D Object Detection
    Hu, Jordan S. K.
    Kuai, Tianshu
    Waslander, Steven L.
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8459 - 8468
  • [9] Anytime-Lidar: Deadline-aware 3D Object Detection
    Soyyigit, Ahmet
    Yao, Shuochao
    Yun, Heechul
    2022 IEEE 28TH INTERNATIONAL CONFERENCE ON EMBEDDED AND REAL-TIME COMPUTING SYSTEMS AND APPLICATIONS (RTCSA 2022), 2022, : 31 - 40
  • [10] Pattern-Aware Data Augmentation for LiDAR 3D Object Detection
    Hu, Jordan S. K.
    Was, Steven L.
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021, : 2703 - 2710