Voxel RCNN-HA: A Point Cloud Multiobject Detection Algorithm With Hybrid Anchors for Autonomous Driving

被引:2
|
作者
Wang, Hai [1 ]
Tao, Le [1 ]
Peng, Yiming [1 ]
Chen, Zhiyu [1 ]
Zhang, Yong [2 ]
机构
[1] Jiangsu Univ, Sch Automot & Traff Engn, Zhenjiang 212013, Peoples R China
[2] Nanjing Forestry Univ, Sch Automot & Traff Engn, Nanjing 212001, Peoples R China
来源
IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION | 2024年 / 10卷 / 03期
基金
中国国家自然科学基金;
关键词
Three-dimensional displays; Proposals; Prediction algorithms; Pedestrians; Feature extraction; Laser radar; Inference algorithms; 3-D object detection; anchor-based; anchor-free; Lidar; self-attention;
D O I
10.1109/TTE.2023.3346375
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The 3-D object detection using Lidar becomes essential for subsequent vehicle decision-making and planning as part of an intelligent vehicle perception system. Voxel region convolutional neural network (RCNN) is a two-stage voxel-based 3-D object detection algorithm that is fast and accurate. However, the detection accuracy for specific categories is insufficient in complex traffic scenarios, and thus, we propose the Voxel RCNN-HA algorithm. First, in light of the shortcomings of Voxel RCNN in detecting pedestrians, a hybrid detection head is proposed to balance the advantages and disadvantages of anchor-based and anchor-free algorithms and significantly improve pedestrian detection performance while maintaining vehicle accuracy. Second, self-attention is introduced in the second stage of the algorithm and a Voxel region of interest (RoI) self-attention pooling module is developed to obtain both local and global features in RoI, which addresses the issue that the original Voxel RoI pooling module is challenging to obtain global features of large objects. On the one million scenes (ONCE) dataset, the proposed Voxel RCNN-HA achieves 66.79% mean average precision (mAP) and 11.7 frames per second (FPS), and outperforms both Voxel RCNN and CenterPoints in terms of detection accuracy. Additionally, experiments on the Waymo Open dataset and Custom-Rslidar dataset further validate the effectiveness and generalization of the proposed method.
引用
收藏
页码:7286 / 7296
页数:11
相关论文
共 6 条
  • [1] Multi-object Detection Algorithm Based on Point Cloud for Autonomous Driving Scenarios
    Tao, Le
    Wang, Hai
    Cai, Yingfeng
    Chen, Long
    Qiche Gongcheng/Automotive Engineering, 2024, 46 (07): : 1208 - 1218and1238
  • [2] Dynamic Multitarget Detection Algorithm of Voxel Point Cloud Fusion Based on PointRCNN
    Luo, Xizhao
    Zhou, Feng
    Tao, Chongben
    Yang, Anjia
    Zhang, Peiyun
    Chen, Yonghua
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 20707 - 20720
  • [3] F-PVNet: Frustum-Level 3-D Object Detection on Point-Voxel Feature Representation for Autonomous Driving
    Tao, Chongben
    Fu, Shiping
    Wang, Chen
    Luo, Xizhao
    Li, Huayi
    Gao, Zhen
    Zhang, Zufeng
    Zheng, Sifa
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (09) : 8031 - 8045
  • [4] FP-RCNN: A Real-Time 3D Target Detection Model based on Multiple Foreground Point Sampling for Autonomous Driving
    Xu, Guoqing
    Xu, Xiaolong
    Gao, Honghao
    Xiao, Fu
    MOBILE NETWORKS & APPLICATIONS, 2023, 28 (01) : 369 - 381
  • [5] FP-RCNN: A Real-Time 3D Target Detection Model based on Multiple Foreground Point Sampling for Autonomous Driving
    Guoqing Xu
    Xiaolong Xu
    Honghao Gao
    Fu Xiao
    Mobile Networks and Applications, 2023, 28 : 369 - 381
  • [6] A Small-Object-Detection Algorithm Based on LiDAR Point-Cloud Clustering for Autonomous Vehicles
    Duan, Zhibing
    Shao, Jinju
    Zhang, Meng
    Zhang, Jinlei
    Zhai, Zhipeng
    SENSORS, 2024, 24 (16)