Voxel RCNN-HA: A Point Cloud Multiobject Detection Algorithm With Hybrid Anchors for Autonomous Driving

被引：2

作者：

Wang, Hai ^{[1
]}

Tao, Le ^{[1
]}

Peng, Yiming ^{[1
]}

Chen, Zhiyu ^{[1
]}

Zhang, Yong ^{[2
]}

机构：

[1] Jiangsu Univ, Sch Automot & Traff Engn, Zhenjiang 212013, Peoples R China

[2] Nanjing Forestry Univ, Sch Automot & Traff Engn, Nanjing 212001, Peoples R China

来源：

IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION | 2024年 / 10卷 / 03期

基金：

中国国家自然科学基金;

关键词：

Three-dimensional displays; Proposals; Prediction algorithms; Pedestrians; Feature extraction; Laser radar; Inference algorithms; 3-D object detection; anchor-based; anchor-free; Lidar; self-attention;

D O I：

10.1109/TTE.2023.3346375

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The 3-D object detection using Lidar becomes essential for subsequent vehicle decision-making and planning as part of an intelligent vehicle perception system. Voxel region convolutional neural network (RCNN) is a two-stage voxel-based 3-D object detection algorithm that is fast and accurate. However, the detection accuracy for specific categories is insufficient in complex traffic scenarios, and thus, we propose the Voxel RCNN-HA algorithm. First, in light of the shortcomings of Voxel RCNN in detecting pedestrians, a hybrid detection head is proposed to balance the advantages and disadvantages of anchor-based and anchor-free algorithms and significantly improve pedestrian detection performance while maintaining vehicle accuracy. Second, self-attention is introduced in the second stage of the algorithm and a Voxel region of interest (RoI) self-attention pooling module is developed to obtain both local and global features in RoI, which addresses the issue that the original Voxel RoI pooling module is challenging to obtain global features of large objects. On the one million scenes (ONCE) dataset, the proposed Voxel RCNN-HA achieves 66.79% mean average precision (mAP) and 11.7 frames per second (FPS), and outperforms both Voxel RCNN and CenterPoints in terms of detection accuracy. Additionally, experiments on the Waymo Open dataset and Custom-Rslidar dataset further validate the effectiveness and generalization of the proposed method.

引用

页码：7286 / 7296

页数：11

共 6 条

[1] Multi-object Detection Algorithm Based on Point Cloud for Autonomous Driving Scenarios
Tao, Le
Wang, Hai
Cai, Yingfeng
Chen, Long
Qiche Gongcheng/Automotive Engineering, 2024, 46 (07): : 1208 - 1218and1238
[2] Dynamic Multitarget Detection Algorithm of Voxel Point Cloud Fusion Based on PointRCNN
Luo, Xizhao
Zhou, Feng
Tao, Chongben
Yang, Anjia
Zhang, Peiyun
Chen, Yonghua
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (11) : 20707 - 20720
[3] F-PVNet: Frustum-Level 3-D Object Detection on Point-Voxel Feature Representation for Autonomous Driving
Tao, Chongben
Fu, Shiping
Wang, Chen
Luo, Xizhao
Li, Huayi
Gao, Zhen
Zhang, Zufeng
Zheng, Sifa
IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (09) : 8031 - 8045
[4] FP-RCNN: A Real-Time 3D Target Detection Model based on Multiple Foreground Point Sampling for Autonomous Driving
Xu, Guoqing
Xu, Xiaolong
Gao, Honghao
Xiao, Fu
MOBILE NETWORKS & APPLICATIONS, 2023, 28 (01) : 369 - 381
[5] FP-RCNN: A Real-Time 3D Target Detection Model based on Multiple Foreground Point Sampling for Autonomous Driving
Guoqing Xu
Xiaolong Xu
Honghao Gao
Fu Xiao
Mobile Networks and Applications, 2023, 28 : 369 - 381
[6] A Small-Object-Detection Algorithm Based on LiDAR Point-Cloud Clustering for Autonomous Vehicles
Duan, Zhibing
Shao, Jinju
Zhang, Meng
Zhang, Jinlei
Zhai, Zhipeng
SENSORS, 2024, 24 (16)

← 1 →