3D object detection based on fusion of image and point cloud in autonomous driving traffic scenarios

Cited by: 0

Authors:

Wu D. [1]

Zhao J. [2, 3]

Yu Z. [2]

Affiliations:

[1] School of Traffic and Transportation, Beijing Jiaotong University, Beijing

[2] School of Systems Science, Beijing Jiaotong University, Beijing

[3] Key Laboratory of Transport Industry of Big Data Application Technologies for Comprehensive Transport, Ministry of Transport, Beijing Jiaotong University, Beijing

Source:

Multimedia Tools and Applications | 2025 / Vol. 84 / No. 20

Funding:

National Natural Science Foundation of China

Keywords:

3D Object Detection; Autonomous Driving; Images; Intelligent Transportation; Point Cloud

DOI:

10.1007/s11042-024-19399-y

Abstract:

To improve the accuracy of 3D object detection in autonomous driving traffic scenarios, this paper proposes a 3D object detection method that fuses image and point cloud data and integrates a Feature Pyramid Network (FPN) structure with a frustum attention module. First, the 2D object detection results from the image are projected into the point cloud, and redundant points are trimmed to generate frustum 3D data that carries the semantic information of the image. Second, the sliding stride and height of the frustum sequence are linearly adjusted according to the distribution pattern of the point cloud within the frustum. Third, to improve detection accuracy for targets at different scales, a multi-scale 3D object detection module is constructed from the FPN and a fully convolutional network (FCN), strengthening the feature extraction ability of the detection model. Fourth, to suppress the impact of invalid frustum sequences on detection accuracy, frustum attention modules are incorporated into the detection model. Finally, experiments on the KITTI dataset show that the improved model raises vehicle detection accuracy by 0.88%, 1.53%, and 2.33%, pedestrian detection accuracy by 0.99%, 1.88%, and 0.10%, and cyclist detection accuracy by 1.18%, 3.08%, and 2.78% under the easy, medium, and difficult occlusion levels, respectively, effectively improving 3D object detection accuracy in autonomous driving traffic scenes. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.
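
The first two steps of the abstract (cropping the point cloud to a frustum defined by a 2D detection box, then sliding sections along the frustum) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `crop_frustum` and `slice_frustum`, the 3x4 projection matrix `P`, and the fixed `height` and `stride` values are all assumptions made for illustration. In particular, the abstract does not state the linear rule used to adjust stride and height, so the sketch simply takes them as given parameters.

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]         # (x, y, z) in camera coordinates, z is depth
Box2D = Tuple[float, float, float, float]  # (u_min, v_min, u_max, v_max) in pixels


def crop_frustum(P: Sequence[Sequence[float]],
                 points: List[Point],
                 box: Box2D) -> List[Point]:
    """Keep the points whose image projection falls inside the 2D box.

    P is an assumed 3x4 camera projection matrix (hypothetical input).
    """
    u0, v0, u1, v1 = box
    kept = []
    for x, y, z in points:
        # Homogeneous projection: [hu, hv, hw]^T = P @ [x, y, z, 1]^T
        hu, hv, hw = (r[0] * x + r[1] * y + r[2] * z + r[3] for r in P)
        if hw <= 0:
            continue  # point lies behind the camera
        u, v = hu / hw, hv / hw
        if u0 <= u <= u1 and v0 <= v <= v1:
            kept.append((x, y, z))
    return kept


def slice_frustum(points: List[Point],
                  d_min: float,
                  d_max: float,
                  height: float,
                  stride: float) -> List[List[Point]]:
    """Split frustum points into depth sections [start, start + height).

    Sections overlap whenever stride < height. Both values are fixed
    here; the paper adjusts them linearly by a rule the abstract omits.
    """
    sections = []
    i = 0
    while d_min + i * stride < d_max:
        start = d_min + i * stride
        end = start + height
        sections.append([p for p in points if start <= p[2] < end])
        i += 1
    return sections
```

Each section returned by `slice_frustum` would then be fed to a per-section feature extractor. Sections that contain few or no points correspond to the "invalid frustum sequences" that the abstract says the attention module is meant to suppress.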

Pages: 23259-23277

Page count: 18
