3D MSSD: A multilayer spatial structure 3D object detection network for mobile LiDAR point clouds

被引:11
作者
Wang, Zongyue [1 ]
Xia, Qiming [1 ]
Du, Jing [1 ]
Huang, Shangfeng [1 ]
Su, Jinhe [1 ]
Marcato Junior, Jose [4 ]
Li, Jonathan [2 ,3 ]
Cai, Guorong [1 ]
机构
[1] Jimei Univ, Sch Comp Engn, Xiamen 361021, Peoples R China
[2] Univ Waterloo, Dept Geog & Environm Management, Waterloo, ON N2L 3G1, Canada
[3] Univ Waterloo, Dept Syst Design Engn, Waterloo, ON N2L 3G1, Canada
[4] Univ Fed Mato Grosso do Sul, Fac Engn Architecture & Urbanism & Geog, BR-79070900 Campo Grande, MS, Brazil
基金
中国国家自然科学基金;
关键词
Object detection; Autonomous driving; Point cloud; Multilayer spatial structure feature;
D O I
10.1016/j.jag.2021.102406
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Point cloud-based object detection is vital and essential for many real-world applications, such as autonomous driving and robot vision. The PointPillars model has achieved the efficient detection of objects in front of a vehicle. However, the algorithm does not consider the spatial structures semantic information stored in the threedimensional point cloud for a given spatial structure, thus leading to missed or false detections for objects with complex spatial structures or singular structures. We propose an approach based on PointPillars, which considers the spatial structure characteristics of 3D point clouds to enhance the detection accuracy. First, based on the specified range of the z-axis coordinates, the entire point cloud scene is divided into several layers so that the point cloud areas in the same height interval form one layer. Data from several layers are obtained. Second, the point clouds of several layers are processed with Pillar Feature Net to obtain several pseudoimages. Each pseudoimage represents the semantic information from the corresponding level of the point cloud. Third, the obtained pseudoimages from each level are merged with the pseudoimages of the entire scene to obtain a feature map with spatial structure characteristics. We apply a Region Proposal Network, and an object detection operator processes the feature map and obtains the result of object detection. Experiments show that the proposed method has a highly accurate detection effect for objects with complex spatial structures. In addition, the proposed method does not erroneously detect objects with similar semantic information after vertical dimension projection.
引用
收藏
页数:16
相关论文
共 34 条
[1]  
[Anonymous], 2010, ICML
[2]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[3]  
Chen YL, 2019, IEEE I CONF COMP VIS, P9774, DOI [10.1109/ICCV.2019.00987, 10.1109/iccv.2019.00987]
[4]  
Chenhang He, 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). Proceedings, P11870, DOI 10.1109/CVPR42600.2020.01189
[5]  
Everingham M., 2010, INT J COMPUT VISION, V88, P303, DOI DOI 10.1007/s11263-009-0275-4
[6]  
Geiger A, 2012, PROC CVPR IEEE, P3354, DOI 10.1109/CVPR.2012.6248074
[7]  
Ioffe S, 2015, PR MACH LEARN RES, V37, P448
[8]  
Ku J, 2018, IEEE INT C INT ROBOT, P5750, DOI 10.1109/IROS.2018.8594049
[9]   PointPillars: Fast Encoders for Object Detection from Point Clouds [J].
Lang, Alex H. ;
Vora, Sourabh ;
Caesar, Holger ;
Zhou, Lubing ;
Yang, Jiong ;
Beijbom, Oscar .
2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :12689-12697
[10]   Deep Continuous Fusion for Multi-sensor 3D Object Detection [J].
Liang, Ming ;
Yang, Bin ;
Wang, Shenlong ;
Urtasun, Raquel .
COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 :663-678