SurfaceNet: A Surface Focused Network for Pedestrian Detection and Segmentation in 3D Point Clouds

被引：0

作者：

Zhang, Yongcong ^{[1
]}

Chen, Minglin ^{[2
]}

Ao, Sheng ^{[1
]}

Zhang, Xing ^{[1
]}

Guo, Yulan ^{[1
]}

机构：

[1] Sun Yat Sen Univ, Coll Elect & Commun Engn, Shenzhen, Peoples R China

[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China

来源：

16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020) | 2020年

基金：

中国国家自然科学基金;

关键词：

D O I：

10.1109/icarcv50220.2020.9305379

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Pedestrian detection is an important problem for autonomous driving. It is still chanllenging to detect and segment pedestrians from point clouds. In this paper, we propose a method named SurfaceNet to detect and segment pedestrians from point clouds. Specifically, we propose a novel representation, named surface map, to represent a point cloud as a 2D pseudo-image. For pedestrian detection, the proposed method comprises of four modules: 1) a grid feature encoder that can processes arbitrary number of points within each grid; 2) a surface feature convolutional module that employs a set of 2D convolutional layers to extract high level features; 3) a view transform module that transforms features from front view to bird's eye view; and 4) an anchor-free 3D object detection head that produces rotated 3D bounding box predictions. For semantic segmentation, the 2D pseudo-image is used for semantic segmentation and the segmentation results are re-projected to the original point cloud to achieve point cloud segmentation. Experimental results on the KITTI dataset show that our method achieves promising performance on pedestrian detection and segmentation in point clouds.

引用

页码：874 / 879

页数：6

共 25 条

[11] PointPillars: Fast Encoders for Object Detection from Point Clouds
Lang, Alex H.
Vora, Sourabh
Caesar, Holger
Zhou, Lubing
Yang, Jiong
Beijbom, Oscar
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12689 - 12697
[12] Deep Projective 3D Semantic Segmentation
Lawin, Felix Jaremo
Danelljan, Martin
Tosteberg, Patrik
Bhat, Goutam
Khan, Fahad Shahbaz
Felsberg, Michael
[J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, 2017, 10424 : 95 - 107
[13] Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
[14] Paszke A., Automatic differentiation in Pytorch. 2017
[15] Qi CR, 2017, ADV NEUR IN, V30
[16] Frustum PointNets for 3D Object Detection from RGB-D Data
Qi, Charles R.
Liu, Wei
Wu, Chenxia
Su, Hao
Guibas, Leonidas J.
[J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 918 - 927
[17] PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
Qi, Charles R.
Su, Hao
Mo, Kaichun
Guibas, Leonidas J.
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 77 - 85
[18] U-Net: Convolutional Networks for Biomedical Image Segmentation
Ronneberger, Olaf
Fischer, Philipp
Brox, Thomas
[J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
[19] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud
Shi, Shaoshuai
Wang, Xiaogang
Li, Hongsheng
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 770 - 779
[20] Disentangling Monocular 3D Object Detection
Simonelli, Andrea
Bulo, Samuel Rota
Porzi, Lorenzo
Lopez-Antequera, Manuel
Kontschieder, Peter
[J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999

← 1 2 3 →