SurfaceNet: A Surface Focused Network for Pedestrian Detection and Segmentation in 3D Point Clouds

被引:0
作者
Zhang, Yongcong [1 ]
Chen, Minglin [2 ]
Ao, Sheng [1 ]
Zhang, Xing [1 ]
Guo, Yulan [1 ]
机构
[1] Sun Yat Sen Univ, Coll Elect & Commun Engn, Shenzhen, Peoples R China
[2] Chinese Acad Sci, Shenzhen Inst Adv Technol, Shenzhen, Peoples R China
来源
16TH IEEE INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION, ROBOTICS AND VISION (ICARCV 2020) | 2020年
基金
中国国家自然科学基金;
关键词
D O I
10.1109/icarcv50220.2020.9305379
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Pedestrian detection is an important problem for autonomous driving. It is still chanllenging to detect and segment pedestrians from point clouds. In this paper, we propose a method named SurfaceNet to detect and segment pedestrians from point clouds. Specifically, we propose a novel representation, named surface map, to represent a point cloud as a 2D pseudo-image. For pedestrian detection, the proposed method comprises of four modules: 1) a grid feature encoder that can processes arbitrary number of points within each grid; 2) a surface feature convolutional module that employs a set of 2D convolutional layers to extract high level features; 3) a view transform module that transforms features from front view to bird's eye view; and 4) an anchor-free 3D object detection head that produces rotated 3D bounding box predictions. For semantic segmentation, the 2D pseudo-image is used for semantic segmentation and the segmentation results are re-projected to the original point cloud to achieve point cloud segmentation. Experimental results on the KITTI dataset show that our method achieves promising performance on pedestrian detection and segmentation in point clouds.
引用
收藏
页码:874 / 879
页数:6
相关论文
共 25 条
  • [11] PointPillars: Fast Encoders for Object Detection from Point Clouds
    Lang, Alex H.
    Vora, Sourabh
    Caesar, Holger
    Zhou, Lubing
    Yang, Jiong
    Beijbom, Oscar
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 12689 - 12697
  • [12] Deep Projective 3D Semantic Segmentation
    Lawin, Felix Jaremo
    Danelljan, Martin
    Tosteberg, Patrik
    Bhat, Goutam
    Khan, Fahad Shahbaz
    Felsberg, Michael
    [J]. COMPUTER ANALYSIS OF IMAGES AND PATTERNS, 2017, 10424 : 95 - 107
  • [13] Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
  • [14] Paszke A., Automatic differentiation in Pytorch. 2017
  • [15] Qi CR, 2017, ADV NEUR IN, V30
  • [16] Frustum PointNets for 3D Object Detection from RGB-D Data
    Qi, Charles R.
    Liu, Wei
    Wu, Chenxia
    Su, Hao
    Guibas, Leonidas J.
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 918 - 927
  • [17] PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation
    Qi, Charles R.
    Su, Hao
    Mo, Kaichun
    Guibas, Leonidas J.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 77 - 85
  • [18] U-Net: Convolutional Networks for Biomedical Image Segmentation
    Ronneberger, Olaf
    Fischer, Philipp
    Brox, Thomas
    [J]. MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION, PT III, 2015, 9351 : 234 - 241
  • [19] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud
    Shi, Shaoshuai
    Wang, Xiaogang
    Li, Hongsheng
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 770 - 779
  • [20] Disentangling Monocular 3D Object Detection
    Simonelli, Andrea
    Bulo, Samuel Rota
    Porzi, Lorenzo
    Lopez-Antequera, Manuel
    Kontschieder, Peter
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 1991 - 1999