From Points to Parts: 3D Object Detection From Point Cloud With Part-Aware and Part-Aggregation Network

Citations: 615
Authors
Shi, Shaoshuai [1]
Wang, Zhe [2]
Shi, Jianping [2]
Wang, Xiaogang [1]
Li, Hongsheng [1]
Affiliations
[1] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China
[2] SenseTime Res, Beijing, Peoples R China
Keywords
3D object detection; point cloud; part location; LiDAR; convolutional neural network; autonomous driving
DOI
10.1109/TPAMI.2020.2977026
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
3D object detection from LiDAR point clouds is a challenging problem in 3D scene understanding and has many practical applications. In this paper, we extend our preliminary work PointRCNN to a novel and strong point-cloud-based 3D object detection framework, the part-aware and aggregation neural network (Part-A² net). The whole framework consists of the part-aware stage and the part-aggregation stage. First, the part-aware stage for the first time fully utilizes free-of-charge part supervisions derived from 3D ground-truth boxes to simultaneously predict high-quality 3D proposals and accurate intra-object part locations. The predicted intra-object part locations within the same proposal are grouped by our newly designed RoI-aware point cloud pooling module, which results in an effective representation to encode the geometry-specific features of each 3D proposal. Then the part-aggregation stage learns to re-score the box and refine the box location by exploring the spatial relationship of the pooled intra-object part locations. Extensive experiments are conducted to demonstrate the performance improvements from each component of our proposed framework. Our Part-A² net outperforms all existing 3D detection methods and achieves new state-of-the-art results on the KITTI 3D object detection dataset by utilizing only the LiDAR point cloud data.
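The abstract notes that part supervision comes for free from the 3D ground-truth boxes. One way to picture this is the minimal sketch below, which is not the authors' code: a point inside a box is moved into the box's own frame and its offset is normalized by the box size, giving a relative location in [0, 1] per axis. The function name `part_location`, the box layout `(cx, cy, cz, dx, dy, dz, yaw)`, and the choice of z as the vertical axis are assumptions made for illustration.

```python
import math


def part_location(point, box):
    """Return the normalized location of `point` inside `box`, or None.

    Hypothetical conventions (not taken from the paper):
      point = (x, y, z)
      box   = (cx, cy, cz, dx, dy, dz, yaw), where (cx, cy, cz) is the box
              center, (dx, dy, dz) are the sizes along the box's own axes,
              and yaw is the heading around the vertical z axis in radians.
    """
    px, py, pz = point
    cx, cy, cz, dx, dy, dz, yaw = box

    # Shift so the box center is the origin.
    tx, ty, tz = px - cx, py - cy, pz - cz

    # Rotate by -yaw so the box axes line up with the coordinate axes.
    c, s = math.cos(yaw), math.sin(yaw)
    lx = c * tx + s * ty
    ly = -s * tx + c * ty
    lz = tz

    # Scale by the box size and shift so the box spans [0, 1] on each axis.
    u, v, w = lx / dx + 0.5, ly / dy + 0.5, lz / dz + 0.5

    # Points outside the box are background and get no part label.
    if all(0.0 <= t <= 1.0 for t in (u, v, w)):
        return (u, v, w)
    return None


if __name__ == "__main__":
    box = (10.0, 0.0, 0.0, 4.0, 2.0, 2.0, 0.0)
    print(part_location((10.0, 0.0, 0.0), box))  # the box center
    print(part_location((20.0, 0.0, 0.0), box))  # a point outside the box
```

Under these assumptions the box center maps to (0.5, 0.5, 0.5) and any point outside the box receives no label, so every foreground point gets a regression target without extra annotation.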
Pages: 2647-2664
Page count: 18