From Points to Parts: 3D Object Detection From Point Cloud With Part-Aware and Part-Aggregation Network

被引：615

作者：

Shi, Shaoshuai ^{[1
]}

Wang, Zhe ^{[2
]}

Shi, Jianping ^{[2
]}

Wang, Xiaogang ^{[1
]}

Li, Hongsheng ^{[1
]}

机构：

[1] Chinese Univ Hong Kong, Dept Elect Engn, Hong Kong, Peoples R China

[2] SenseTime Res, Beijing, Peoples R China

来源：

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2021年 / 43卷 / 08期

关键词：

3D object detection; point cloud; part location; LiDAR; convolutional neural network; autonomous driving;

D O I：

10.1109/TPAMI.2020.2977026

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D object detection from LiDAR point cloud is a challenging problem in 3D scene understanding and has many practical applications. In this paper, we extend our preliminary work PointRCNN to a novel and strong point-cloud-based 3D object detection framework, the part-aware and aggregation neural network (Part-A(2) net). The whole framework consists of the part-aware stage and the part-aggregation stage. First, the part-aware stage for the first time fully utilizes free-of-charge part supervisions derived from 3D ground-truth boxes to simultaneously predict high quality 3D proposals and accurate intra-object part locations. The predicted intra-object part locations within the same proposal are grouped by our new-designed RoI-aware point cloud pooling module, which results in an effective representation to encode the geometry-specific features of each 3D proposal. Then the part-aggregation stage learns to re-score the box and refine the box location by exploring the spatial relationship of the pooled intra-object part locations. Extensive experiments are conducted to demonstrate the performance improvements from each component of our proposed framework. Our Part-A(2) net outperforms all existing 3D detection methods and achieves new state-of-the-art on KITTI 3D object detection dataset by utilizing only the LiDAR point cloud data.

引用

页码：2647 / 2664

页数：18

共 63 条

[51] Associatively Segmenting Instances and Semantics in Point Clouds [J].

Wang, Xinlong ;

Liu, Shu ;

Shen, Xiaoyong ;

Shen, Chunhua ;

Jia, Jiaya .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :4091-4100

[52] Pseudo-LiDAR from Visual Depth Estimation: Bridging the Gap in 3D Object Detection for Autonomous Driving [J].

Wang, Yan ;

Chao, Wei-Lun ;

Garg, Divyansh ;

Hariharan, Bharath ;

Campbell, Mark ;

Weinberger, Kilian Q. .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8437-8445

[53]

Wang ZX, 2019, IEEE INT C INT ROBOT, P1742, DOI [10.1109/IROS40897.2019.8968513, 10.1109/iros40897.2019.8968513]

[54] PointConv: Deep Convolutional Networks on 3D Point Clouds [J].

Wu, Wenxuan ;

Qi, Zhongang ;

Li Fuxin .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :9613-9622

[55] PointFusion: Deep Sensor Fusion for 3D Bounding Box Estimation [J].

Xu, Danfei ;

Anguelov, Dragomir ;

Jain, Ashesh .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :244-253

[56] SECOND: Sparsely Embedded Convolutional Detection [J].

Yan, Yan ;

Mao, Yuxing ;

Li, Bo .

SENSORS, 2018, 18 (10)

[57]

Yang B, 2018, PR MACH LEARN RES, V87

[58] PIXOR: Real-time 3D Object Detection from Point Clouds [J].

Yang, Bin ;

Luo, Wenjie ;

Urtasun, Raquel .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :7652-7660

[59] GSPN: Generative Shape Proposal Network for 3D Instance Segmentation in Point Cloud [J].

Yi, Li ;

Zhao, Wang ;

Wang, He ;

Sung, Minhyuk ;

Guibas, Leonidas .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :3942-3951

[60] PointWeb: Enhancing Local Neighborhood Features for Point Cloud Processing [J].

Zhao, Hengshuang ;

Jiang, Li ;

Fu, Chi-Wing ;

Jia, Jiaya .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :5550-5558

← 1 2 3 4 5 6 7 →