3D-MPA: Multi Proposal Aggregation for 3D Semantic Instance Segmentation

Cited by: 169

Authors:

Engelmann, Francis [1,2]

Bokeloh, Martin [2]

Fathi, Alireza [2]

Leibe, Bastian [1]

Niessner, Matthias [3]

Affiliations:

[1] RWTH Aachen University, Aachen, Germany

[2] Google, Mountain View, CA 94043, USA

[3] Technical University of Munich, Munich, Germany

Source:

2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2020) | 2020

Keywords:

DOI:

10.1109/CVPR42600.2020.00905

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We present 3D-MPA, a method for instance segmentation on 3D point clouds. Given an input point cloud, we propose an object-centric approach where each point votes for its object center. We sample object proposals from the predicted object centers. Then, we learn proposal features from grouped point features that voted for the same object center. A graph convolutional network introduces inter-proposal relations, providing higher-level feature learning in addition to the lower-level point features. Each proposal comprises a semantic label, a set of associated points over which we define a foreground-background mask, an objectness score, and aggregation features. Previous works usually perform non-maximum suppression (NMS) over proposals to obtain the final object detections or semantic instances. However, NMS can discard potentially correct predictions. Instead, our approach keeps all proposals and groups them together based on the learned aggregation features. We show that grouping proposals improves over NMS and outperforms previous state-of-the-art methods on the tasks of 3D object detection and semantic instance segmentation on the ScanNetV2 benchmark and the S3DIS dataset.
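The two non-learned steps the abstract describes, center voting and grouping proposals instead of suppressing them, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`vote_centers`, `group_proposals`, `aggregate_masks`) and the `radius` parameter are hypothetical. The offsets and aggregation features, which the method learns, are passed in here as plain inputs. The distance-threshold grouping is a simple stand-in for whatever clustering is applied to the learned features.

```python
from math import dist


def vote_centers(points, offsets):
    """Each point votes for an object center: its position plus a predicted offset.

    `offsets` stands in for the network's per-point predictions (assumption).
    """
    return [tuple(p_k + o_k for p_k, o_k in zip(p, o))
            for p, o in zip(points, offsets)]


def group_proposals(features, radius):
    """Group proposals whose aggregation features lie within `radius` of each other.

    Grouping is transitive (single linkage), implemented with union-find.
    Returns a sorted list of groups, each a list of proposal indices.
    """
    parent = list(range(len(features)))

    def find(i):
        # Follow parent links to the root, compressing the path on the way.
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    for i in range(len(features)):
        for j in range(i + 1, len(features)):
            if dist(features[i], features[j]) <= radius:
                parent[find(i)] = find(j)

    groups = {}
    for i in range(len(features)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


def aggregate_masks(groups, proposal_points):
    """Form one instance per group as the union of its proposals' point sets."""
    return [sorted(set().union(*(proposal_points[i] for i in g)))
            for g in groups]
```

In this sketch no proposal is discarded: two overlapping proposals with similar features contribute all of their points to one instance, whereas NMS would keep only the higher-scoring one.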


Pages: 9028-9037

Page count: 10

