CF-SIS: Semantic-Instance Segmentation of 3D Point Clouds by Context Fusion with Self-Attention

被引：23

作者：

Wen, Xin ^{[1
]}

Han, Zhizhong ^{[2
]}

Youk, Geunhyuk ^{[1
]}

Liu, Yu-Shen ^{[3
]}

机构：

[1] Tsinghua Univ, Sch Software, Beijing, Peoples R China

[2] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA

[3] Tsinghua Univ, Sch Software, BNRist, Beijing, Peoples R China

来源：

MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA | 2020年

基金：

国家重点研发计划;

关键词：

3D shape recognition; 3D shape segmentation; point cloud;

D O I：

10.1145/3394171.3413829

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

3D Semantic-Instance Segmentation (SIS) is a newly emerging research direction that aims to understand visual information of 3D scene on both semantic and instance level. The main difficulty lies in how to coordinate the paradox between mutual aid and sub-optimal problem. Previous methods usually address the mutual aid between instances and semantics by direct feature fusion or hand-crafted constraints to share the common knowledge of the two tasks. However, they neglect the abundant common knowledge of feature context in the feature space. Moreover, the direct feature fusion can raise the sub-optimal problem, since the false prediction of instance object can interfere the prediction of the semantic segmentation and vice versa. To address the above two issues, we propose a novel network of feature context fusion for SIS task, named CF-SIS. The idea is to associatively learn semantic and instance segmentation of 3D point clouds by context fusion with attention in the feature space. Our main contributions are two context fusion modules. First, we propose a novel inter-task context fusion module to take full advantage of mutual aid and relive the sub-optimal problem. It extracts the context in feature space from one task with attention, and selectively fuses the context into the other task using a gate fusion mechanism. Then, in order to enhance the mutual aid effect, the intra-task context fusion module is designed to further integrate the fused context, by selectively merging the similar feature through the self-attention mechanism. We conduct experiments on the S3DIS and ShapeNet datasets and show that CF-SIS outperforms the state-of-the-art methods on semantic and instance segmentation task.

引用

页码：1661 / 1669

页数：9

共 42 条

[21] JS']JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields [J].

Quang-Hieu Pham ;

Duc Thanh Nguyen ;

Binh-Son Hua ;

Roig, Gemma ;

Yeung, Sai-Kit .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :8819-8828

[22]

Ren M., 2017, ARXIV171110108

[23] Hybrid Point Cloud Attribute Compression Using Slice-based Layered Structure and Block-based Intra Prediction [J].

Shao, Yiting ;

Zhang, Qi ;

Li, Ge ;

Li, Zhu ;

Li, Li .

PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, :1199-1207

[24] PointRCNN: 3D Object Proposal Generation and Detection from Point Cloud [J].

Shi, Shaoshuai ;

Wang, Xiaogang ;

Li, Hongsheng .

2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, :770-779

[25] Deep Sliding Shapes for Amodal 3D Object Detection in RGB-D Images [J].

Song, Shuran ;

Xiao, Jianxiong .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :808-816

[26]

Song SR, 2014, LECT NOTES COMPUT SC, V8694, P634, DOI 10.1007/978-3-319-10599-4_41

[27] SPLATNet: Sparse Lattice Networks for Point Cloud Processing [J].

Su, Hang ;

Jampani, Varun ;

Sun, Deqing ;

Maji, Subhransu ;

Kalogerakis, Evangelos ;

Yang, Ming-Hsuan ;

Kautz, Jan .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2530-2539

[28] RGCNN: Regularized Graph CNN for Point Cloud Segmentation [J].

Te, Gusi ;

Hu, Wei ;

Guo, Zongming ;

Zheng, Amin .

PROCEEDINGS OF THE 2018 ACM MULTIMEDIA CONFERENCE (MM'18), 2018, :746-754

[29] Robust shape normalization of 3D articulated volumetric models [J].

Wang, Chao ;

Liu, Yu-Shen ;

Liu, Min ;

Yong, Jun-Hai ;

Paul, Jean-Claude .

COMPUTER-AIDED DESIGN, 2012, 44 (12) :1253-1268

[30] SGPN: Similarity Group Proposal Network for 3D Point Cloud Instance Segmentation [J].

Wang, Weiyue ;

Yu, Ronald ;

Huang, Qiangui ;

Neumann, Ulrich .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :2569-2578

← 1 2 3 4 5 →