JS']JSPNet: Learning joint semantic & instance segmentation of point clouds via feature self-similarity and cross-task probability

被引:26
作者
Chen, Feng [1 ]
Wu, Fei [1 ]
Gao, Guangwei [2 ]
Ji, Yimu [3 ,4 ]
Xu, Jing [5 ]
Jiang, Guo-Ping [1 ]
Jing, Xiao-Yuan [1 ,6 ]
机构
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing, Peoples R China
[2] Natl Inst Informat, Digital Content & Media Sci Res Div, Tokyo, Japan
[3] Nanjing Ctr HPC China, Nanjing, Peoples R China
[4] Jiangsu HPC & Intelligent Proc Engineer Res Ctr, Nanjing, Peoples R China
[5] Hohai Univ, Sch Law, Nanjing, Peoples R China
[6] Wuhan Univ, Sch Comp Sci, Wuhan, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Instance & semantic segmentation; Point could processing; Multi-task learning; REPRESENTATION;
D O I
10.1016/j.patcog.2021.108250
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a novel method named JSPNet, to segment 3D point cloud in semantic and instance simultaneously. First, we analyze the problem in addressing joint semantic and instance segmentation, including the common ground of cooperation of two tasks, conflict of two tasks, quadruplet relation between semantic and instance distributions, and ignorance of existing works. Then we introduce our method to reinforce mutual cooperation and alleviate the essential conflict. Our method has a shared encoder and two decoders to address two tasks. Specifically, to maintain discriminative features and characterize inconspicuous content, a similarity-based feature fusion module is designed to locate the inconspicuous area in the feature of current branch and then select related features from the other branch to compensate for the unclear content. Furthermore, given the salient semantic feature and the salient instance feature, a cross-task probability-based feature fusion module is developed to establish the probabilistic correlation between semantic and instance features. This module could transform features from one branch and further fuse them with the other branch by multiplying probabilistic matrix. Experimental results on a large-scale 3D indoor point cloud dataset S3DIS and a part-segmentation dataset ShapeNet have demonstrated the superiority of our method over existing state-of-the-arts in both semantic and instance segmentation. The proposed method outperforms PointNet with 12% and 26% improvements and outperforms ASIS with 2.7% and 4.3% improvements in terms of mIoU and mPre. Code of this work has been made available at https://github.com/Chenfeng1271/JSPNet . (c) 2021 Elsevier Ltd. All rights reserved.
引用
收藏
页数:11
相关论文
共 47 条
[1]  
[Anonymous], 2006, MULTITASK FEATURE SE
[2]   3D Semantic Parsing of Large-Scale Indoor Spaces [J].
Armeni, Iro ;
Sener, Ozan ;
Zamir, Amir R. ;
Jiang, Helen ;
Brilakis, Ioannis ;
Fischer, Martin ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Representation Learning: A Review and New Perspectives [J].
Bengio, Yoshua ;
Courville, Aaron ;
Vincent, Pascal .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) :1798-1828
[5]   Multi-View 3D Object Detection Network for Autonomous Driving [J].
Chen, Xiaozhi ;
Ma, Huimin ;
Wan, Ji ;
Li, Bo ;
Xia, Tian .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6526-6534
[6]   Construction of a Hierarchical Feature Enhancement Network and Its Application in Fault Recognition [J].
Chen, Zhe ;
Lu, Huimin ;
Tian, Shiqing ;
Qiu, Junlin ;
Kamiya, Tohru ;
Serikawa, Seiichi ;
Xu, Lizhong .
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (07) :4827-4836
[7]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[8]   Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].
Dai, Angela ;
Qi, Charles Ruizhongtai ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554
[9]  
De Brabandere Bert, 2017, arXiv
[10]  
Engelcke Martin, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P1355, DOI 10.1109/ICRA.2017.7989161