Neighborhood co-occurrence modeling in 3D point cloud segmentation

被引:11
作者
Gong, Jingyu [1 ]
Ye, Zhou [2 ]
Ma, Lizhuang [1 ,3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn, Shanghai 200240, Peoples R China
[2] Shanghai CLS Fintech Co LTD, Shanghai 200030, Peoples R China
[3] Shanghai Jiao Tong Univ, MoE Key Lab Artificial Intelligence, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金; 上海市自然科学基金; 国家重点研发计划;
关键词
3D vision; point cloud; co-occurrence relation modeling; semantic segmentation;
D O I
10.1007/s41095-021-0244-6
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
A significant performance boost has been achieved in point cloud semantic segmentation by utilization of the encoder-decoder architecture and novel convolution operations for point clouds. However, co-occurrence relationships within a local region which can directly influence segmentation results are usually ignored by current works. In this paper, we propose a neighborhood co-occurrence matrix (NCM) to model local co-occurrence relationships in a point cloud. We generate target NCM and prediction NCM from semantic labels and a prediction map respectively. Then, Kullback-Leibler (KL) divergence is used to maximize the similarity between the target and prediction NCMs to learn the co-occurrence relationship. Moreover, for large scenes where the NCMs for a sampled point cloud and the whole scene differ greatly, we introduce a reverse form of KL divergence which can better handle the difference to supervise the prediction NCMs. We integrate our method into an existing backbone and conduct comprehensive experiments on three datasets: Semantic3D for outdoor space segmentation, and S3DIS and ScanNet v2 for indoor scene segmentation. Results indicate that our method can significantly improve upon the backbone and outperform many leading competitors.
引用
收藏
页码:303 / 315
页数:13
相关论文
共 45 条
[1]   3D Semantic Parsing of Large-Scale Indoor Spaces [J].
Armeni, Iro ;
Sener, Ozan ;
Zamir, Amir R. ;
Jiang, Helen ;
Brilakis, Ioannis ;
Fischer, Martin ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543
[2]   ScanNet: Richly-annotated 3D Reconstructions of Indoor Scenes [J].
Dai, Angela ;
Chang, Angel X. ;
Savva, Manolis ;
Halber, Maciej ;
Funkhouser, Thomas ;
Niessner, Matthias .
30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :2432-2443
[3]   Semantic Segmentation of RGBD Images with Mutex Constraints [J].
Deng, Zhuo ;
Todorovic, Sinisa ;
Latecki, Longin Jan .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :1733-1741
[4]   Deep FusionNet for Point Cloud Semantic Segmentation [J].
Zhang, Feihu ;
Fang, Jin ;
Wah, Benjamin ;
Torr, Philip .
COMPUTER VISION - ECCV 2020, PT XXIV, 2020, 12369 :644-663
[5]   Omni-supervised Point Cloud Segmentation via Gradual Receptive Field Component Reasoning [J].
Gong, Jingyu ;
Xu, Jiachen ;
Tan, Xin ;
Song, Haichuan ;
Qu, Yanyun ;
Xie, Yuan ;
Ma, Lizhuang .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11668-11677
[6]  
Gong JY, 2021, AAAI CONF ARTIF INTE, V35, P1424
[7]  
Hackel Timo, 2017, ISPRS ANN PHOTOGRAMM, VIV- 1-W1, P91, DOI DOI 10.5194/ISPRS-ANNALS-IV-1-W1-91-2017
[8]  
Han WK, 2020, AAAI CONF ARTIF INTE, V34, P10925
[9]   Semantic Labeling and Instance Segmentation of 3D Point Clouds Using Patch Context Analysis and Multiscale Processing [J].
Hu, Shi-Min ;
Cai, Jun-Xiong ;
Lai, Yu-Kun .
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (07) :2485-2498
[10]   JS']JSENet: Joint Semantic Segmentation and Edge Detection Network for 3D Point Clouds [J].
Hu, Zeyu ;
Zhen, Mingmin ;
Bai, Xuyang ;
Fu, Hongbo ;
Tai, Chiew-lan .
COMPUTER VISION - ECCV 2020, PT XX, 2020, 12365 :222-239