SCF-Net: Learning Spatial Contextual Features for Large-Scale Point Cloud Segmentation

被引：207

作者：

Fan, Siqi ^{[1
,2
]}

Dong, Qiulei ^{[2
,3
,4
]}

Zhu, Fenghua ^{[1
]}

Lv, Yisheng ^{[1
]}

Ye, Peijun ^{[1
]}

Wang, Fei-Yue ^{[1
]}

机构：

[1] CASIA, State Key Lab Management & Control Complex Syst, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China

[3] CASIA, Natl Lab Pattern Recognit, Beijing, Peoples R China

[4] Chinese Acad Sci, Ctr Excellence Brain Sci & Intelligence Technol, Beijing, Peoples R China

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021 | 2021年

基金：

中国国家自然科学基金; 国家重点研发计划;

关键词：

D O I：

10.1109/CVPR46437.2021.01427

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

How to learn effective features from large-scale point clouds for semantic segmentation has attracted increasing attention in recent years. Addressing this problem, we propose a learnable module that learns Spatial Contextual Features from large-scale point clouds, called SCF in this paper. The proposed module mainly consists of three blocks, including the local polar representation block, the dual-distance attentive pooling block, and the global contextual feature block. For each 3D point, the local polar representation block is firstly explored to construct a spatial representation that is invariant to the z-axis rotation, then the dual-distance attentive pooling block is designed to utilize the representations of its neighbors for learning more discriminative local features according to both the geometric and feature distances among them, and finally, the global contextual feature block is designed to learn a global context for each 3D point by utilizing its spatial location and the volume ratio of the neighborhood to the global point cloud. The proposed module could be easily embedded into various network architectures for point cloud segmentation, naturally resulting in a new 3D semantic segmentation network with an encoder-decoder architecture, called SCF-Net in this work. Extensive experimental results on two public datasets demonstrate that the proposed SCF-Net performs better than several state-of-the-art methods in most cases.

引用

页码：14499 / 14508

页数：10

共 46 条

[1]

[Anonymous], 2018, ECCV, DOI DOI 10.1007/978-3-030-01225-0_37

[2]

[Anonymous], 2016, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2016.170

[3]

[Anonymous], 2018, ECCV, DOI DOI 10.1007/978-3-030-01234-2_25

[4] Pointwise Convolutional Neural Networks [J].

Binh-Son Hua ;

Minh-Khoi Tran ;

Yeung, Sai-Kit .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :984-993

[5]

Boulch A., 2017, 3dor@ Eurographics, V3, P1, DOI [10.2312/3dor.20171047, DOI 10.2312/3DOR.201710473, 10.2312/3dor.201710473]

[6]

Chen L, 2019, J NEURORESTORATOLOGY, V7, P1, DOI [10.26599/JNR.2019.9040004, 10.18178/jacn.2019.7.1.263]

[7]

Chen SH, 2019, IEEE IMAGE PROC, P4395, DOI [10.1109/icip.2019.8803525, 10.1109/ICIP.2019.8803525]

[8] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis [J].

Dai, Angela ;

Qi, Charles Ruizhongtai ;

Niessner, Matthias .

30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, :6545-6554

[9]

Ding YY, 2012, PROC CVPR IEEE, P2895, DOI 10.1109/CVPR.2012.6248016

[10]

Engelmann F, 2020, IEEE INT CONF ROBOT, P9463, DOI [10.1109/icra40945.2020.9197503, 10.1109/ICRA40945.2020.9197503]

← 1 2 3 4 5 →