Enhanced Local Feature Learning With Simple Offset Attention for Semantic Segmentation of Large-Scale Point Clouds

被引：1

作者：

Chen, Dong ^{[1
]}

Wang, Yuebin ^{[1
]}

Zhang, Liqiang ^{[2
]}

Kang, Zhizhong ^{[1
]}

机构：

[1] China Univ Geosci, Sch Land Sci & Technol, Beijing 100083, Peoples R China

[2] Beijing Normal Univ, Fac Geog, Beijing Key Lab Environm Remote Sensing & Digital, Beijing 100875, Peoples R China

来源：

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2024年 / 62卷

基金：

中国国家自然科学基金;

关键词：

Point cloud compression; Feature extraction; Transformers; Training; Three-dimensional displays; Semantics; Semantic segmentation; Attention; large-scale outdoor scenes; point cloud; semantic segmentation; transformer; NETWORK;

D O I：

10.1109/TGRS.2024.3453966

中图分类号：

P3 [地球物理学]; P59 [地球化学];

学科分类号：

0708 ; 070902 ;

摘要：

The semantic segmentation network performance of large-scale outdoor point clouds is usually limited by the number of input point clouds. In the application of most methods, the point cloud is cut into small pieces as the training input, which will not only lead to heavy preprocessing burdens but also destroy the overall geometric structure of the scene. The transformer network has demonstrated remarkable advantages of attention mechanisms in focusing on crucial features and improving model performance. However, its training and inference on large-scale data are confined by computational complexity. To eliminate these challenges, the attention mechanisms are improved to enhance their performance in processing large-scale input data, while removing limitations imposed by computational complexity. Moreover, a novel local feature enhancement (LFE) module is developed to construct the LFE-Net, which can accurately and efficiently extract spatial and attribute features from local point clouds. In particular, an improved attention module, which is called simple offset attention (SOA), is adopted for local point cloud spatial feature learning. Compared with self-attention, SOA requires less memory and can better capture the fine-grained local features. Furthermore, to effectively avoid the destruction of object geometry and diminish the impact of sample imbalance, a training sample collection method based on the number of different classes is designed. To validate the effectiveness of this method, some experiments are conducted based on publicly accessible outdoor point cloud datasets. The results demonstrate that the LFE-Net can achieve substantial improvements compared with other cutting-edge network models.

引用

页数：16

共 44 条

[1] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences [J].

Behley, Jens ;

Garbade, Martin ;

Milioto, Andres ;

Quenzel, Jan ;

Behnke, Sven ;

Stachniss, Cyrill ;

Gall, Juergen .

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :9296-9306

[2]

Boulch Alexandre., 2017, 3dor@ eurographics, V2, P7, DOI [10.2312/3dor.20171047, DOI 10.2312/3DOR.20171047]

[3] End-to-End Object Detection with Transformers [J].

Carion, Nicolas ;

Massa, Francisco ;

Synnaeve, Gabriel ;

Usunier, Nicolas ;

Kirillov, Alexander ;

Zagoruyko, Sergey .

COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :213-229

[4] GAPointNet: Graph attention based point neural network for exploiting local feature of point cloud [J].

Chen, Can ;

Fragonara, Luca Zanotti ;

Tsourdos, Antonios .

NEUROCOMPUTING, 2021, 438 :122-132

[5]

Dosovitskiy A, 2021, Arxiv, DOI arXiv:2010.11929

[6] SCF-Net: Learning Spatial Contextual Features for Large-Scale Point Cloud Segmentation [J].

Fan, Siqi ;

Dong, Qiulei ;

Zhu, Fenghua ;

Lv, Yisheng ;

Ye, Peijun ;

Wang, Fei-Yue .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :14499-14508

[7] Omni-supervised Point Cloud Segmentation via Gradual Receptive Field Component Reasoning [J].

Gong, Jingyu ;

Xu, Jiachen ;

Tan, Xin ;

Song, Haichuan ;

Qu, Yanyun ;

Xie, Yuan ;

Ma, Lizhuang .

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :11668-11677

[8] PCT: Point cloud transformer [J].

Guo, Meng-Hao ;

Cai, Jun-Xiong ;

Liu, Zheng-Ning ;

Mu, Tai-Jiang ;

Martin, Ralph R. ;

Hu, Shi-Min .

COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) :187-199

[9] Deep Learning for 3D Point Clouds: A Survey [J].

Guo, Yulan ;

Wang, Hanyun ;

Hu, Qingyong ;

Liu, Hao ;

Liu, Li ;

Bennamoun, Mohammed .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (12) :4338-4364

[10]

Hackel T, 2017, Arxiv, DOI [arXiv:1704.03847, 10.5194/isprs-annals-IV-1-W1-91-2017, DOI 10.5194/ISPRS-ANNALS-IV-1-W1-91-2017]

← 1 2 3 4 5 →