LSLPCT: An Enhanced Local Semantic Learning Transformer for 3-D Point Cloud Analysis

被引:15
作者
Song, Yupeng [1 ]
He, Fazhi [1 ]
Duan, Yansong [2 ]
Si, Tongzhen [1 ]
Bai, Junwei [1 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Wuhan Univ, Sch Remote Sensing & Informat Engn, Wuhan 430072, Peoples R China
来源
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING | 2022年 / 60卷
基金
中国国家自然科学基金;
关键词
3-D point cloud; classification; deep learning; segmentation; transformer; FEATURES; DESIGN;
D O I
10.1109/TGRS.2022.3202823
中图分类号
P3 [地球物理学]; P59 [地球化学];
学科分类号
0708 ; 070902 ;
摘要
The 3-D point cloud is a common 3-D data representation that has received increasing attention for remote sensing applications. However, processing 3-D point cloud semantics, especially local semantic information, has always been a challenge and has attracted much attention. In this article, we propose a novel enhanced local semantic learning transformer for 3-D point cloud analysis, which aims to enhance the transformer awareness of local semantic features to handle complex point cloud tasks. First, we propose a novel transformer framework, the local semantic learning point cloud transformer (LSLPCT), which not only learns 3-D point clouds of global information, but also enhances the perception of local semantic information end-to-end. Second, we design an efficient local semantic learning self-attention mechanism, namely, LSL-SA, which can parallelize the perception of global contextual information and capture finer grained local semantic features. Third, our proposed LSL-SA is easy to implement and can integrate the existing transformers and convolutional neural network (CNN)-based networks for processing various point cloud tasks. Numerous experiments in different types of point cloud tasks have been conducted, and our method performs better or is competitive with other state-of-the-art methods.
引用
收藏
页数:13
相关论文
共 53 条
[1]   3D Semantic Parsing of Large-Scale Indoor Spaces [J].
Armeni, Iro ;
Sener, Ozan ;
Zamir, Amir R. ;
Jiang, Helen ;
Brilakis, Ioannis ;
Fischer, Martin ;
Savarese, Silvio .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1534-1543
[2]  
Atzmon M, 2018, Arxiv, DOI arXiv:1803.10091
[3]   Scale-invariant heat kernel signatures for non-rigid shape recognition [J].
Bronstein, Michael M. ;
Kokkinos, Iasonas .
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, :1704-1711
[4]   The evolution, challenges, and future of knowledge representation in product design systems [J].
Chandrasegaran, Senthil K. ;
Ramani, Karthik ;
Sriram, Ram D. ;
Horvath, Imre ;
Bernard, Alain ;
Harik, Ramy F. ;
Gao, Wei .
COMPUTER-AIDED DESIGN, 2013, 45 (02) :204-228
[5]   A Dense Feature Pyramid Network-Based Deep Learning Model for Road Marking Instance Segmentation Using MLS Point Clouds [J].
Chen, Siyun ;
Zhang, Zhenxin ;
Zhong, Ruofei ;
Zhang, Liqiang ;
Ma, Hao ;
Liu, Lirong .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (01) :784-800
[6]   A full migration BBO algorithm with enhanced population quality bounds for multimodal biomedical image registration [J].
Chen, Yilin ;
He, Fazhi ;
Li, Haoran ;
Zhang, Dejun ;
Wu, Yiqi .
APPLIED SOFT COMPUTING, 2020, 93
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Dosovitskiy A, 2021, Arxiv, DOI [arXiv:2010.11929, DOI 10.48550/ARXIV.2010.11929]
[9]   Correction of Low Vegetation Impact on UAV-Derived Point Cloud Heights With U-Net Networks [J].
Gruszczynski, Wojciech ;
Puniach, Edyta ;
Cwiakala, Pawel ;
Matwij, Wojciech .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[10]   PCT: Point cloud transformer [J].
Guo, Meng-Hao ;
Cai, Jun-Xiong ;
Liu, Zheng-Ning ;
Mu, Tai-Jiang ;
Martin, Ralph R. ;
Hu, Shi-Min .
COMPUTATIONAL VISUAL MEDIA, 2021, 7 (02) :187-199