A multi-point focus transformer approach for large-scale ALS point cloud ground filtering

Cited by: 0

Authors:

Liu, Tongyang ^{[1]}

Wei, Bo ^{[2]}

Hao, Jiaojiao ^{[2]}

Li, Zexia ^{[1]}

Ye, Fuqiang ^{[1]}

Wang, Lili ^{[1]}

Affiliations:

[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou 730070, Peoples R China

[2] Gansu Water Resources & Hydropower Survey Design &, Lanzhou, Gansu, Peoples R China

Source:

INTERNATIONAL JOURNAL OF REMOTE SENSING | 2025

Keywords:

Transformer; 3D point cloud; farthest point sampling; random sampling; multi-point focus mechanism; attention integration module; CONVOLUTIONAL NEURAL-NETWORK; LIDAR DATA; OBJECT DETECTION; DTM EXTRACTION; CLASSIFICATION;

DOI:

10.1080/01431161.2024.2443604

Chinese Library Classification (CLC) number:

TP7 [Remote sensing technology];

Discipline classification codes:

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

Abstract:

In recent years, Transformer networks have achieved a series of advances in 3D point cloud semantic segmentation and shape classification. In this paper, we propose a multi-point focus transformer network for filtering large-scale outdoor point clouds. It integrates farthest point sampling and random sampling to extract both global and local multi-feature information from point clouds. To compute the self-attention and positional encoding of point clouds more accurately, we propose a multi-point focus mechanism that combines farthest point sampling and random sampling to select multiple focal points from neighbourhoods at different scales, and then performs attention computation and positional encoding for these focal points. An attention integration module then aggregates the self-attention and positional information from the multiple focal points. Finally, the idea of the inverse residual MLP is borrowed to obtain deeper-level point cloud features through extended channels. Extensive experiments on the recent OpenGF dataset across different terrain scenarios yielded commendable filtering accuracy. On the Test1 dataset, qualitative visual comparison and quantitative analysis against other state-of-the-art methods showed an overall accuracy (OA) of up to 98.12%, further verifying the effectiveness and competitiveness of the proposed multi-point focus transformer network.
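The two sampling strategies named in the abstract can be illustrated with a minimal sketch in plain Python. This is not the authors' implementation: the function names, the `start` and `seed` parameters, and the toy coordinates are all assumptions made for illustration. Farthest point sampling repeatedly picks the point farthest from those already chosen, which spreads samples across the whole cloud, while random sampling is cheaper but gives no coverage guarantee.

```python
import random


def farthest_point_sampling(points, k, start=0):
    """Return k indices, each new one maximising distance to the chosen set.

    Hypothetical helper for illustration only; `start` is the assumed seed index.
    """
    if not 0 < k <= len(points):
        raise ValueError("k must be between 1 and len(points)")
    chosen = [start]
    # Squared distance from every point to its nearest chosen point so far.
    nearest = [float("inf")] * len(points)
    while len(chosen) < k:
        last = points[chosen[-1]]
        for i, p in enumerate(points):
            d = sum((a - b) ** 2 for a, b in zip(p, last))
            if d < nearest[i]:
                nearest[i] = d
        # The next sample is the point farthest from everything chosen so far.
        chosen.append(max(range(len(points)), key=nearest.__getitem__))
    return chosen


def random_sampling(points, k, seed=0):
    """Return k distinct indices drawn uniformly at random (assumed fixed seed)."""
    return random.Random(seed).sample(range(len(points)), k)


# Toy 3D points (x, y, z); the values are made up for this example.
cloud = [(0, 0, 0), (1, 0, 0), (10, 0, 0), (0, 12, 0), (10, 10, 5)]
print(farthest_point_sampling(cloud, 3))
print(random_sampling(cloud, 3))
```

On large ALS tiles the quadratic cost of farthest point sampling is usually the reason to mix in random sampling, which is consistent with the combination the abstract describes.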


Pages: 979-999

Page count: 21
