A multi-point focus transformer approach for large-scale ALS point cloud ground filtering

被引:0
作者
Liu, Tongyang [1 ]
Wei, Bo [2 ]
Hao, Jiaojiao [2 ]
Li, Zexia [1 ]
Ye, Fuqiang [1 ]
Wang, Lili [1 ]
机构
[1] Northwest Normal Univ, Coll Phys & Elect Engn, Lanzhou 730070, Peoples R China
[2] Gansu Water Resources & Hydropower Survey Design &, Lanzhou, Gansu, Peoples R China
关键词
Transformer; 3D point cloud; farthest point sampling; random sampling; multi-point focus mechanism; attention integration module; CONVOLUTIONAL NEURAL-NETWORK; LIDAR DATA; OBJECT DETECTION; DTM EXTRACTION; CLASSIFICATION;
D O I
10.1080/01431161.2024.2443604
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
In recent years, Transformer networks have achieved a series of advancements in 3D point cloud semantic segmentation and shape classification. In this paper, we propose a multi-point focus transformer network for outdoor large-scale point cloud filtering. It integrates farthest point sampling and random sampling methods to extract both global and local multi-feature information from point clouds. To more accurately compute the self-attention and positional encoding of point clouds, this paper proposes a multi-point focus mechanism that uses a combination of farthest point sampling and random sampling to select multiple focal points from neighbourhoods at different scales for special focused, followed by attention computation and positional encoding for these focal points. Subsequently, an attention integration module is introduced to aggregate the self-attention and positional information from multiple focal points. Finally, the idea of inverse residual MLP was borrowed to obtain deeper level features of point clouds through extended channels. Extensive experiments were conducted on the latest OpenGF dataset for different terrain scenarios, resulting in commendable filtering accuracy. On the Test1 dataset, qualitative visual comparison and quantitative analysis were conducted with other state-of-the-art methods, and the overall accuracy (OA) could reach up to 98.12%, further verifying the effectiveness and competitiveness of the proposed multi-point focusing transformer network.
引用
收藏
页码:979 / 999
页数:21
相关论文
共 48 条
  • [1] DGCNN: A convolutional neural network over large-scale labeled graphs
    Anh Viet Phan
    Minh Le Nguyen
    Yen Lam Hoang Nguyen
    Lam Thu Bui
    [J]. NEURAL NETWORKS, 2018, 108 : 533 - 543
  • [2] [Anonymous], Dynamic Graph CNN with Attention Module for 3D Hand Pose Estimation, DOI [10.1007/978-3-030-22796-810, DOI 10.1007/978-3-030-22796-810]
  • [3] [Anonymous], NIPS, P5099, DOI [10.48550/arXiv.1706.02413, DOI 10.1109/CVPR.2017.16]
  • [4] 3D Semantic Parsing of Large-Scale Indoor Spaces
    Armeni, Iro
    Sener, Ozan
    Zamir, Amir R.
    Jiang, Helen
    Brilakis, Ioannis
    Fischer, Martin
    Savarese, Silvio
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 1534 - 1543
  • [5] Axelsson P., 2000, The International Archives of the Photogrammetry and Remote Sensing, Amsterdam, The Netherlands, VXXXIII, P110
  • [6] SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences
    Behley, Jens
    Garbade, Martin
    Milioto, Andres
    Quenzel, Jan
    Behnke, Sven
    Stachniss, Cyrill
    Gall, Juergen
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, : 9296 - 9306
  • [7] DTM extraction under forest canopy using LiDAR data and a modified invasive weed optimization algorithm
    Bigdeli, Behnaz
    Amirkolaee, Hamed Amini
    Pahlavani, Parham
    [J]. REMOTE SENSING OF ENVIRONMENT, 2018, 216 : 289 - 300
  • [8] Multiscale Grid Method for Detection and Reconstruction of Building Roofs from Airborne LiDAR Data
    Chen, Yanming
    Cheng, Liang
    Li, Manchun
    Wang, Jiechen
    Tong, Lihua
    Yang, Kang
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (10) : 4081 - 4094
  • [9] Shape Completion using 3D-Encoder-Predictor CNNs and Shape Synthesis
    Dai, Angela
    Qi, Charles Ruizhongtai
    Niessner, Matthias
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 6545 - 6554
  • [10] Multiresolution Tree Networks for 3D Point Cloud Processing
    Gadelha, Matheus
    Wang, Rui
    Maji, Subhransu
    [J]. COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 : 105 - 122