URBAN CLASSIFICATION BASED ON TOP-VIEW POINT CLOUD AND SAR IMAGE FUSION WITH SWIN TRANSFORMER

被引:0
作者
Xue, R. [1 ,2 ]
Zhang, X. [2 ]
Soergel, U. [2 ]
机构
[1] Xidian Univ, Natl Lab Radar Signal Proc, Xian 710071, Peoples R China
[2] Univ Stuttgart, Inst Photogrammetry, D-70174 Stuttgart, Germany
来源
XXIV ISPRS CONGRESS: IMAGING TODAY, FORESEEING TOMORROW, COMMISSION III | 2022年 / 43-B3卷
关键词
Deep Learning; Transformer; Feature Fusing; Urban Classification; Synthetic Aperture Radar; Point Cloud; LIDAR;
D O I
10.5194/isprs-archives-XLIII-B3-2022-559-2022
中图分类号
P9 [自然地理学];
学科分类号
0705 ; 070501 ;
摘要
Urban areas are complex scenarios consisting of objects with various materials. This variety poses a challenge to single-data classification schemes. In this paper, we propose a feature fusion and classification network on RGB top-view point cloud and SAR images with swin-Transformer. In this network, the heterogeneous features are learned separately by an asymmetric encoder, and then they are concatenated along the channel dimension and fed into a fusing encoder. Finally, the fused features are decoded by an UperNet for generating the semantic labels. As data we use the subset of high-resolution 3D point cloud provided by Hessigheim benchmark which are complemented by TerraSAR-X images. The overall precision and the mean intersection over union (mloU) achieves 87.25% and 73.56%, respectively, which outperforms the single-data swin-Transformer by 4.08% and 1.91%, respectively.
引用
收藏
页码:559 / 564
页数:6
相关论文
共 15 条
[1]  
Dosovitskiy A, 2020, ARXIV
[2]   A comprehensive review of hyperspectral data fusion with lidar and sar data [J].
Kahraman, Sevcan ;
Bacher, Raphael .
ANNUAL REVIEWS IN CONTROL, 2021, 51 :236-253
[3]  
Kolle M., 2021, ISPRS Open J. Photogr. Remote Sens., V1, DOI [DOI 10.1016/J.OPHOTO.2021.100001, 10.1016]
[4]   Pixel level fusion techniques for SAR and optical images: A review [J].
Kulkarni, Samadhan C. ;
Rege, Priti P. .
INFORMATION FUSION, 2020, 59 :13-29
[5]  
Li XH, 2018, PR MACH LEARN RES, V80
[6]   Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].
Liu, Ze ;
Lin, Yutong ;
Cao, Yue ;
Hu, Han ;
Wei, Yixuan ;
Zhang, Zheng ;
Lin, Stephen ;
Guo, Baining .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002
[7]  
Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965
[8]   V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation [J].
Milletari, Fausto ;
Navab, Nassir ;
Ahmadi, Seyed-Ahmad .
PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :565-571
[9]  
Roth A., 2004, P INT 20 ISPRS C, P840
[10]  
Roth A., 2005, P ISPRS WG VII1 HUMA