URBAN CLASSIFICATION BASED ON TOP-VIEW POINT CLOUD AND SAR IMAGE FUSION WITH SWIN TRANSFORMER

被引：0

作者：

Xue, R. ^{[1
,2
]}

Zhang, X. ^{[2
]}

Soergel, U. ^{[2
]}

机构：

[1] Xidian Univ, Natl Lab Radar Signal Proc, Xian 710071, Peoples R China

[2] Univ Stuttgart, Inst Photogrammetry, D-70174 Stuttgart, Germany

来源：

XXIV ISPRS CONGRESS: IMAGING TODAY, FORESEEING TOMORROW, COMMISSION III | 2022年 / 43-B3卷

关键词：

Deep Learning; Transformer; Feature Fusing; Urban Classification; Synthetic Aperture Radar; Point Cloud; LIDAR;

D O I：

10.5194/isprs-archives-XLIII-B3-2022-559-2022

中图分类号：

P9 [自然地理学];

学科分类号：

0705 ; 070501 ;

摘要：

Urban areas are complex scenarios consisting of objects with various materials. This variety poses a challenge to single-data classification schemes. In this paper, we propose a feature fusion and classification network on RGB top-view point cloud and SAR images with swin-Transformer. In this network, the heterogeneous features are learned separately by an asymmetric encoder, and then they are concatenated along the channel dimension and fed into a fusing encoder. Finally, the fused features are decoded by an UperNet for generating the semantic labels. As data we use the subset of high-resolution 3D point cloud provided by Hessigheim benchmark which are complemented by TerraSAR-X images. The overall precision and the mean intersection over union (mloU) achieves 87.25% and 73.56%, respectively, which outperforms the single-data swin-Transformer by 4.08% and 1.91%, respectively.

引用

页码：559 / 564

页数：6

共 15 条

[1]

Dosovitskiy A, 2020, ARXIV

[2] A comprehensive review of hyperspectral data fusion with lidar and sar data [J].

Kahraman, Sevcan ;

Bacher, Raphael .

ANNUAL REVIEWS IN CONTROL, 2021, 51 :236-253

[3]

Kolle M., 2021, ISPRS Open J. Photogr. Remote Sens., V1, DOI [DOI 10.1016/J.OPHOTO.2021.100001, 10.1016]

[4] Pixel level fusion techniques for SAR and optical images: A review [J].

Kulkarni, Samadhan C. ;

Rege, Priti P. .

INFORMATION FUSION, 2020, 59 :13-29

[5]

Li XH, 2018, PR MACH LEARN RES, V80

[6] Swin Transformer: Hierarchical Vision Transformer using Shifted Windows [J].

Liu, Ze ;

Lin, Yutong ;

Cao, Yue ;

Hu, Han ;

Wei, Yixuan ;

Zhang, Zheng ;

Lin, Stephen ;

Guo, Baining .

2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :9992-10002

[7]

Long J, 2015, PROC CVPR IEEE, P3431, DOI 10.1109/CVPR.2015.7298965

[8] V-Net: Fully Convolutional Neural Networks for Volumetric Medical Image Segmentation [J].

Milletari, Fausto ;

Navab, Nassir ;

Ahmadi, Seyed-Ahmad .

PROCEEDINGS OF 2016 FOURTH INTERNATIONAL CONFERENCE ON 3D VISION (3DV), 2016, :565-571

[9]

Roth A., 2004, P INT 20 ISPRS C, P840

[10]

Roth A., 2005, P ISPRS WG VII1 HUMA

← 1 2 →