共 50 条
Building Extraction From High-Resolution Multispectral and SAR Images Using a Boundary-Link Multimodal Fusion Network
被引:0
|作者:
Zhao, Zhe
[1
,2
]
Zhao, Boya
[1
]
Wu, Yuanfeng
[1
]
He, Zutian
[1
,3
]
Gao, Lianru
[1
]
机构:
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Computat Opt Imaging Technol, Beijing 100094, Peoples R China
[2] Univ Chinese Acad Sci, Coll Resources & Environm, Beijing 100049, Peoples R China
[3] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100049, Peoples R China
基金:
国家重点研发计划;
关键词:
Buildings;
Feature extraction;
Optical imaging;
Data mining;
Optical sensors;
Radar polarimetry;
Adaptive optics;
Synthetic aperture radar;
Optical scattering;
Remote sensing;
Building extraction;
multimodal segmentation;
multispectral;
synthetic aperture radar (SAR);
AUTOMATIC DETECTION;
CLASSIFICATION;
D O I:
10.1109/JSTARS.2025.3525709
中图分类号:
TM [电工技术];
TN [电子技术、通信技术];
学科分类号:
0808 ;
0809 ;
摘要:
Automatically extracting buildings with high precision from remote sensing images is crucial for various applications. Due to their distinct imaging modalities and complementary characteristics, optical and synthetic aperture radar (SAR) images serve as primary data sources for this task. We propose a novel boundary-link multimodal fusion network for joint semantic segmentation to leverage the information in these images. An initial building extraction result is obtained from the multimodal fusion network, followed by refinement using building boundaries. The model achieves high-precision building delineation by leveraging building boundary and semantic information from optical and SAR images. It distinguishes buildings from the background in complex environments, such as dense urban areas or regions with mixed vegetation, particularly when small buildings lack distinct texture or color features. We conducted experiments using the MSAW dataset (RGB-NIR and SAR data) and DFC track2 datasets (RGB and SAR data). The results indicate that our model significantly enhances extraction accuracy and improves building boundary delineation. The intersection over union metric is 2.5% to 3.5% higher than that of other multimodal joint segmentation methods.
引用
收藏
页码:3864 / 3878
页数:15
相关论文