A comprehensive evaluation of deep vision transformers for road extraction from very-high-resolution satellite data

被引：0

作者：

Bolcek, Jan ^{[1
,2
]}

Gibril, Mohamed Barakat A. ^{[1
]}

Al-Ruzouq, Rami ^{[1
]}

Shanableh, Abdallah ^{[1
,3
]}

Jena, Ratiranjan ^{[1
]}

Hammouri, Nezar ^{[1
]}

Sachit, Mourtadha Sarhan ^{[4
]}

Ghorbanzadeh, Omid ^{[5
]}

机构：

[1] Univ Sharjah, Res Inst Sci & Engn, GIS & Remote Sensing Ctr, Sharjah 27272, U Arab Emirates

[2] Brno Univ Technol, Fac Elect Engn & Commun, Dept Radio Elect, Brno Kralovo Pole 61600, Czech Republic

[3] Australian Univ, Sci Res Ctr, Kuwait, Kuwait

[4] Univ Thi Qar, Coll Engn, Dept Civil Engn, Nasiriyah 64001, Thi Qar, Iraq

[5] Univ Nat Resources & Life Sci, Inst Geomat, Peter Jordan Str 82, A-1190 Vienna, Austria

来源：

SCIENCE OF REMOTE SENSING | 2025年 / 11卷

关键词：

Remote sensing; Road extraction; Satellite data; Semantic segmentation; Vision Transformers; REMOTE-SENSING IMAGERY; NETWORK;

D O I：

10.1016/j.srs.2024.100190

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

Transformer-based semantic segmentation architectures excel in extracting road networks from very-high- resolution (VHR) satellite images due to their ability to capture global contextual information. Nonetheless, there is a gap in research regarding their comparative effectiveness, efficiency, and performance in extracting road networks from multicity VHR data. This study evaluates 11 transformer-based models on three publicly available datasets (DeepGlobe Road Extraction Dataset, SpaceNet-3 Road Network Detection Dataset, and Massachusetts Road Dataset) to assess their performance, efficiency, and complexity in mapping road networks from multicity, multidate, and multisensory VHR optical satellite images. The evaluated models include Unified Perceptual Parsing for Scene Understanding (UperNet) based on the Swin transformer (UperNet-SwinT), and Multi-path Vision Transformer (UperNet-MpViT), Twins transformer, Segmenter, SegFormer, K-Net based on SwinT, Mask2Former based on SwinT (Mask2Former-SwinT), TopFormer, UniFormer, and PoolFormer. Results showed that the models recorded mean F-scores (mF-score) ranging from 82.22% to 90.70% for the DeepGlobe dataset, 58.98%-86.95% for the Massachusetts dataset, and 69.02%-86.14% for the SpaceNet-3 dataset. Mask2Former-SwinT, UperNet-MpViT, and SegFormer were the top performers among the evaluated models. The Mask2Former, based on the SwinT, demonstrated a strong balance of high performance across different satellite image datasets and moderate computational efficiency. This investigation aids in selecting the most suitable model for extracting road networks from remote sensing data.

引用

页数：17

共 50 条

[41] Deep Extraction of Cropland Parcels from Very High-Resolution Remotely Sensed Imagery
Xia, Liegang
Luo, Jiancheng
Sun, Yingwei
Yang, Haiping
2018 7TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS), 2018, : 405 - 409
[42] NEW NETWORK BASED ON D-LINKNET AND DENSENET FOR HIGH RESOLUTION SATELLITE IMAGERY ROAD EXTRACTION
Peng, Bo
Li, Yuxia
Fan, Kunlong
Yuan, Lang
Tong, Ling
He, Lei
2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3939 - 3942
[43] NEW NETWORK BASED ON D-LINKNET AND RESNEXT FOR HIGH RESOLUTION SATELLITE IMAGERY ROAD EXTRACTION
Fan, Kunlong
Li, Yuxia
Yuan, Lang
Si, Yu
Tong, Ling
IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2599 - 2602
[44] Road Extraction From High Spatial Resolution Remote Sensing Image Based on Multi-Task Key Point Constraints
Li, Xungen
Zhang, Zhan
Lv, Shuaishuai
Pan, Mian
Ma, Qi
Yu, Haibin
IEEE ACCESS, 2021, 9 : 95896 - 95910
[45] Fast and Efficient Evaluation of Building Damage From Very High Resolution Optical Satellite Images
Dubois, David
Lepage, Richard
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (10) : 4167 - 4176
[46] Full-Level Domain Adaptation for Building Extraction in Very-High-Resolution Optical Remote-Sensing Images
Peng, Daifeng
Guan, Haiyan
Zang, Yufu
Bruzzone, Lorenzo
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
[47] Building footprint extraction and counting on very high-resolution satellite imagery using object detection deep learning framework
Nurkarim, Wahidya
Wijayanto, Arie Wahyu
EARTH SCIENCE INFORMATICS, 2023, 16 (01) : 515 - 532
[48] Building footprint extraction and counting on very high-resolution satellite imagery using object detection deep learning framework
Wahidya Nurkarim
Arie Wahyu Wijayanto
Earth Science Informatics, 2023, 16 : 515 - 532
[49] Building Extraction From Very High-Resolution Remote Sensing Image With Few Data
Cui, Zhenqi
Nie, Pei
Persello, Claudio
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
[50] An Adaptive Multifeature Method for Semiautomatic Road Extraction From High-Resolution Stereo Mapping Satellite Images
Pan, Hong
Jia, Yonghong
Lv, Zhen
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (02) : 201 - 205

← 1 2 3 4 5 →