A comprehensive evaluation of deep vision transformers for road extraction from very-high-resolution satellite data

被引:0
|
作者
Bolcek, Jan [1 ,2 ]
Gibril, Mohamed Barakat A. [1 ]
Al-Ruzouq, Rami [1 ]
Shanableh, Abdallah [1 ,3 ]
Jena, Ratiranjan [1 ]
Hammouri, Nezar [1 ]
Sachit, Mourtadha Sarhan [4 ]
Ghorbanzadeh, Omid [5 ]
机构
[1] Univ Sharjah, Res Inst Sci & Engn, GIS & Remote Sensing Ctr, Sharjah 27272, U Arab Emirates
[2] Brno Univ Technol, Fac Elect Engn & Commun, Dept Radio Elect, Brno Kralovo Pole 61600, Czech Republic
[3] Australian Univ, Sci Res Ctr, Kuwait, Kuwait
[4] Univ Thi Qar, Coll Engn, Dept Civil Engn, Nasiriyah 64001, Thi Qar, Iraq
[5] Univ Nat Resources & Life Sci, Inst Geomat, Peter Jordan Str 82, A-1190 Vienna, Austria
来源
SCIENCE OF REMOTE SENSING | 2025年 / 11卷
关键词
Remote sensing; Road extraction; Satellite data; Semantic segmentation; Vision Transformers; REMOTE-SENSING IMAGERY; NETWORK;
D O I
10.1016/j.srs.2024.100190
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Transformer-based semantic segmentation architectures excel in extracting road networks from very-high- resolution (VHR) satellite images due to their ability to capture global contextual information. Nonetheless, there is a gap in research regarding their comparative effectiveness, efficiency, and performance in extracting road networks from multicity VHR data. This study evaluates 11 transformer-based models on three publicly available datasets (DeepGlobe Road Extraction Dataset, SpaceNet-3 Road Network Detection Dataset, and Massachusetts Road Dataset) to assess their performance, efficiency, and complexity in mapping road networks from multicity, multidate, and multisensory VHR optical satellite images. The evaluated models include Unified Perceptual Parsing for Scene Understanding (UperNet) based on the Swin transformer (UperNet-SwinT), and Multi-path Vision Transformer (UperNet-MpViT), Twins transformer, Segmenter, SegFormer, K-Net based on SwinT, Mask2Former based on SwinT (Mask2Former-SwinT), TopFormer, UniFormer, and PoolFormer. Results showed that the models recorded mean F-scores (mF-score) ranging from 82.22% to 90.70% for the DeepGlobe dataset, 58.98%-86.95% for the Massachusetts dataset, and 69.02%-86.14% for the SpaceNet-3 dataset. Mask2Former-SwinT, UperNet-MpViT, and SegFormer were the top performers among the evaluated models. The Mask2Former, based on the SwinT, demonstrated a strong balance of high performance across different satellite image datasets and moderate computational efficiency. This investigation aids in selecting the most suitable model for extracting road networks from remote sensing data.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] Deep Extraction of Cropland Parcels from Very High-Resolution Remotely Sensed Imagery
    Xia, Liegang
    Luo, Jiancheng
    Sun, Yingwei
    Yang, Haiping
    2018 7TH INTERNATIONAL CONFERENCE ON AGRO-GEOINFORMATICS (AGRO-GEOINFORMATICS), 2018, : 405 - 409
  • [42] NEW NETWORK BASED ON D-LINKNET AND DENSENET FOR HIGH RESOLUTION SATELLITE IMAGERY ROAD EXTRACTION
    Peng, Bo
    Li, Yuxia
    Fan, Kunlong
    Yuan, Lang
    Tong, Ling
    He, Lei
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 3939 - 3942
  • [43] NEW NETWORK BASED ON D-LINKNET AND RESNEXT FOR HIGH RESOLUTION SATELLITE IMAGERY ROAD EXTRACTION
    Fan, Kunlong
    Li, Yuxia
    Yuan, Lang
    Si, Yu
    Tong, Ling
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 2599 - 2602
  • [44] Road Extraction From High Spatial Resolution Remote Sensing Image Based on Multi-Task Key Point Constraints
    Li, Xungen
    Zhang, Zhan
    Lv, Shuaishuai
    Pan, Mian
    Ma, Qi
    Yu, Haibin
    IEEE ACCESS, 2021, 9 : 95896 - 95910
  • [45] Fast and Efficient Evaluation of Building Damage From Very High Resolution Optical Satellite Images
    Dubois, David
    Lepage, Richard
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2014, 7 (10) : 4167 - 4176
  • [46] Full-Level Domain Adaptation for Building Extraction in Very-High-Resolution Optical Remote-Sensing Images
    Peng, Daifeng
    Guan, Haiyan
    Zang, Yufu
    Bruzzone, Lorenzo
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, 60
  • [47] Building footprint extraction and counting on very high-resolution satellite imagery using object detection deep learning framework
    Nurkarim, Wahidya
    Wijayanto, Arie Wahyu
    EARTH SCIENCE INFORMATICS, 2023, 16 (01) : 515 - 532
  • [48] Building footprint extraction and counting on very high-resolution satellite imagery using object detection deep learning framework
    Wahidya Nurkarim
    Arie Wahyu Wijayanto
    Earth Science Informatics, 2023, 16 : 515 - 532
  • [49] Building Extraction From Very High-Resolution Remote Sensing Image With Few Data
    Cui, Zhenqi
    Nie, Pei
    Persello, Claudio
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2024, 21
  • [50] An Adaptive Multifeature Method for Semiautomatic Road Extraction From High-Resolution Stereo Mapping Satellite Images
    Pan, Hong
    Jia, Yonghong
    Lv, Zhen
    IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2019, 16 (02) : 201 - 205