A comprehensive evaluation of deep vision transformers for road extraction from very-high-resolution satellite data

被引:0
|
作者
Bolcek, Jan [1 ,2 ]
Gibril, Mohamed Barakat A. [1 ]
Al-Ruzouq, Rami [1 ]
Shanableh, Abdallah [1 ,3 ]
Jena, Ratiranjan [1 ]
Hammouri, Nezar [1 ]
Sachit, Mourtadha Sarhan [4 ]
Ghorbanzadeh, Omid [5 ]
机构
[1] Univ Sharjah, Res Inst Sci & Engn, GIS & Remote Sensing Ctr, Sharjah 27272, U Arab Emirates
[2] Brno Univ Technol, Fac Elect Engn & Commun, Dept Radio Elect, Brno Kralovo Pole 61600, Czech Republic
[3] Australian Univ, Sci Res Ctr, Kuwait, Kuwait
[4] Univ Thi Qar, Coll Engn, Dept Civil Engn, Nasiriyah 64001, Thi Qar, Iraq
[5] Univ Nat Resources & Life Sci, Inst Geomat, Peter Jordan Str 82, A-1190 Vienna, Austria
来源
SCIENCE OF REMOTE SENSING | 2025年 / 11卷
关键词
Remote sensing; Road extraction; Satellite data; Semantic segmentation; Vision Transformers; REMOTE-SENSING IMAGERY; NETWORK;
D O I
10.1016/j.srs.2024.100190
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Transformer-based semantic segmentation architectures excel in extracting road networks from very-high- resolution (VHR) satellite images due to their ability to capture global contextual information. Nonetheless, there is a gap in research regarding their comparative effectiveness, efficiency, and performance in extracting road networks from multicity VHR data. This study evaluates 11 transformer-based models on three publicly available datasets (DeepGlobe Road Extraction Dataset, SpaceNet-3 Road Network Detection Dataset, and Massachusetts Road Dataset) to assess their performance, efficiency, and complexity in mapping road networks from multicity, multidate, and multisensory VHR optical satellite images. The evaluated models include Unified Perceptual Parsing for Scene Understanding (UperNet) based on the Swin transformer (UperNet-SwinT), and Multi-path Vision Transformer (UperNet-MpViT), Twins transformer, Segmenter, SegFormer, K-Net based on SwinT, Mask2Former based on SwinT (Mask2Former-SwinT), TopFormer, UniFormer, and PoolFormer. Results showed that the models recorded mean F-scores (mF-score) ranging from 82.22% to 90.70% for the DeepGlobe dataset, 58.98%-86.95% for the Massachusetts dataset, and 69.02%-86.14% for the SpaceNet-3 dataset. Mask2Former-SwinT, UperNet-MpViT, and SegFormer were the top performers among the evaluated models. The Mask2Former, based on the SwinT, demonstrated a strong balance of high performance across different satellite image datasets and moderate computational efficiency. This investigation aids in selecting the most suitable model for extracting road networks from remote sensing data.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Cascaded Attention DenseUNet (CADUNet) for Road Extraction from Very-High-Resolution Images
    Li, Jing
    Liu, Yong
    Zhang, Yindan
    Zhang, Yang
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2021, 10 (05)
  • [2] AN IMPROVED DEEP-LEARNING MODEL FOR ROAD EXTRACTION FROM VERY-HIGH-RESOLUTION REMOTE SENSING IMAGES
    Shen, Wangyao
    Chen, Yunping
    Cheng, Yuanlei
    Yang, Kangzhuo
    Guo, Xiang
    Sung, Yuan
    Chen, Yan
    2021 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM IGARSS, 2021, : 4660 - 4663
  • [3] Federated Deep Learning With Prototype Matching for Object Extraction From Very-High-Resolution Remote Sensing Images
    Zhang, Xiaokang
    Zhang, Boning
    Yu, Weikang
    Kang, Xudong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [4] Road Extraction from Very-High-Resolution Remote Sensing Images via a Nested SE-Deeplab Model
    Lin, Yeneng
    Xu, Dongyun
    Wang, Nan
    Shi, Zhou
    Chen, Qiuxiao
    REMOTE SENSING, 2020, 12 (18)
  • [5] Federated Deep Learning With Prototype Matching for Object Extraction From Very-High-Resolution Remote Sensing Images
    Zhang, Xiaokang
    Zhang, Boning
    Yu, Weikang
    Kang, Xudong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61
  • [6] IMAGE REGISTRATION OF VERY-HIGH-RESOLUTION SATELLITE IMAGES USING DEEP LEARNING MODEL FOR OUTLIER ELIMINATION
    Kim, Taeheon
    Yun, Yerin
    Lee, Changhui
    Yeom, Junho
    Han, Youkyung
    2022 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2022), 2022, : 155 - 158
  • [7] Robust road extraction for high resolution satellite images
    Christophe, Emmanuel
    Inglada, Jordi
    2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, : 2689 - 2692
  • [8] A Direction-Guided Ant Colony Optimization Method for Extraction of Urban Road Information From Very-High-Resolution Images
    Yin, Dandong
    Du, Shihong
    Wang, Shaowen
    Guo, Zhou
    IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2015, 8 (10) : 4785 - 4794
  • [9] Road network extraction from high resolution satellite images
    Li Gang
    Lai Shunnan
    Li Sheng
    Computer Aided Drafting,Design and Manufacturing, 2016, (02) : 1 - 7
  • [10] Urban Road Extraction from High-Resolution Optical Satellite Images
    Naouai, Mohamed
    Hamouda, Atef
    Weber, Christiane
    IMAGE ANALYSIS AND RECOGNITION, 2010, PT II, PROCEEDINGS, 2010, 6112 : 420 - +