DSMSA-Net: Deep Spatial and Multi-scale Attention Network for Road Extraction in High Spatial Resolution Satellite Images

被引:33
作者
Khan, Sultan Daud [1 ]
Alarabi, Louai [2 ]
Basalamah, Saleh [3 ]
机构
[1] Natl Univ Technol NUTECH, Dept Comp Sci, Islamabad, Pakistan
[2] Umm Al Qura Univ, Dept Comp Sci, Mecca, Saudi Arabia
[3] Umm Al Qura Univ, Dept Comp Engn, Mecca, Saudi Arabia
关键词
Road extraction; Attention models; Deep learning; Semantic segmentation; CENTERLINE EXTRACTION; NEURAL-NETWORK; SEGMENTATION;
D O I
10.1007/s13369-022-07082-z
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Road segmentation in high spatial resolution satellite images is an important research topic and has numerous applications in traffic monitoring and intelligent transportation systems. With the growing increase in urban population and rapid changes in the urban environment, persistent updates are required in road databases. Therefore, it is required to develop a robust system that automatically analyzes high spatial resolution satellite images and extracts road networks. Such systems can be persistently used for updating road databases and provide meaningful support to intelligent transportation systems. However, road extraction from aerial images is a challenging problem due to cluttered background, inter-class correlation, and occlusions in the scene. To address the above complex problems, we propose an encoder-decoder network, namely, DSMSA-Net, integrated with attention units to cope with road segmentation tasks in high spatial resolution satellite images. The encoder part of the network extracts multi-scale features from different convolutional layers. The decoder part consists of two modules: Scale Attention Unit (SaAU) and Spatial Attention Unit (SpAU). The first module, (SaAU), utilizes feature maps of different residual blocks of the encoder to extract multi-scale information. The second module, SpAU, improves the spatial representation of the region of interest and extracts meaningful contextual information. We use two publicly available challenging benchmark datasets, i.e., DeepGlobe and Massachusetts road dataset to evaluate the performance of the proposed framework. From quantitative and qualitative comparisons, we demonstrate the proposed framework achieves superior performance compared to reference methods.
引用
收藏
页码:1907 / 1920
页数:14
相关论文
共 72 条
[1]   Deep Learning Approaches Applied to Remote Sensing Datasets for Road Extraction: A State-Of-The-Art Review [J].
Abdollahi, Abolfazl ;
Pradhan, Biswajeet ;
Shukla, Nagesh ;
Chakraborty, Subrata ;
Alamri, Abdullah .
REMOTE SENSING, 2020, 12 (09)
[2]   Hierarchical graph-based segmentation for extracting road networks from high-resolution satellite images [J].
Alshehhi, Rasha ;
Marpu, Prashanth Reddy .
ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2017, 126 :245-260
[3]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[4]   Fully Convolutional Network for Automatic Road Extraction from Satellite Imagery [J].
Buslaev, Alexander ;
Seferbekov, Selim ;
Iglovikov, Vladimir ;
Shvets, Alexey .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :197-200
[5]  
Chaurasia A, 2017, 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP)
[6]  
Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[7]   DeepLab: Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs [J].
Chen, Liang-Chieh ;
Papandreou, George ;
Kokkinos, Iasonas ;
Murphy, Kevin ;
Yuille, Alan L. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2018, 40 (04) :834-848
[8]   DeepGlobe 2018: A Challenge to Parse the Earth through Satellite Images [J].
Demir, Ilke ;
Koperski, Krzysztof ;
Lindenbaum, David ;
Pang, Guan ;
Huang, Jing ;
Bast, Saikat ;
Hughes, Forest ;
Tuia, Devis ;
Raskar, Ramesh .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :172-181
[9]   Assessing the population coverage of a health demographic surveillance system using satellite imagery and crowd-sourcing [J].
Di Pasquale, Aurelio ;
McCann, Robert S. ;
Maire, Nicolas .
PLOS ONE, 2017, 12 (08)
[10]   Residual Inception Skip Network for Binary Segmentation [J].
Doshi, Jigar .
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2018, :206-209