Remote Sensing Image Road Segmentation Method Integrating CNN-Transformer and UNet

Cited by: 2
Authors
Wang, Rui [1]
Cai, Mingxiang [1]
Xia, Zixuan [2]
Zhou, Zhicui [3]
Affiliations
[1] China Transport Telecommun & Informat Ctr, Beijing 100011, Peoples R China
[2] Heilongjiang Univ Technol, Harbin 150022, Heilongjiang, Peoples R China
[3] No 1 Middle Sch Weifang, Jixi 150022, Heilongjiang, Peoples R China
Keywords
Road segmentation; deep learning; CNN-transformer; attention; UNet; extraction
DOI
10.1109/ACCESS.2023.3344797
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Real-time, accurate road information is crucial for updating electronic navigation maps. To address the low precision and poor robustness of current semantic segmentation methods for road extraction from remote sensing imagery, we propose an attention-enhanced UNet model for road semantic segmentation. First, we introduce a CNN-Transformer hybrid structure into the encoder to strengthen the extraction of both global features and local details. Second, we replace the traditional upsampling module in the decoder with a dual upsampling module to improve feature extraction and segmentation accuracy. Third, we use the hard-swish activation function instead of ReLU; its smoother curve helps improve generalization and non-linear feature extraction and helps avoid vanishing gradients. Finally, we use a composite loss function that combines cross entropy and Dice loss to constrain the segmentation result more tightly and further improve accuracy. The model is validated experimentally on the Ottawa Road Dataset and the Massachusetts Road Dataset. Compared with U-Net, PSPNet, DeepLab V3, and TransUNet, the proposed method achieves the best MIoU, MPA, and F1 score. Its MPA reaches 95.48% on the Ottawa Road Dataset and 92.56% on the Massachusetts Road Dataset, showing good performance in road extraction.
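The abstract names two components that are easy to illustrate in isolation: the hard-swish activation and a loss that combines cross entropy with Dice. The following is a minimal pure-Python sketch, not the authors' implementation. It assumes the common hard-swish definition x * ReLU6(x + 3) / 6, a binary road/background setting, and a plain weighted sum of the two loss terms. The function names and the `ce_weight`, `dice_weight`, and `smooth` parameters are hypothetical, since the abstract does not state how the terms are weighted.

```python
import math


def hard_swish(x: float) -> float:
    """Hard-swish, assuming the common form x * ReLU6(x + 3) / 6."""
    relu6 = min(max(x + 3.0, 0.0), 6.0)
    return x * relu6 / 6.0


def cross_entropy(probs, targets, eps=1e-7):
    """Mean binary cross entropy over pixels.

    probs: predicted road probabilities in [0, 1]; targets: 0/1 labels.
    """
    total = 0.0
    for p, t in zip(probs, targets):
        p = min(max(p, eps), 1.0 - eps)  # clamp to keep log() finite
        total -= t * math.log(p) + (1 - t) * math.log(1.0 - p)
    return total / len(probs)


def dice_loss(probs, targets, smooth=1.0):
    """Soft Dice loss: 1 - (2 * overlap + smooth) / (sum of areas + smooth)."""
    overlap = sum(p * t for p, t in zip(probs, targets))
    areas = sum(probs) + sum(targets)
    return 1.0 - (2.0 * overlap + smooth) / (areas + smooth)


def combined_loss(probs, targets, ce_weight=1.0, dice_weight=1.0):
    """Weighted sum of cross entropy and Dice (weights are an assumption)."""
    return (ce_weight * cross_entropy(probs, targets)
            + dice_weight * dice_loss(probs, targets))


if __name__ == "__main__":
    # Flattened toy "image": three pixels, two of them road.
    labels = [1, 0, 1]
    print(combined_loss([0.9, 0.2, 0.8], labels))
    print(hard_swish(1.0))
```

The Dice term measures overlap between the predicted and labeled road pixels, so it is less dominated by the large background class than per-pixel cross entropy alone, which is a commonly cited reason for combining the two in thin-structure segmentation tasks such as roads.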
Pages: 144446-144455
Page count: 10