Remote Sensing Image Road Segmentation Method Integrating CNN-Transformer and UNet

被引：2

作者：

Wang, Rui ^{[1
]}

Cai, Mingxiang ^{[1
]}

Xia, Zixuan ^{[2
]}

Zhou, Zhicui ^{[3
]}

机构：

[1] China Transport Telecommun & Informat Ctr, Beijing 100011, Peoples R China

[2] Heilongjiang Univ Technol, Harbin 150022, Heilongjiang, Peoples R China

[3] No 1 Middle Sch Weifang, Jixi 150022, Heilongjiang, Peoples R China

来源：

IEEE ACCESS | 2023年 / 11卷

关键词：

Road segmentation; deep learning; CNN-transformer; attention; UNet; EXTRACTION;

D O I：

10.1109/ACCESS.2023.3344797

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Real-time and accurate road information is crucial for updating electronic navigation maps. To address the problem of low precision and poor robustness in current semantic segmentation methods for road extraction from remote sensing imagery, we proposed a UNet road semantic segmentation model based on attention mechanism improvement. First, we introduce a CNN-Transformer hybrid structure to the encoder to enhance the feature extraction capabilities of global and local details. Second, the traditional upsampling module in the decoder is replaced with a dual upsampling module to improve feature extraction capabilities and segmentation accuracy. Furthermore, the hard-swish activation function is used instead of ReLU activation function to smooth the curve, which helps to improve the generalization and non-linear feature extraction abilities and avoid gradient vanishing. Finally, a comprehensive loss function combining cross entropy and dice is used to strengthen the segmentation result constraints and further improve segmentation accuracy. Experimental validation is performed on the Ottawa Road Dataset and the Massachusetts Road Dataset. Experimental results show that compared with U-Net, PSPNet, DeepLab V3 and TransUNet networks, this algorithm is the best in terms of MIoU, MPA and F1 score. Among them, on the Ottawa road data set, the MPA of this algorithm reached 95.48%. On the Massachusetts road data set, MPA is 92.56%. This method shows good performance in road extraction.

引用

页码：144446 / 144455

页数：10

共 41 条

[1] A Multi-Stage, Multi-Feature Machine Learning Approach to Detect Driver Sleepiness in Naturalistic Road Driving Conditions
Bakker, Bram
Zablocki, Bartosz
Baker, Angela
Riethmeister, Vanessa
Marx, Bernd
Iyer, Girish
Anund, Anna
Ahlstrom, Christer
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (05) : 4791 - 4800
[2] Chen J., 2021, arXiv
[3] Chen LC, 2017, Arxiv, DOI arXiv:1706.05587
[4] Corse-to-Fine Road Extraction Based on Local Dirichlet Mixture Models and Multiscale-High-Order Deep Learning
Chen, Ziyi
Fan, Wentao
Zhong, Bineng
Li, Jonathan
Du, Jixiang
Wang, Cheng
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (10) : 4283 - 4293
[5] Graph Spectral Regularized Tensor Completion for Traffic Data Imputation
Deng, Lei
Liu, Xiao-Yang
Zheng, Haifeng
Feng, Xinxin
Chen, Youjia
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 10996 - 11010
[6] Detection of Motorcycles in Urban Traffic Using Video Analysis: A Review
Espinosa, Jorge E.
Velastin, Sergio A.
Branch, John W.
[J]. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (10) : 6115 - 6130
[7] SUNet: Swin Transformer UNet for Image Denoising
Fan, Chi-Mao
Liu, Tsung-Jung
Liu, Kuan-Hsien
[J]. 2022 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 22), 2022, : 2333 - 2337
[8] Feng HF, 2022, IEEE T INTELL TRANSP, V23, P11052, DOI [10.1109/TITS.2021.3099023, 10.1145/3502871.3502872]
[9] Pseudonym Limitation for Privacy in Cooperative Transport Systems
Fouchal, Hacene
Boudra, Safia
Ercan, Secil
Yahiaoui, Rheri
[J]. IEEE NETWORK, 2020, 34 (03): : 73 - 77
[10] Howard AG, 2017, Arxiv, DOI arXiv:1704.04861

← 1 2 3 4 5 →