Semantic Segmentation of Aerial Imagery Using U-Net with Self-Attention and Separable Convolutions

Cited by: 7
Authors
Khan, Bakht Alam [1]
Jung, Jin-Woo [1]
Affiliations
[1] Dongguk Univ, Dept Comp Sci & Engn, Seoul 04620, South Korea
Source
APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / No. 09
Keywords
semantic segmentation; U-Net; self-attention; separable convolutions; aerial imagery; remote sensing; RESOLUTION; SATELLITE; NETWORK;
DOI
10.3390/app14093712
CLC number
O6 [Chemistry];
Discipline classification code
0703;
Abstract
This research addresses the task of improving accuracy in the semantic segmentation of aerial imagery, which is essential for applications such as urban planning and environmental monitoring. The study emphasizes the Intersection over Union (IoU) score as a metric and uses the Patchify library, with a patch size of 256, to augment the dataset, which is then split into training and testing sets. The core of the investigation is a novel architecture that combines a U-Net framework with self-attention mechanisms and separable convolutions. Self-attention enhances the model's understanding of image context, while separable convolutions speed up training and contribute to overall efficiency. The proposed model achieves an accuracy of 91%, surpassing the 86% of the previous state-of-the-art Dense Plus U-Net. Visual comparisons of original image patches, original mask patches, and predicted mask patches illustrate the model's proficiency in semantic segmentation and underscore the importance of these architectural elements for accuracy and efficiency in aerial image analysis.
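As an illustration of two ideas the abstract relies on, the sketch below counts the weights in a standard convolution versus a depthwise separable one, and computes a per-class IoU on small label masks. It is a minimal sketch and not the authors' implementation: the function names, the channel sizes (64 in, 128 out, 3 x 3 kernel), and the toy masks are all assumptions chosen for the example, and biases are ignored.

```python
def conv_params(c_in, c_out, k):
    """Weights in a standard k x k convolution (bias omitted)."""
    return k * k * c_in * c_out


def separable_conv_params(c_in, c_out, k):
    """Weights in a depthwise separable convolution (bias omitted):
    one k x k filter per input channel, then a 1 x 1 pointwise mix."""
    return k * k * c_in + c_in * c_out


def iou(pred, target, cls):
    """Intersection over Union for one class on two equal-shaped 2D label masks."""
    inter = 0
    union = 0
    for p_row, t_row in zip(pred, target):
        for p, t in zip(p_row, t_row):
            in_pred = p == cls
            in_target = t == cls
            inter += in_pred and in_target
            union += in_pred or in_target
    # The class is absent from both masks: IoU is undefined.
    return inter / union if union else float("nan")


if __name__ == "__main__":
    # Hypothetical layer sizes, not taken from the paper.
    standard = conv_params(64, 128, 3)
    separable = separable_conv_params(64, 128, 3)
    print(standard, separable, round(standard / separable, 2))

    # Toy 2 x 2 masks with class labels 0 and 1.
    pred = [[1, 1], [0, 0]]
    target = [[1, 0], [0, 0]]
    print(iou(pred, target, cls=1))
```

For these assumed sizes the separable layer has roughly one eighth of the weights of the standard layer, which is the kind of saving behind the efficiency gain the abstract mentions. Note that the 91% and 86% figures in the abstract are reported as accuracy, not IoU.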
Pages: 15