Semantic Segmentation of Aerial Imagery Using U-Net with Self-Attention and Separable Convolutions

Cited by: 7

Authors:

Khan, Bakht Alam [1]

Jung, Jin-Woo [1]

Affiliations:

[1] Dongguk Univ, Dept Comp Sci & Engn, Seoul 04620, South Korea

Source:

APPLIED SCIENCES-BASEL | 2024 / Vol. 14 / Issue 09

Keywords:

semantic segmentation; U-Net; self-attention; separable convolutions; aerial imagery; remote sensing; RESOLUTION; SATELLITE; NETWORK;

DOI:

10.3390/app14093712

Chinese Library Classification number:

O6 [Chemistry]

Subject classification code:

0703

Abstract:

This research addresses the task of improving accuracy in the semantic segmentation of aerial imagery, which is essential for applications such as urban planning and environmental monitoring. The study emphasizes the Intersection over Union (IoU) score as an evaluation metric and uses the Patchify library, with a patch size of 256, to augment the dataset, which is then split into training and testing sets. The core of the investigation is a novel architecture that combines a U-Net framework with self-attention mechanisms and separable convolutions. Self-attention enhances the model's understanding of image context, while separable convolutions speed up training and contribute to overall efficiency. The proposed model reaches an accuracy of 91%, surpassing the 86% of the previous state-of-the-art Dense Plus U-Net. Visual comparisons of original image patches, original mask patches, and predicted mask patches illustrate the model's segmentation quality. The results underscore the value of these architectural elements for accuracy and efficiency in aerial image analysis.
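Two ideas named in the abstract are easy to quantify: the IoU metric, and the weight savings that make separable convolutions cheaper than standard ones. The following is a minimal sketch in plain Python, not the authors' implementation. The function names, the 3 x 3 kernel, the 64 and 128 channel counts, and the toy label sequences are all assumptions chosen for illustration; the record does not give the paper's layer configuration or data.

```python
def conv_params(k, c_in, c_out):
    """Weight count of a standard k x k convolution (bias omitted)."""
    return k * k * c_in * c_out


def separable_conv_params(k, c_in, c_out):
    """Weight count of a depthwise separable convolution (bias omitted).

    Depthwise step: one k x k filter per input channel.
    Pointwise step: a 1 x 1 convolution mixing c_in channels into c_out.
    """
    return k * k * c_in + c_in * c_out


def iou(pred, target, cls):
    """Intersection over Union for one class over flat label sequences."""
    pairs = list(zip(pred, target))
    inter = sum(1 for p, t in pairs if p == cls and t == cls)
    union = sum(1 for p, t in pairs if p == cls or t == cls)
    # The class is absent from both sequences: IoU is undefined.
    return inter / union if union else float("nan")


# Hypothetical layer: 3 x 3 kernel, 64 input channels, 128 output channels.
standard = conv_params(3, 64, 128)
separable = separable_conv_params(3, 64, 128)
print(standard, separable, round(standard / separable, 2))

# Hypothetical flattened label maps (class ids per pixel).
pred = [0, 1, 1, 2, 1, 0]
target = [0, 1, 2, 2, 1, 1]
print(iou(pred, target, cls=1))
```

For the assumed layer, the separable form needs 8,768 weights against 73,728 for the standard convolution, roughly an eightfold reduction. That reduction is the usual reason separable convolutions lower training cost. In the toy IoU example, class 1 is matched at two pixels out of four where either sequence predicts it, giving 0.5.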


Pages: 15
