Semantic Segmentation of Aerial Imagery Using U-Net with Self-Attention and Separable Convolutions

被引：7

作者：

Khan, Bakht Alam ^{[1
]}

Jung, Jin-Woo ^{[1
]}

机构：

[1] Dongguk Univ, Dept Comp Sci & Engn, Seoul 04620, South Korea

来源：

APPLIED SCIENCES-BASEL | 2024年 / 14卷 / 09期

关键词：

semantic segmentation; U-Net; self-attention; separable convolutions; aerial imagery; remote sensing; RESOLUTION; SATELLITE; NETWORK;

D O I：

10.3390/app14093712

中图分类号：

O6 [化学];

学科分类号：

0703 ;

摘要：

This research addresses the crucial task of improving accuracy in the semantic segmentation of aerial imagery, essential for applications such as urban planning and environmental monitoring. This study emphasizes the significance of maintaining the Intersection over Union (IOU) score as a metric and employs data augmentation with the Patchify library, using a patch size of 256, to effectively augment the dataset, which is subsequently split into training and testing sets. The core of this investigation lies in a novel architecture that combines a U-Net framework with self-attention mechanisms and separable convolutions. The introduction of self-attention mechanisms enhances the model's understanding of image context, while separable convolutions expedite the training process, contributing to overall efficiency. The proposed model demonstrates a substantial accuracy improvement, surpassing the previous state-of-the-art Dense Plus U-Net, achieving an accuracy of 91% compared to the former's 86%. Visual representations, including original patch images, original masked patches, and predicted patch masks, showcase the model's proficiency in semantic segmentation, marking a significant advancement in aerial image analysis and underscoring the importance of innovative architectural elements for enhanced accuracy and efficiency in such tasks.

引用

页数：15

共 50 条

[21] Building segmentation from satellite imagery using U-Net with ResNet encoder [J].

Liu, Zhongwei ;

Chen, Baisong ;

Zhang, Ao .

2020 5TH INTERNATIONAL CONFERENCE ON MECHANICAL, CONTROL AND COMPUTER ENGINEERING (ICMCCE 2020), 2020, :1967-1971

[22] RAUNet: Residual Attention U-Net for Semantic Segmentation of Cataract Surgical Instruments [J].

Ni, Zhen-Liang ;

Bian, Gui-Bin ;

Zhou, Xiao-Hu ;

Hou, Zeng-Guang ;

Xie, Xiao-Liang ;

Wang, Chen ;

Zhou, Yan-Jie ;

Li, Rui-Qi ;

Li, Zhen .

NEURAL INFORMATION PROCESSING (ICONIP 2019), PT II, 2019, 11954 :139-149

[23] Attention U-Net for Semantic Segmentation of Moroccan Coastal Upwelling in SST Images [J].

Snoussi, Mohamed ;

El Fellah, Salma ;

Tamim, Ayoub ;

Koutti, Lahcen .

OPEN SCIENCE IN ENGINEERING, 2023, 763 :653-664

[24] TransAttention U-Net for Semantic Segmentation of Poppy [J].

Luo, Zifei ;

Yang, Wenzhu ;

Gou, Ruru ;

Yuan, Yunfeng .

ELECTRONICS, 2023, 12 (03)

[25] Improved U-NET Semantic Segmentation Network [J].

Gao, Xueyan ;

Fang, Lijin .

PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, :7090-7095

[26] Semantic segmentation of chemical plumes from airborne multispectral infrared images using U-Net [J].

Zizi Chen ;

Gary W. Small .

Neural Computing and Applications, 2022, 34 :20757-20771

[27] Semantic segmentation of chemical plumes from airborne multispectral infrared images using U-Net [J].

Chen, Zizi ;

Small, Gary W. .

NEURAL COMPUTING & APPLICATIONS, 2022, 34 (23) :20757-20771

[28] A Nested U-Net With Self-Attention and Dense Connectivity for Monaural Speech Enhancement [J].

Xiang, Xiaoxiao ;

Zhang, Xiaojuan ;

Chen, Haozhe .

IEEE SIGNAL PROCESSING LETTERS, 2022, 29 :105-109

[29] U-Net Transformer: Self and Cross Attention for Medical Image Segmentation [J].

Petit, Olivier ;

Thome, Nicolas ;

Rambour, Clement ;

Themyr, Loic ;

Collins, Toby ;

Soler, Luc .

MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2021, 2021, 12966 :267-276

[30] Semantic segmentation of human cell nucleus using deep U-Net and other versions of U-Net models [J].

Yadavendra ;

Chand, Satish .

NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2022, 33 (3-4) :167-186

← 1 2 3 4 5 →