Semantic Segmentation of High Spatial Resolution Remote Sensing Imagery Based on Weighted Attention U-Net

被引:1
作者
Zhang, Yue [1 ]
Wang, Leiguang [2 ]
Yang, Ruiqi [3 ]
Chen, Nan [1 ]
Zhao, Yili [3 ]
Dai, Qinling [4 ]
机构
[1] Southwest Forestry Univ, Fac Forestry, Kunming, Yunnan, Peoples R China
[2] Southwest Forestry Univ, Inst Big Data & Artificial Intelligence, Kunming, Yunnan, Peoples R China
[3] Southwest Forestry Univ, Coll Big Data & Intelligent Engn, Kunming, Yunnan, Peoples R China
[4] Southwest Forestry Univ, Art & Design Coll, Kunming, Yunnan, Peoples R China
来源
FOURTEENTH INTERNATIONAL CONFERENCE ON GRAPHICS AND IMAGE PROCESSING, ICGIP 2022 | 2022年 / 12705卷
关键词
Semantic segmentation; deep learning; attention gate model; weighted attention U-Net; GID dataset;
D O I
10.1117/12.2680206
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, with the development of deep learning and attention mechanism, more research has been carried out to realize semantic image segmentation based on deep learning integrated attention mechanisms. However, the current semantic segmentation methods have low segmentation accuracy, high computation cost, and serious loss of detailed information. In this paper, a lightweight designed attention gate model was introduced to reduce the computation cost. And because it can suppress irrelevant regions in the input image, while highlighting the salient features of specific tasks, the combination of the two weighting factors input features (x(l)) and gating signal (g) in this structure can improve segmentation accuracy and reduce loss of detail. Therefore, this study used the weighted attention U-Net network to perform semantic segmentation on the GID dataset and finally evaluated it on the four indicators of Precision, Recall, F1-Sorce, and mIoU. This result shows that different weight values have a more significant impact on the experimental results. The attention U-Net with the best weight combination compared with the traditional U-Net network, Precision, Recall, F1-Sorce, and mIoU are increased by 0.88%, 1.4%, 1.13%, and 1.2%, respectively. Compared with the original attention U-Net, Precision, Recall, F1-Sorce, and mIoU are increased by 0.86%, 1.24%, 1.04%, and 1.75%, respectively.
引用
收藏
页数:9
相关论文
共 24 条
[21]   Semantic Annotation of High-Resolution Satellite Images via Weakly Supervised Learning [J].
Yao, Xiwen ;
Han, Junwei ;
Cheng, Gong ;
Qian, Xueming ;
Guo, Lei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2016, 54 (06) :3660-3671
[22]  
Yuan Y., 2018, arXiv
[23]   Multigranularity Multiclass-Layer Markov Random Field Model for Semantic Segmentation of Remote Sensing Images [J].
Zheng, Chen ;
Zhang, Yun ;
Wang, Leiguang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2021, 59 (12) :10555-10574
[24]   Semantic Segmentation of Remote Sensing Imagery Using an Object-Based Markov Random Field Model With Auxiliary Label Fields [J].
Zheng, Chen ;
Zhang, Yun ;
Wang, Leiguang .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2017, 55 (05) :3015-3028