Small sample remote sensing image segmentation based on multiscale feature fusion

被引:0
作者
Wang J. [1 ]
Zhang J. [1 ]
机构
[1] School of Engineering, Ocean University of China, Qingdao
来源
Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition) | 2022年 / 50卷 / 03期
关键词
Attentional mechanism; Multiscale feature fusion; Remote sensing images; Semantic segmentation; Transfer learning;
D O I
10.13245/j.hust.220312
中图分类号
学科分类号
摘要
Aiming at the problems of high cost of drawing remote sensing image labels and low detection accuracy of remote sensing image under the condition of limited training samples in actual scenes, the deep pyramid attention network (DPA-Net) combining multiscale feature fusion and attention mechanism was proposed to extract buildings and roads from small sample remote sensing images by integrating transfer learning method. Due to limited training samples containing information is limited, first of all, on the basis of DeeplabV3+ network architecture, two sources of low-level features are added to make full use of spatial information of low-level features, and the attention mechanism is used to obtain rich context information and enhance the learning ability of the target channel, reduce the response ability to other targets and noises, and to improve the poor detection effect of the model on small samples. Finally, the transfer learning method using open remote sensing image dataset and small sample dataset for joint training reduces the impact of too few training samples on network learning performance. Experimental results show that the accuracy of the proposed method is improved by 3.69% and the annotation cost can be saved by 1/2. © 2022, Editorial Board of Journal of Huazhong University of Science and Technology. All right reserved.
引用
收藏
页码:62 / 67
页数:5
相关论文
共 22 条
[1]  
CHEN X, L Q S, LU X Z, Et al., Automated regional seismic damage assessment of buildings using an unmanned aerial vehicle and a convolutional neural network, Automation in Construction, 109, 1, (2020)
[2]  
(2019)
[3]  
REN S Q, HE K M, GIRSHICK R, Et al., Faster R-CNN: towards real-time object detection with region proposal networks, IEEE Transactions on Pattern Analysis and Machine Intelligence, 39, 6, pp. 1137-1149, (2017)
[4]  
LIU W, ANGUELOV D, ERHAN D, Et al., SSD: single shot multiBoxdetector, Proceedings of the European Conference on Computer Vision, pp. 21-27, (2016)
[5]  
REDOM J, FARHADI A., YOLOv3: an incremental improvement
[6]  
(2019)
[7]  
46, 6
[8]  
BADRINARAVANAN V, KENDALL A, CIPOLLA R., SegNet: a deep convolutional encoder-decoder architecture for image segmentation, IEEE Transactions on Pattern Analysis Machine Intelligence, 39, 12, pp. 2481-2495, (2017)
[9]  
RONNEBERGER O, FISCHER P, BROX T., U-net: convolutional networks for biomedical image segmentation, Proceedings of International Conference on Medical Image Computing and Computer-Assisted Intervention, pp. 234-241, (2015)
[10]  
CHEN L, ZHU Y, PAPANDREOU G, Et al., Encoder- decoder with atrous separable convolution for semantic image segmentation, Proc of European Conference on Computer Vision, pp. 801-818, (2018)