Case instance segmentation of small farmland based on Mask R-CNN of feature pyramid network with double attention mechanism in high resolution satellite images

被引:16
作者
Cao, Yangyang [1 ]
Zhao, Zuoxi [1 ,2 ]
Huang, Yuan [1 ]
Lin, Xu [1 ]
Luo, Shuyuan [1 ]
Xiang, Borui [1 ]
Yang, Houcheng [1 ]
机构
[1] South China Agr Univ, Coll Engn, Guangzhou 510642, Peoples R China
[2] South China Agr Univ, Key Lab Key Technol Agr Machine & Equipment, Minist Educ, Guangzhou 510642, Peoples R China
关键词
Farmland segmentation; Mask R-CNN; Small farms; Attention mechanism; VHR images; EXTRACTION; GROWTH; FIELDS; DETECT;
D O I
10.1016/j.compag.2023.108073
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
Accurate spatial information of farmland in small farms is very important to provide operable information to farmers, managers and decision makers. However, small farms have small area, irregular shape, and use a variety of planting crops, which makes their boundaries blurred, and the standard edge detection algorithm cannot accurately segment the farmland boundary. Therefore, the automatic delimitation of fields in small farms is a challenging task. Aiming at the above problems, this paper proposes an example segmentation method of Mask R-CNN based on dual attention mechanism feature pyramid network (DAFPN) to describe small farms. DAFPN is composed of two attention modules: spatial attention module (SPA) and channel attention module (CHA) to enhance its feature extraction ability. Spatial attention module (SPA) generates spatial attention map by using the spatial relationship of features, and generates information to be emphasized or suppressed in spatial location; The channel attention module (CHA) learns an adaptive channel merging method based on the attention mechanism. Our proposed DAFPN can be easily inserted into the existing FPN model. We have conducted extensive experimental analysis on very high resolution (VHR) satellite images based on the Mask R-CNN deep learning framework of DAFPN. The standard COCO dataset evaluation index and F1-score evaluation strategy are used to compare the algorithm. AP50, AP75 and F1-score reach 82.86%, 55.51% and 70.90% respectively, which is 8.7%, 8.31% and 6.87% higher than Mask R-CNN respectively. Our results highlight the ability of Mask R-CNN based on DAFPN to accurately depict small farms in VHR satellite images, which lays a foundation for the automatic segmentation of small farms.
引用
收藏
页数:11
相关论文
共 53 条
[1]   SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation [J].
Badrinarayanan, Vijay ;
Kendall, Alex ;
Cipolla, Roberto .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) :2481-2495
[2]   A DEEP LEARNING APPROACH TO THE CLASSIFICATION OF SUB-DECIMETRE RESOLUTION AERIAL IMAGES [J].
Bergado, John Ray ;
Persello, Claudio ;
Gevaert, Caroline .
2016 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS), 2016, :1516-1519
[3]  
Bertasius G, 2015, PROC CVPR IEEE, P4380, DOI 10.1109/CVPR.2015.7299067
[5]   Image Segmentation Based on Constrained Spectral Variance Difference and Edge Penalty [J].
Chen, Bo ;
Qiu, Fang ;
Wu, Bingfang ;
Du, Hongyue .
REMOTE SENSING, 2015, 7 (05) :5980-6004
[6]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[7]   When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs [J].
Cheng, Gong ;
Yang, Ceyuan ;
Yao, Xiwen ;
Guo, Lei ;
Han, Junwei .
IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05) :2811-2821
[8]   Boundary-Preserving Mask R-CNN [J].
Cheng, Tianheng ;
Wang, Xinggang ;
Huang, Lichao ;
Liu, Wenyu .
COMPUTER VISION - ECCV 2020, PT XIV, 2020, 12359 :660-676
[9]   Contour Detection for UAV-Based Cadastral Mapping [J].
Crommelinck, Sophie ;
Bennett, Rohan ;
Gerke, Markus ;
Yang, Michael Ying ;
Vosselman, George .
REMOTE SENSING, 2017, 9 (02)
[10]   A generalized computer vision approach to mapping crop fields in heterogeneous agricultural landscapes [J].
Debats, Stephanie R. ;
Luo, Dee ;
Estes, Lyndon D. ;
Fuchs, Thomas J. ;
Caylor, Kelly K. .
REMOTE SENSING OF ENVIRONMENT, 2016, 179 :210-221