Segmentation of farmlands in aerial images by deep learning framework with feature fusion and context aggregation modules

被引:13
作者
Khan, Sultan Daud [1 ]
Alarabi, Louai [2 ]
Basalamah, Saleh [3 ]
机构
[1] Natl Univ Technol, Dept Comp Sci, Islamabad, Pakistan
[2] Umm Al Qura Univ, Dept Comp Sci, Mecca, Saudi Arabia
[3] Umm Al Qura Univ, Dept Comp Engn, Mecca, Saudi Arabia
关键词
Smart farming; Agriculture; Semantic segmentation; Deep learning; Aerial images; UNET PLUS PLUS; SEMANTIC SEGMENTATION; AGRICULTURE;
D O I
10.1007/s11042-023-14962-5
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automated segmentation of farmland patterns in high resolution aerial images is very crucial for smart farming. Recently, deep learning techniques have achieved tremendous success in various semantic segmentation tasks, however, little efforts have been made in farmland semantic segmentation in high resolution aerial images. Farmland semantic segmentation in aerial images is a challenging task due to large variation in scales and shapes of agriculture patterns. Furthermore, different agriculture patterns share similar visual features that usually result in mis-classifications of pixels. To efficiently tackle these problems, we propose a deep learning framework that captures scene context and aggregate multi-scale information from different convolutional blocks. Generally, the framework consists of two main modules:(1) feature fusion module and (2) global contextual module. Feature fusion module combines the feature maps of different convolutional blocks to capture wide variation in object scales, while global contextual module aggregate rich contextual information from different regions of the image by employing pyramid pooling module. We gauge the performance of proposed framework on challenging benchmarks dataset, Agriculture-vision and also compare our results with various state-of-the-art methods. From experiment results, we demonstrate that the proposed framework achieves best performance in identifying various complex agriculture patterns and supersedes state-of-the-art methods.
引用
收藏
页码:42353 / 42372
页数:20
相关论文
共 87 条
  • [1] Al-amri Salem Saleh., 2010, Journal of Computing, V2, P83
  • [2] [Anonymous], 2008, BMVC
  • [3] [Anonymous], 2014, arXiv
  • [4] Arbeláez P, 2012, PROC CVPR IEEE, P3378, DOI 10.1109/CVPR.2012.6248077
  • [5] SegNet: A Deep Convolutional Encoder-Decoder Architecture for Image Segmentation
    Badrinarayanan, Vijay
    Kendall, Alex
    Cipolla, Roberto
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2017, 39 (12) : 2481 - 2495
  • [6] Bhatnagar V, 2020, STUD BIG DATA, V67, P1, DOI 10.1007/978-981-15-0663-5_1
  • [7] Bischke B, 2019, IEEE IMAGE PROC, P1480, DOI [10.1109/ICIP.2019.8803050, 10.1109/icip.2019.8803050]
  • [8] Feature-Fused SSD: Fast Detection for Small Objects
    Cao, Guimei
    Xie, Xuemei
    Yang, Wenzhe
    Liao, Quan
    Shi, Guangming
    Wu, Jinjian
    [J]. NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [9] Carreira J, 2012, LECT NOTES COMPUT SC, V7578, P430, DOI 10.1007/978-3-642-33786-4_32
  • [10] MFANet: A Multi-Level Feature Aggregation Network for Semantic Segmentation of Land Cover
    Chen, Bingyu
    Xia, Min
    Huang, Junqing
    [J]. REMOTE SENSING, 2021, 13 (04) : 1 - 20