Instance Segmentation for Large, Multi-Channel Remote Sensing Imagery Using Mask-RCNN and a Mosaicking Approach

被引:66
作者
Carvalho, Osmar Luiz Ferreira de [1 ]
de Carvalho Junior, Osmar Abilio [2 ]
Albuquerque, Anesmar Olino de [2 ]
Bem, Pablo Pozzobon de [2 ]
Silva, Cristiano Rosa [1 ]
Ferreira, Pedro Henrique Guimaraes [1 ]
Moura, Rebeca dos Santos de [2 ]
Gomes, Roberto Arnaldo Trancoso [2 ]
Guimaraes, Renato Fontes [2 ]
Borges, Dibio Leandro [3 ]
机构
[1] Univ Brasilia, Dept Engn Eletr, Campus Univ Darcy Ribeiro,Asa Norte, BR-70910900 Brasilia, DF, Brazil
[2] Univ Brasilia, Dept Geog, Campus Univ Darcy Ribeiro,Asa Norte, BR-70910900 Brasilia, DF, Brazil
[3] Univ Brasilia, Dept Ciencia Computacao, Campus Univ Darcy Ribeiro,Asa Norte, BR-70910900 Brasilia, DF, Brazil
关键词
instance segmentation; multi-channel imagery; mask R-CNN; deep learning; COCO; Landsat-8; center pivot; CONVOLUTIONAL NEURAL-NETWORK; DEEP LEARNING FRAMEWORK; SEMANTIC SEGMENTATION; OBJECT DETECTION; LAND-COVER; CLOUD DETECTION; TIME-SERIES; CLASSIFICATION; AREAS; MODEL;
D O I
10.3390/rs13010039
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Instance segmentation is the state-of-the-art in object detection, and there are numerous applications in remote sensing data where these algorithms can produce significant results. Nevertheless, one of the main problems is that most algorithms use Red, Green, and Blue (RGB) images, whereas Satellite images often present more channels that can be crucial to improve performance. Therefore, the present work brings three contributions: (a) conversion system from ground truth polygon data into the Creating Common Object in Context (COCO) annotation format; (b) Detectron2 software source code adaptation and application on multi-channel imagery; and (c) large scene image mosaicking. We applied the procedure in a Center Pivot Irrigation System (CPIS) dataset with ground truth produced by the Brazilian National Water Agency (ANA) and Landsat-8 Operational Land Imager (OLI) imagery (7 channels with 30-m resolution). Center pivots are a modern irrigation system technique with massive growth potential in Brazil and other world areas. The round shapes with different textures, colors, and spectral behaviors make it appropriate to use Deep Learning instance segmentation. We trained the model using 512 x 512-pixel sized patches using seven different backbone structures (ResNet50- Feature Pyramid Network (FPN), Resnet50-DC5, ResNet50-C4, Resnet101-FPN, Resnet101-DC5, ResNet101-FPN, and ResNeXt101-FPN). The model evaluation used standard COCO metrics (Average Precision (AP), AP(50), AP(75), AP(small), AP(medium), and AR(100)). ResNeXt101-FPN had the best results, with a 3% advantage over the second-best model (ResNet101-FPN). We also compared the ResNeXt101-FPN model in the seven-channel and RGB imagery, where the multi-channel model had a 3% advantage, demonstrating great improvement using a larger number of channels. This research is also the first with a mosaicking algorithm using instance segmentation models, where we tested in a 1536 x 1536-pixel image using a non-max suppression sorted by area method. The proposed methodology is innovative and suitable for many other remote sensing problems and medical imagery that often present more channels.
引用
收藏
页码:1 / 24
页数:24
相关论文
共 132 条
  • [1] Agencia Nacional de aguas, 2019, LEV AGR IRR PIV CENT
  • [2] Agencia Nacional de aguas, 2016, LEV AGR IRR PIV CENT
  • [3] Althoff D., 2019, Irriga, V1, P56, DOI [DOI 10.15809/IRRIGA.2019V1N1P56-61, 10.15809/irriga.2019v1n1p56-61]
  • [4] Deep Learning Approach for Car Detection in UAV Imagery
    Ammour, Nassim
    Alhichri, Haikel
    Bazi, Yakoub
    Benjdira, Bilel
    Alajlan, Naif
    Zuair, Mansour
    [J]. REMOTE SENSING, 2017, 9 (04)
  • [5] ERISNet: deep neural network for Sargassum detection along the coastline of the Mexican Caribbean
    Arellano-Verdejo, Javier
    Lazcano-Hernandez, Hugo E.
    Cabanillas-Teran, Nancy
    [J]. PEERJ, 2019, 7
  • [6] Pixelwise Instance Segmentation with a Dynamically Instantiated Network
    Arnab, Anurag
    Torr, Philip H. S.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 879 - 888
  • [7] Deep semantic segmentation of natural and medical images: a review
    Asgari Taghanaki, Saeid
    Abhishek, Kumar
    Cohen, Joseph Paul
    Cohen-Adad, Julien
    Hamarneh, Ghassan
    [J]. ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (01) : 137 - 178
  • [8] Beyond RGB: Very high resolution urban remote sensing with multimodal deep networks
    Audebert, Nicolas
    Le Saux, Bertrand
    Lefevre, Sebastien
    [J]. ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING, 2018, 140 : 20 - 32
  • [9] Segment-before-Detect: Vehicle Detection and Classification through Semantic Segmentation of Aerial Images
    Audebert, Nicolas
    Le Saux, Bertrand
    Lefevre, Sebastien
    [J]. REMOTE SENSING, 2017, 9 (04)
  • [10] Deep Watershed Transform for Instance Segmentation
    Bai, Min
    Urtasun, Raquel
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2858 - 2866