Instance Segmentation for Large, Multi-Channel Remote Sensing Imagery Using Mask-RCNN and a Mosaicking Approach

被引:69
作者
Carvalho, Osmar Luiz Ferreira de [1 ]
de Carvalho Junior, Osmar Abilio [2 ]
Albuquerque, Anesmar Olino de [2 ]
Bem, Pablo Pozzobon de [2 ]
Silva, Cristiano Rosa [1 ]
Ferreira, Pedro Henrique Guimaraes [1 ]
Moura, Rebeca dos Santos de [2 ]
Gomes, Roberto Arnaldo Trancoso [2 ]
Guimaraes, Renato Fontes [2 ]
Borges, Dibio Leandro [3 ]
机构
[1] Univ Brasilia, Dept Engn Eletr, Campus Univ Darcy Ribeiro,Asa Norte, BR-70910900 Brasilia, DF, Brazil
[2] Univ Brasilia, Dept Geog, Campus Univ Darcy Ribeiro,Asa Norte, BR-70910900 Brasilia, DF, Brazil
[3] Univ Brasilia, Dept Ciencia Computacao, Campus Univ Darcy Ribeiro,Asa Norte, BR-70910900 Brasilia, DF, Brazil
关键词
instance segmentation; multi-channel imagery; mask R-CNN; deep learning; COCO; Landsat-8; center pivot; DEEP LEARNING FRAMEWORK; SEMANTIC SEGMENTATION; OBJECT DETECTION; NEURAL-NETWORK; LAND-COVER; CLOUD DETECTION; TIME-SERIES; CLASSIFICATION; AREAS; MODEL;
D O I
10.3390/rs13010039
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Instance segmentation is the state-of-the-art in object detection, and there are numerous applications in remote sensing data where these algorithms can produce significant results. Nevertheless, one of the main problems is that most algorithms use Red, Green, and Blue (RGB) images, whereas Satellite images often present more channels that can be crucial to improve performance. Therefore, the present work brings three contributions: (a) conversion system from ground truth polygon data into the Creating Common Object in Context (COCO) annotation format; (b) Detectron2 software source code adaptation and application on multi-channel imagery; and (c) large scene image mosaicking. We applied the procedure in a Center Pivot Irrigation System (CPIS) dataset with ground truth produced by the Brazilian National Water Agency (ANA) and Landsat-8 Operational Land Imager (OLI) imagery (7 channels with 30-m resolution). Center pivots are a modern irrigation system technique with massive growth potential in Brazil and other world areas. The round shapes with different textures, colors, and spectral behaviors make it appropriate to use Deep Learning instance segmentation. We trained the model using 512 x 512-pixel sized patches using seven different backbone structures (ResNet50- Feature Pyramid Network (FPN), Resnet50-DC5, ResNet50-C4, Resnet101-FPN, Resnet101-DC5, ResNet101-FPN, and ResNeXt101-FPN). The model evaluation used standard COCO metrics (Average Precision (AP), AP(50), AP(75), AP(small), AP(medium), and AR(100)). ResNeXt101-FPN had the best results, with a 3% advantage over the second-best model (ResNet101-FPN). We also compared the ResNeXt101-FPN model in the seven-channel and RGB imagery, where the multi-channel model had a 3% advantage, demonstrating great improvement using a larger number of channels. This research is also the first with a mosaicking algorithm using instance segmentation models, where we tested in a 1536 x 1536-pixel image using a non-max suppression sorted by area method. The proposed methodology is innovative and suitable for many other remote sensing problems and medical imagery that often present more channels.
引用
收藏
页码:1 / 24
页数:24
相关论文
共 132 条
[41]   Survey of recent progress in semantic image segmentation with CNNs [J].
Geng, Qichuan ;
Zhou, Zhong ;
Cao, Xiaochun .
SCIENCE CHINA-INFORMATION SCIENCES, 2018, 61 (05)
[42]   Region-Based Convolutional Networks for Accurate Object Detection and Segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2016, 38 (01) :142-158
[43]   Rich feature hierarchies for accurate object detection and semantic segmentation [J].
Girshick, Ross ;
Donahue, Jeff ;
Darrell, Trevor ;
Malik, Jitendra .
2014 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2014, :580-587
[44]   Dark Spot Detection in SAR Images of Oil Spill Using Segnet [J].
Guo, Hao ;
Wei, Guo ;
An, Jubai .
APPLIED SCIENCES-BASEL, 2018, 8 (12)
[45]   A review of semantic segmentation using deep neural networks [J].
Guo, Yanming ;
Liu, Yu ;
Georgiou, Theodoros ;
Lew, Michael S. .
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL, 2018, 7 (02) :87-93
[46]  
He K., 2017, IEEE INT C COMPUT VI, P2961, DOI [10.1109/iccv.201, DOI 10.1109/ICCV.2017.322]
[47]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[48]  
HELLER RC, 1979, PHOTOGRAMM ENG REM S, V45, P1379
[49]   Urban land-use mapping using a deep convolutional neural network with high spatial resolution multispectral remote sensing imagery [J].
Huang, Bo ;
Zhao, Bei ;
Song, Yimeng .
REMOTE SENSING OF ENVIRONMENT, 2018, 214 :73-86
[50]   Automatic extraction of urban impervious surfaces based on deep learning and multi-source remote sensing data [J].
Huang, Fenghua ;
Yu, Ying ;
Feng, Tinghao .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 60 :16-27