Multispectral panoptic segmentation: Exploring the beach setting with worldview-3 imagery

被引:8
作者
de Carvalho, Osmar Luiz Ferreira [1 ]
de Carvalho Junior, Osmar Abilio [2 ]
de Albuquerque, Anesmar Olino [2 ]
Santana, Nickolas Castro [2 ]
Borges, Dibio Leandro [1 ]
Luiz, Argelica Saiaka [2 ]
Gomes, Roberto Arnaldo Trancoso [2 ]
Guimaraes, Renato Fontes [2 ]
机构
[1] Univ Brasilia, Dept Comp Sci, Brasilia, Campus Univ Darcy Ribeiro, BR-70910900 Brasilia, DF, Brazil
[2] Univ Brasilia, Dept Geog, Campus Univ Darcy Ribeiro, BR-70910900 Brasilia, Brazil
关键词
Deep learning; Dataset; Worldview-3; Segmentation; High-resolution image; Remote sensing; INSTANCE SEGMENTATION; CNN;
D O I
10.1016/j.jag.2022.102910
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
Panoptic segmentation is a recent and powerful task that tackles individual object recognition ( "things ") and multiple backgrounds ( "stuff ") simultaneously. Remote sensing studies with panoptic segmentation are still restricted and recent, with great application perspectives. In this sense, we propose the first multispectral panoptic segmentation study, considering the "thing "and "stuff'' classes in the beach scenario and evaluating different sets of spectral bands. Our methodology included developing a dataset with 3800 (3200 for training, 300 for validation, and 300 for testing) with 128 x 128 spatial dimensions and eight spectral bands considering fourteen classes (6 "thing "and 8 "stuff'' classes). We used WorldView-3 images from Praia do Futuro, Fortaleza, and pan-sharpening to improve spatial resolution. Five different spectral band configurations were considered: (1) all eight bands, (2) RGB + NIR1 + NIR2, (3) RGB + NIR1, (4) RGB + NIR2, and (5) only RGB. The model training used the Panoptic-FPN architecture with the same hyperparameter settings considering three backbones (ResNeXt-101, ResNet-101, and ResNet-50). The best result considered the ResNeXt-101 with all spectral bands. However, the results from the first four configurations were very similar, and the RGB alone was the only configuration with significantly lower results. We also evaluated 15 semantic segmentation models for a benchmark comparison for the Beach Dataset. We show in visual results that even though the semantic models may be precise, they fail at identifying unique targets, especially in crowded locations such as the beach. The panoptic segmentation allowed a necessary detailing and counting of tourist infrastructures and mapping of other background features, establishing an essential tool for inspecting beach areas.
引用
收藏
页数:10
相关论文
共 46 条
[1]   Instance segmentation of center pivot irrigation systems using multi-temporal SENTINEL-1 SAR images [J].
Albuquerque, Anesmar Olino de ;
Carvalho, Osmar Luiz Ferreira de ;
Silva, Cristiano Rosa e ;
Bem, Pablo Pozzobon de ;
Gomes, Roberto Arnaldo Trancoso ;
Borges, Dibio Leandro ;
Guimaraes, Renato Fontes ;
Pimentel, Concepta Margaret McManus ;
Junior, Osmar Abilio de Carvalho .
REMOTE SENSING APPLICATIONS-SOCIETY AND ENVIRONMENT, 2021, 23
[2]   Multispectral vineyard segmentation: A deep learning comparison study [J].
Barros, T. ;
Conde, P. ;
Goncalves, G. ;
Premebida, C. ;
Monteiro, M. ;
Ferreira, C. S. S. ;
Nunes, U. J. .
COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2022, 195
[3]   YOLACT plus plus Better Real-Time Instance Segmentation [J].
Bolya, Daniel ;
Zhou, Chong ;
Xiao, Fanyi ;
Lee, Yong Jae .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (02) :1108-1121
[4]   COCO-Stuff: Thing and Stuff Classes in Context [J].
Caesar, Holger ;
Uijlings, Jasper ;
Ferrari, Vittorio .
2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :1209-1218
[5]   Instance Segmentation for Large, Multi-Channel Remote Sensing Imagery Using Mask-RCNN and a Mosaicking Approach [J].
Carvalho, Osmar Luiz Ferreira de ;
de Carvalho Junior, Osmar Abilio ;
Albuquerque, Anesmar Olino de ;
Bem, Pablo Pozzobon de ;
Silva, Cristiano Rosa ;
Ferreira, Pedro Henrique Guimaraes ;
Moura, Rebeca dos Santos de ;
Gomes, Roberto Arnaldo Trancoso ;
Guimaraes, Renato Fontes ;
Borges, Dibio Leandro .
REMOTE SENSING, 2021, 13 (01) :1-24
[6]   Panoptic Segmentation on Panoramic Radiographs: Deep Learning-Based Segmentation of Various Structures Including Maxillary Sinus and Mandibular Canal [J].
Cha, Jun-Young ;
Yoon, Hyung-In ;
Yeo, In-Sung ;
Huh, Kyung-Hoe ;
Han, Jung-Suk .
JOURNAL OF CLINICAL MEDICINE, 2021, 10 (12)
[7]  
Chaurasia A, 2017, 2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP)
[8]   Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation [J].
Chen, Liang-Chieh ;
Zhu, Yukun ;
Papandreou, George ;
Schroff, Florian ;
Adam, Hartwig .
COMPUTER VISION - ECCV 2018, PT VII, 2018, 11211 :833-851
[9]   Panoptic-DeepLab: A Simple, Strong, and Fast Baseline for Bottom-Up Panoptic Segmentation [J].
Cheng, Bowen ;
Collins, Maxwell D. ;
Zhu, Yukun ;
Liu, Ting ;
Huang, Thomas S. ;
Adam, Hartwig ;
Chen, Liang-Chieh .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :12472-12482
[10]   The Cityscapes Dataset for Semantic Urban Scene Understanding [J].
Cordts, Marius ;
Omran, Mohamed ;
Ramos, Sebastian ;
Rehfeld, Timo ;
Enzweiler, Markus ;
Benenson, Rodrigo ;
Franke, Uwe ;
Roth, Stefan ;
Schiele, Bernt .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :3213-3223