Automatic water detection from multidimensional hierarchical clustering for Sentinel-2 images and a comparison with Level 2A processors

被引:69
作者
Cordeiro, Mauricio C. R. [1 ,2 ]
Martinez, Jean-Michel [2 ]
Pena-Luque, Santiago [3 ]
机构
[1] Agencia Nacl Aguas ANA, Setor Policial Sul, Area 5,Quadra 3, BR-70610200 Brasilia, DF, Brazil
[2] Univ Toulouse, CNRS, Geosci Environm Toulouse GET, IRD,Unite Mixte Rech 5563, F-31400 Toulouse, France
[3] Ctr Natl Etud Spati CNES, F-31401 Toulouse, France
关键词
Water detection; Water mask; Sentinel-2; Unsupervised clustering; Machine learning; naive bayes classifier; SURFACE-WATER; SATELLITE IMAGERY; CLOUD SHADOW; INDEX NDWI; EXTRACTION; OLI; MACHINE;
D O I
10.1016/j.rse.2020.112209
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
Continuous monitoring of water surfaces is essential for water resource management. This study presents a nonparametric unsupervised automatic algorithm for the identification of inland water pixels from multispectral satellite data using multidimensional clustering and a high-performance subsampling approach for large scenes. Clustering analysis is a technique that is used to identify similar samples in a multidimensional data space. The spectral information and derived indices were used to characterize each scene pixel individually. A machine learning approach with random subsampling and generalization through a Na ve Bayes classifier was also proposed to make the application of complex algorithms to large scenes feasible. Accuracy was evaluated using an independent dataset that provides water bodies in 15 Sentinel-2 images over France acquired in different seasons and that covers a large range of water bodies and water colour types. The validation dataset covers a water surface of more than 1200 km(2) (approximately 12 million pixels) including over 80,000 water bodies outlined using a semiautomatic active learning method, which were manually revised. The classification results were compared to the water pixel classification using three of the major Level 2A processors (MAJA, Sen2Cor and FMask) and two of the most common thresholding techniques: Otsu and Canny-edge. An input mask was used to remove coastal waters, clouds, shadows and snow pixels. Water pixels were identified automatically from the clustering process without the need for ancillary or pretrained data. Combinations using up to three water indices (Modified Normalized Difference Water Index-MNDWI, Normalized Difference Water Index-NDWI and Multiband Water Index-MBWI) and two reflectance bands (B8 and B12) were tested in the algorithm, and the best combination was NDWI-B12. Of all the methods, our method achieved the highest mean kappa score, 0.874, across all tested scenes, with a per-scene kappa ranging from 0.608 to 0.980, and the lowest mean standard deviation of 0.091. Standard Otsu's thresholding had the worst performance due to the lack of a bimodal histogram, and the Canny-edge variation achieved an overall kappa of 0.718 when used with the MNDWI. For water masks provided by generic processors, FMask outperformed MAJA and Sen2Cor and obtained an overall kappa of 0.764. In-depth analysis shows a quick drop in performance for all of the methods in identifying water bodies with a surface area below 0.5 ha, but the proposed approach outperformed the second best method by 34% in this size class.
引用
收藏
页数:17
相关论文
共 65 条
  • [1] Evaluation of Water Indices for Surface Water Extraction in a Landsat 8 Scene of Nepal
    Acharya, Tri Dev
    Subedi, Anoj
    Lee, Dong Ha
    [J]. SENSORS, 2018, 18 (08)
  • [2] Identification of Water Bodies in a Landsat 8 OLI Image Using a J48 Decision Tree
    Acharya, Tri Dev
    Lee, Dong Ha
    Yang, In Tae
    Lee, Jae Kang
    [J]. SENSORS, 2016, 16 (07)
  • [3] Retrieval of Chlorophyll a from Sentinel-2 MSI Data for the European Union Water Framework Directive Reporting Purposes
    Ansper, Ave
    Alikas, Krista
    [J]. REMOTE SENSING, 2019, 11 (01)
  • [4] Validation of Copernicus Sentinel-2 Cloud Masks Obtained from MAJA, Sen2Cor, and FMask Processors Using Reference Cloud Masks Generated with a Supervised Active Learning Procedure
    Baetens, Louis
    Desjardins, Camille
    Hagolle, Olivier
    [J]. REMOTE SENSING, 2019, 11 (04)
  • [5] Comparing Thresholding with Machine Learning Classifiers for Mapping Complex Water
    Bangira, Tsitsi
    Alfieri, Silvia Maria
    Menenti, Massimo
    van Niekerk, Adriaan
    [J]. REMOTE SENSING, 2019, 11 (11)
  • [6] Barbosa C.C.F., 2019, Introducao ao Sensoriamento Remoto de Sistemas Aquaticos: Principios e Aplicacoes, VFirst
  • [7] Retrospection and introspection on remote sensing of inland water quality: "Like Deja Vu All Over Again"
    Bukata, Robert P.
    [J]. JOURNAL OF GREAT LAKES RESEARCH, 2013, 39 : 2 - 5
  • [8] Recent Surface Water Extent of Lake Chad from Multispectral Sensors and GRACE
    Buma, Willibroad Gabila
    Lee, Sang-Il
    Seo, Jae Young
    [J]. SENSORS, 2018, 18 (07)
  • [9] Caliski T., 1974, COMMUN STAT, V3, P1, DOI [10.1080/03610927408827101, DOI 10.1080/03610927408827101]
  • [10] Indirect Assessment of Sedimentation in Hydropower Dams Using MODIS Remote Sensing Images
    Conde, Rita de Cassia
    Martinez, Jean-Michel
    Pessotto, Marco Aurelio
    Villar, Raul
    Cochonneau, Gerard
    Henry, Raul
    Lopes, Walszon
    Nogueira, Marcos
    [J]. REMOTE SENSING, 2019, 11 (03)