Automatic water detection from multidimensional hierarchical clustering for Sentinel-2 images and a comparison with Level 2A processors

Cited by: 79
Authors
Cordeiro, Mauricio C. R. [1, 2]
Martinez, Jean-Michel [2]
Pena-Luque, Santiago [3]
Affiliations
[1] Agencia Nacl Aguas ANA, Setor Policial Sul, Area 5,Quadra 3, BR-70610200 Brasilia, DF, Brazil
[2] Univ Toulouse, CNRS, Geosci Environm Toulouse GET, IRD,Unite Mixte Rech 5563, F-31400 Toulouse, France
[3] Ctr Natl Etud Spati CNES, F-31401 Toulouse, France
Keywords
Water detection; Water mask; Sentinel-2; Unsupervised clustering; Machine learning; Naive Bayes classifier; SURFACE-WATER; SATELLITE IMAGERY; CLOUD SHADOW; INDEX NDWI; EXTRACTION; OLI; MACHINE
DOI
10.1016/j.rse.2020.112209
Chinese Library Classification number
X [Environmental science, safety science]
Discipline classification code
08; 0830
Abstract
Continuous monitoring of water surfaces is essential for water resource management. This study presents a nonparametric unsupervised automatic algorithm for the identification of inland water pixels from multispectral satellite data using multidimensional clustering and a high-performance subsampling approach for large scenes. Clustering analysis is a technique that is used to identify similar samples in a multidimensional data space. The spectral information and derived indices were used to characterize each scene pixel individually. A machine learning approach with random subsampling and generalization through a Naïve Bayes classifier was also proposed to make the application of complex algorithms to large scenes feasible. Accuracy was evaluated using an independent dataset that provides water bodies in 15 Sentinel-2 images over France acquired in different seasons and that covers a large range of water bodies and water colour types. The validation dataset covers a water surface of more than 1200 km² (approximately 12 million pixels) including over 80,000 water bodies outlined using a semiautomatic active learning method and then manually revised. The classification results were compared to the water pixel classification using three of the major Level 2A processors (MAJA, Sen2Cor and FMask) and two of the most common thresholding techniques: Otsu and Canny-edge. An input mask was used to remove coastal waters, clouds, shadows and snow pixels. Water pixels were identified automatically from the clustering process without the need for ancillary or pretrained data. Combinations using up to three water indices (Modified Normalized Difference Water Index, MNDWI; Normalized Difference Water Index, NDWI; and Multiband Water Index, MBWI) and two reflectance bands (B8 and B12) were tested in the algorithm, and the best combination was NDWI-B12.
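The subsample, cluster, and generalize idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the synthetic reflectance ranges, the fixed number of clusters (k = 2), the centroid-linkage merge rule, the variance floor, and the rule "the cluster with the highest mean NDWI is water" are all assumptions made for illustration. Only the NDWI formula, (green - NIR) / (green + NIR), and the NDWI-B12 feature pair come from the literature and the abstract.

```python
import math
import random

random.seed(0)  # fixed seed so the sketch is reproducible

def ndwi(green, nir):
    """Normalized Difference Water Index: (green - NIR) / (green + NIR)."""
    return (green - nir) / (green + nir)

def make_pixel(is_water):
    """Return (NDWI, B12) for one synthetic pixel; reflectance ranges are made up."""
    if is_water:
        green, nir, b12 = (random.uniform(0.05, 0.07),
                           random.uniform(0.015, 0.025),
                           random.uniform(0.005, 0.015))
    else:
        green, nir, b12 = (random.uniform(0.07, 0.10),
                           random.uniform(0.25, 0.35),
                           random.uniform(0.18, 0.25))
    return (ndwi(green, nir), b12)

# Synthetic "scene": about 30% water pixels (hypothetical data, not Sentinel-2).
truth = [random.random() < 0.3 for _ in range(3000)]
scene = [make_pixel(w) for w in truth]

def centroid(cluster):
    """Per-feature mean of a list of feature tuples."""
    return [sum(col) / len(cluster) for col in zip(*cluster)]

def agglomerate(points, k):
    """Bottom-up clustering: repeatedly merge the two clusters with the closest centroids."""
    clusters = [[p] for p in points]
    while len(clusters) > k:
        cents = [centroid(c) for c in clusters]
        pairs = ((a, b) for a in range(len(cents)) for b in range(a + 1, len(cents)))
        i, j = min(pairs, key=lambda ab: math.dist(cents[ab[0]], cents[ab[1]]))
        clusters[i].extend(clusters.pop(j))  # j > i, so index i stays valid
    return clusters

def fit_gaussian_nb(clusters, var_floor=1e-4):
    """Gaussian naive Bayes: log prior, per-feature mean and variance for each cluster."""
    total = sum(len(c) for c in clusters)
    model = []
    for c in clusters:
        means = centroid(c)
        variances = [sum((x - m) ** 2 for x in col) / len(c) + var_floor
                     for col, m in zip(zip(*c), means)]
        model.append((math.log(len(c) / total), means, variances))
    return model

def predict(model, pixel):
    """Index of the cluster with the highest log posterior for one pixel."""
    def log_posterior(entry):
        log_prior, means, variances = entry
        return log_prior + sum(-0.5 * math.log(2 * math.pi * v) - (x - m) ** 2 / (2 * v)
                               for x, m, v in zip(pixel, means, variances))
    return max(range(len(model)), key=lambda i: log_posterior(model[i]))

# 1) Cluster only a small random subsample (the costly step).
sample = random.sample(scene, 80)
clusters = agglomerate(sample, k=2)

# 2) Generalize the cluster labels to every pixel with naive Bayes.
model = fit_gaussian_nb(clusters)

# 3) Assumption: the cluster with the highest mean NDWI is the water cluster.
water_id = max(range(len(model)), key=lambda i: model[i][1][0])
water_mask = [predict(model, p) == water_id for p in scene]

accuracy = sum(m == t for m, t in zip(water_mask, truth)) / len(truth)
print(f"water fraction: {sum(water_mask) / len(water_mask):.3f}, accuracy: {accuracy:.3f}")
```

The point of the design is cost: the pairwise merge step scales poorly with the number of points, so it runs on 80 samples only, while the naive Bayes step is linear in the number of pixels and can label the whole scene.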
Of all the methods, our method achieved the highest mean kappa score, 0.874, across all tested scenes, with a per-scene kappa ranging from 0.608 to 0.980, and the lowest mean standard deviation of 0.091. Standard Otsu's thresholding had the worst performance due to the lack of a bimodal histogram, and the Canny-edge variation achieved an overall kappa of 0.718 when used with the MNDWI. For water masks provided by generic processors, FMask outperformed MAJA and Sen2Cor and obtained an overall kappa of 0.764. In-depth analysis shows a quick drop in performance for all of the methods in identifying water bodies with a surface area below 0.5 ha, but the proposed approach outperformed the second best method by 34% in this size class.
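For readers unfamiliar with the kappa scores quoted above, the following sketch computes Cohen's kappa, kappa = (p_o - p_e) / (1 - p_e), where p_o is the observed agreement and p_e is the agreement expected by chance. The reference and predicted labels are hypothetical toy data, not values from the study.

```python
def cohen_kappa(reference, predicted):
    """Cohen's kappa for two equally long label sequences."""
    n = len(reference)
    # Observed agreement: fraction of pixels where both masks give the same label.
    observed = sum(r == p for r, p in zip(reference, predicted)) / n
    # Chance agreement: sum over classes of the product of the marginal frequencies.
    expected = sum((reference.count(c) / n) * (predicted.count(c) / n)
                   for c in set(reference) | set(predicted))
    return (observed - expected) / (1 - expected)

# Toy masks (1 = water, 0 = non-water): 40 true positives, 10 missed water pixels,
# 5 false alarms, 45 true negatives.
reference = [1] * 50 + [0] * 50
predicted = [1] * 40 + [0] * 10 + [1] * 5 + [0] * 45

kappa = cohen_kappa(reference, predicted)
print(round(kappa, 3))
```

Here p_o = 0.85 and p_e = 0.5 × 0.45 + 0.5 × 0.55 = 0.5, so kappa = 0.35 / 0.5 = 0.70, which is lower than the raw agreement because half of that agreement would be expected by chance.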
Pages: 17
Related papers
65 in total
[1]   Evaluation of Water Indices for Surface Water Extraction in a Landsat 8 Scene of Nepal [J].
Acharya, Tri Dev ;
Subedi, Anoj ;
Lee, Dong Ha .
SENSORS, 2018, 18 (08)
[2]   Identification of Water Bodies in a Landsat 8 OLI Image Using a J48 Decision Tree [J].
Acharya, Tri Dev ;
Lee, Dong Ha ;
Yang, In Tae ;
Lee, Jae Kang .
SENSORS, 2016, 16 (07)
[3]  
Al Bayati M., 2013, P INT C SOFT COMP, P77, DOI 10.5121/csit.2013.3308
[4]   Retrieval of Chlorophyll a from Sentinel-2 MSI Data for the European Union Water Framework Directive Reporting Purposes [J].
Ansper, Ave ;
Alikas, Krista .
REMOTE SENSING, 2019, 11 (01)
[5]   Validation of Copernicus Sentinel-2 Cloud Masks Obtained from MAJA, Sen2Cor, and FMask Processors Using Reference Cloud Masks Generated with a Supervised Active Learning Procedure [J].
Baetens, Louis ;
Desjardins, Camille ;
Hagolle, Olivier .
REMOTE SENSING, 2019, 11 (04)
[6]   Comparing Thresholding with Machine Learning Classifiers for Mapping Complex Water [J].
Bangira, Tsitsi ;
Alfieri, Silvia Maria ;
Menenti, Massimo ;
van Niekerk, Adriaan .
REMOTE SENSING, 2019, 11 (11)
[7]  
Barbosa C.C.F., 2019, INTRO SENSORIAMENTO, 1st ed.
[8]   Retrospection and introspection on remote sensing of inland water quality: "Like Deja Vu All Over Again" [J].
Bukata, Robert P. .
JOURNAL OF GREAT LAKES RESEARCH, 2013, 39 :2-5
[9]   Recent Surface Water Extent of Lake Chad from Multispectral Sensors and GRACE [J].
Buma, Willibroad Gabila ;
Lee, Sang-Il ;
Seo, Jae Young .
SENSORS, 2018, 18 (07)
[10]  
Calinski Tadeusz, 1974, Commun Stat, V3, P1, DOI 10.1080/03610927408827101