A robust hierarchical clustering for georeferenced data

被引:17
作者
D'Urso, Pierpaolo [1 ]
Vitale, Vincenzina [1 ]
机构
[1] Sapienza Univ Roma, Dipartimento Sci Sociali & Econ, Rome, Italy
关键词
Agglomerative hierarchical clustering; Geostatistics; Kernel function; Robust dissimilarity measure; Multivariate spatial data; Top soil heavy metal concentrations; SPATIAL DATA; OUTLIERS;
D O I
10.1016/j.spasta.2020.100407
中图分类号
P [天文学、地球科学];
学科分类号
07 ;
摘要
The detection of spatially contiguous clusters is a relevant task in geostatistics since near located observations might have similar features than distant ones. Spatially compact groups can also improve clustering results interpretation according to the different detected subregions. In this paper, we propose a robust metric approach to neutralize the effect of possible outliers, i.e. an exponential transformation of a dissimilarity measure between each pair of locations based on non-parametric kernel estimator of the direct and cross variograms (Fouedjio, 2016) and on a different bandwidth identification, suitable for agglomerative hierarchical clustering techniques applied to data indexed by geographical coordinates. Simulation results are very promising showing very good performances of our proposed metric with respect to the baseline ones. Finally, the new clustering approach is applied to two real-word data sets, both giving locations and top soil heavy metal concentrations. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:33
相关论文
共 39 条
[1]   A review of robust clustering methods [J].
Angel Garcia-Escudero, Luis ;
Gordaliza, Alfonso ;
Matran, Carlos ;
Mayo-Iscar, Agustin .
ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2010, 4 (2-3) :89-109
[2]  
[Anonymous], 1994, Outliers in statistical data
[3]  
[Anonymous], 1994, Kernel Smoothing
[4]   LOCAL INDICATORS OF SPATIAL ASSOCIATION - LISA [J].
ANSELIN, L .
GEOGRAPHICAL ANALYSIS, 1995, 27 (02) :93-115
[5]  
Barnett V., 1978, Journal of the Royal Statistical Society Series C (Applied Statistics), V27, P242
[6]   The ordering of spatial data and the detection of multiple outliers [J].
Cerioli, A ;
Riani, M .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1999, 8 (02) :239-258
[7]   On detecting spatial outliers [J].
Chen, Dechang ;
Lu, Chang-Tien ;
Kou, Yufeng ;
Chen, Feng .
GEOINFORMATICA, 2008, 12 (04) :455-475
[8]   A Fuzzy Clustering Model for Multivariate Spatial Time Series [J].
Coppi, Renato ;
D'Urso, Pierpaolo ;
Giordani, Paolo .
JOURNAL OF CLASSIFICATION, 2010, 27 (01) :54-88
[9]   Fuzzy clustering with spatial-temporal information [J].
D'Urso, Pierpaolo ;
De Giovanni, Livia ;
Disegna, Marta ;
Massari, Riccardo .
SPATIAL STATISTICS, 2019, 30 :71-102
[10]   Visualizing regional clusters of Sardinia's EU supported agriculture: A Spatial Fuzzy Partitioning Around Medoids [J].
D'Urso, Pierpaolo ;
Manca, Germana ;
Waters, Nigel ;
Girone, Stefania .
LAND USE POLICY, 2019, 83 :571-580