Clustering via nonparametric density estimation

Cited by: 88
Authors
Azzalini, Adelchi [1]
Torelli, Nicola [2]
Affiliations
[1] Univ Padua, Dipartimento Sci Stat, I-35100 Padua, Italy
[2] Univ Trieste, Dipartimento Sci Econ & Stat, I-34127 Trieste, Italy
Keywords
cluster analysis; Delaunay triangulation; Voronoi tessellation; nonparametric density estimation; kernel method
DOI
10.1007/s11222-006-9010-y
Chinese Library Classification number
TP301 [Theory, methods]
Discipline classification code
081202
Abstract
Although Hartigan (1975) had already put forward the idea of connecting identification of subpopulations with regions with high density of the underlying probability distribution, the actual development of methods for cluster analysis has largely shifted towards other directions, for computational convenience. Current computational resources allow us to reconsider this formulation and to develop clustering techniques directly in order to identify local modes of the density. Given a set of observations, a nonparametric estimate of the underlying density function is constructed, and subsets of points with high density are formed through suitable manipulation of the associated Delaunay triangulation. The method is illustrated with some numerical examples.
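The pipeline outlined in the abstract (density estimate, then high-density subsets formed from the Delaunay triangulation) can be illustrated with a minimal Python sketch. This is not the authors' implementation. It assumes SciPy's `gaussian_kde` with its default bandwidth as the density estimator, a single density threshold set by a hypothetical `quantile` parameter, and an extra midpoint-density check on each edge. That last check is an assumption added here so that long triangulation edges spanning an empty gap do not join separate groups.

```python
import numpy as np
from scipy.sparse import coo_matrix
from scipy.sparse.csgraph import connected_components
from scipy.spatial import Delaunay
from scipy.stats import gaussian_kde


def high_density_clusters(points, quantile=0.3):
    """Group high-density points via the Delaunay triangulation.

    points   : (n, d) array of observations
    quantile : hypothetical tuning parameter; points whose estimated
               density lies below this quantile are left unlabeled
    Returns an integer array of length n, with -1 for unlabeled points.
    """
    points = np.asarray(points, dtype=float)
    n = len(points)

    # Kernel density estimate, evaluated at the observations themselves.
    kde = gaussian_kde(points.T)
    density = kde(points.T)
    threshold = np.quantile(density, quantile)
    keep = density >= threshold

    # Edges of the Delaunay triangulation: every pair of vertices
    # within each simplex.
    simplices = Delaunay(points).simplices
    k = simplices.shape[1]
    edges = np.vstack(
        [simplices[:, [i, j]] for i in range(k) for j in range(i + 1, k)]
    )

    # Keep only edges whose two endpoints are both high-density points.
    edges = edges[keep[edges[:, 0]] & keep[edges[:, 1]]]

    # Assumption (not stated in the abstract): also drop edges whose
    # midpoint falls in a low-density region.
    midpoints = (points[edges[:, 0]] + points[edges[:, 1]]) / 2.0
    edges = edges[kde(midpoints.T) >= threshold]

    # Connected components of the remaining edge graph.
    graph = coo_matrix(
        (np.ones(len(edges)), (edges[:, 0], edges[:, 1])), shape=(n, n)
    )
    _, component = connected_components(graph, directed=False)

    # Relabel the components of the retained points as 0, 1, 2, ...
    labels = np.full(n, -1)
    labels[keep] = np.unique(component[keep], return_inverse=True)[1]
    return labels
```

Calling `high_density_clusters(X)` on an `(n, d)` array returns one label per observation. Points left at `-1` are the low-density ones, which a complete method would still need to allocate to a group.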
Pages: 71-80
Number of pages: 10
Related papers
18 in total
  • [1] AITCHISON J, 1986, STAT ANAL COMPOSITIO
  • [2] ANKERST M, 1999, INT C MAN DAT SIGMOD, P49
  • [3] [Anonymous], 1983, Food Research and Data Analysis
  • [4] [Anonymous], 1975, CLUSTERING ALGORITHM
  • [5] [Anonymous], MULTIVARIATE ANAL
  • [6] The Quickhull algorithm for convex hulls
    Barber, CB
    Dobkin, DP
    Huhdanpaa, H
    [J]. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1996, 22 (04): 469-483
  • [7] DENSITY BASED EXPLORATION OF BIVARIATE DATA
    BOWMAN, A
    FOSTER, P
    [J]. STATISTICS AND COMPUTING, 1993, 3 (04): 171-177
  • [8] Bowman AW, 1997, Applied Smoothing Techniques for Data Analysis: the Kernel Approach with S-Plus Illustrations
  • [9] Estimating the number of clusters
    Cuevas, A
    Febrero, M
    Fraiman, R
    [J]. CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2000, 28 (02): 367-382
  • [10] Cluster analysis: a further approach based on density estimation
    Cuevas, A
    Febrero, M
    Fraiman, R
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2001, 36 (04): 441-459