Approximate clustering of noisy biomedical data

Cited by: 0
Authors
Boryczko, Krzysztof [1 ]
Kurdziel, Marcin [1 ]
Affiliations
[1] AGH Univ Sci & Technol, Inst Comp Sci, PL-30059 Krakow, Poland
Source
COMPUTATIONAL SCIENCE - ICCS 2008, PT 1 | 2008 / Vol. 5101
Keywords
cluster analysis; noisy data; multidimensional data; Shared Nearest Neighbor Graph; Mutual Nearest Neighborhood;
DOI
Not available
Chinese Library Classification (CLC) number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Classical clustering algorithms often perform poorly on data harboring background noise, i.e. a large number of observations distributed uniformly in the feature space. Here, we present a new density-based algorithm for approximate clustering of such noisy data. The algorithm employs Shared Nearest Neighbor Graphs to estimate local data density and to identify core points, which are assumed to indicate the locations of clusters. Core points are partitioned into clusters by means of the Mutual Nearest Neighbor distance measure. This measure is sensitive to changes in local data density, and is thus useful for discovering clusters that differ in density. The performance of the presented algorithm was demonstrated on three data sets, two synthetic and one real-world. In all cases, meaningful clustering structures were discovered.
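The abstract does not give the algorithm's details, so the following is only a minimal sketch of the two ingredients it names, under stated assumptions. Shared-nearest-neighbor similarity is taken here in the Jarvis-Patrick style (overlap of two k-nearest-neighbor lists, counted only for mutual neighbors), and the mutual neighbor distance is taken as the sum of the two neighbor ranks, a commonly cited definition. The function names and the parameters `k`, `min_shared`, and `min_density` are hypothetical and are not taken from the paper.

```python
import math

def knn_lists(points, k):
    """For each point, return the indices of its k nearest neighbors, nearest first."""
    neighbors = []
    for i, p in enumerate(points):
        others = [j for j in range(len(points)) if j != i]
        others.sort(key=lambda j: math.dist(p, points[j]))
        neighbors.append(others[:k])
    return neighbors

def snn_similarity(neighbors, i, j):
    """Assumed SNN similarity: overlap of the two k-NN lists, or 0 if
    i and j are not in each other's lists."""
    if j not in neighbors[i] or i not in neighbors[j]:
        return 0
    return len(set(neighbors[i]) & set(neighbors[j]))

def snn_density(neighbors, i, min_shared):
    """Assumed density estimate: number of neighbors of i whose SNN
    similarity to i is at least min_shared."""
    return sum(1 for j in neighbors[i]
               if snn_similarity(neighbors, i, j) >= min_shared)

def core_points(neighbors, min_shared, min_density):
    """Points whose assumed SNN density reaches min_density."""
    return [i for i in range(len(neighbors))
            if snn_density(neighbors, i, min_shared) >= min_density]

def mutual_neighbor_distance(neighbors, i, j):
    """Assumed mutual neighbor distance: 1-based rank of j in i's list plus
    rank of i in j's list; infinite if they are not mutual neighbors."""
    if j not in neighbors[i] or i not in neighbors[j]:
        return math.inf
    return (neighbors[i].index(j) + 1) + (neighbors[j].index(i) + 1)

# Toy data: five points forming a tight group, plus three distant noise points.
points = [(0, 0), (0, 1), (1, 0), (1, 1), (0.5, 0.5),
          (10, 10), (20, -5), (-15, 8)]
nb = knn_lists(points, k=3)
cores = core_points(nb, min_shared=1, min_density=2)
print("core points:", cores)
print("mutual neighbor distance 0-4:", mutual_neighbor_distance(nb, 0, 4))
```

In this toy setting the noise points list cluster members as their nearest neighbors, but no cluster member lists them back, so their assumed density is zero and they are not selected as core points. A full method would still need a rule for grouping core points by mutual neighbor distance, which this record does not specify.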
Pages: 630-640
Page count: 11