Adaptive double self-organizing maps for clustering gene expression profiles

被引:30
作者
Ressom, H [1 ]
Wang, D [1 ]
Natarajan, P [1 ]
机构
[1] Univ Maine, Dept Elect & Comp Engn, Orono, ME 04469 USA
关键词
self-organizing maps; clustering; cluster validation; tree-based index;
D O I
10.1016/S0893-6080(03)00102-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces a new model of self-organizing map (SOM) known as adaptive double self-organizing map (ADSOM). ADSOM has a flexible topology and performs data partitioning and cluster visualization simultaneously without requiring a priori knowledge about the number of clusters. It combines features of the popular SOM with two-dimensional position vectors, which serve as a visualization tool to detect the number of clusters present in the data. ADSOM updates its free parameters and allows convergence of its position vectors to a fairly consistent number of clusters provided its initial number of nodes is greater than the expected number of clusters. A novel index is introduced based on hierarchical clustering of the final locations of position vectors. The index allows automated detection of the number of clusters, thereby reducing human error that could be incurred from counting clusters visually. To test ADSOM's consistency in data partitioning, we examine the number of common profiles found in the clusters that were obtained by varying the initial number of nodes. This provides a confidence measure for the clusters formed by ADSOM and illustrates the effect of different initial number of nodes on data partitioning. The reliance of ADSOM in identifying number of clusters is demonstrated by applying it to publicly available yeast gene expression data. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:633 / 640
页数:8
相关论文
共 26 条
  • [1] [Anonymous], 208 STANF U DEP STAT
  • [2] Clustering gene expression patterns
    Ben-Dor, A
    Shamir, R
    Yakhini, Z
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) : 281 - 297
  • [3] A genome-wide transcriptional analysis of the mitotic cell cycle
    Cho, RJ
    Campbell, MJ
    Winzeler, EA
    Steinmetz, L
    Conway, A
    Wodicka, L
    Wolfsberg, TG
    Gabrielian, AE
    Landsman, D
    Lockhart, DJ
    Davis, RW
    [J]. MOLECULAR CELL, 1998, 2 (01) : 65 - 73
  • [4] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
  • [5] Model-based clustering, discriminant analysis, and density estimation
    Fraley, C
    Raftery, AE
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (458) : 611 - 631
  • [6] GRANZOW M, 2001, TUMOR CLASSIFICATION, P16
  • [7] GUTHKE R, 2000, P EUR S INT TECHN ES, P170
  • [8] Combining hierarchical clustering and self-organizing maps for exploratory analysis of gene expression patterns
    Herrero, J
    Dopazo, J
    [J]. JOURNAL OF PROTEOME RESEARCH, 2002, 1 (05) : 467 - 470
  • [9] COMPARING PARTITIONS
    HUBERT, L
    ARABIE, P
    [J]. JOURNAL OF CLASSIFICATION, 1985, 2 (2-3) : 193 - 218
  • [10] Jain A.K., 1998, ALGORITHMS CLUSTERIN