The local maximum clustering method and its application in microarray gene expression data analysis

被引:18
作者
Wu, XW [1 ]
Chen, YD
Brooks, BR
Su, YA
机构
[1] Natl Heart Lung & Blood Inst, Lab Biophys Chem, NIH, Bethesda, MD 20892 USA
[2] Natl Human Genome Res Inst, NIH, Bethesda, MD 20892 USA
[3] Loyola Univ, Ctr Med, Dept Pathol, Maywood, IL 60153 USA
关键词
data cluster; clustering method; microarray; gene expression; classification; model data sets;
D O I
10.1155/S1110865704309145
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
An unsupervised data clustering method, called the local maximum clustering (LMC) method, is proposed for identifying clusters in experiment data sets based on research interest. A magnitude property is defined according to research purposes, and data sets are clustered around each local maximum of the magnitude property. By properly defining a magnitude property, this method can overcome many difficulties in microarray data clustering such as reduced projection in similarities, noises, and arbitrary gene distribution. To critically evaluate the performance of this clustering method in comparison with other methods, we designed three model data sets with known cluster distributions and applied the LMC method as well as the hierarchic clustering method, the K-mean clustering method, and the self-organized map method to these model data sets. The results show that the LMC method produces the most accurate clustering results. As an example of application, we applied the method to cluster the leukemia samples reported in the microarray study of Golub et al. (1999).
引用
收藏
页码:53 / 63
页数:11
相关论文
共 13 条
  • [1] Gene expression data analysis
    Brazma, A
    Vilo, J
    [J]. FEBS LETTERS, 2000, 480 (01) : 17 - 24
  • [2] Knowledge-based analysis of microarray gene expression data by using support vector machines
    Brown, MPS
    Grundy, WN
    Lin, D
    Cristianini, N
    Sugnet, CW
    Furey, TS
    Ares, M
    Haussler, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (01) : 262 - 267
  • [3] New developments in the analysis of gene expression
    Burgess, JK
    Hazelton, RH
    [J]. REDOX REPORT, 2000, 5 (2-3) : 63 - 73
  • [4] Carulli JP, 1998, J CELL BIOCHEM, P286
  • [5] Computational methods for the identification of differential and coordinated gene expression
    Claverie, JM
    [J]. HUMAN MOLECULAR GENETICS, 1999, 8 (10) : 1821 - 1832
  • [6] Cluster analysis and display of genome-wide expression patterns
    Eisen, MB
    Spellman, PT
    Brown, PO
    Botstein, D
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) : 14863 - 14868
  • [7] Data management and analysis for gene expression arrays
    Ermolaeva, O
    Rastogi, M
    Pruitt, KD
    Schuler, GD
    Bittner, ML
    Chen, YD
    Simon, R
    Meltzer, P
    Trent, JM
    Boguski, MS
    [J]. NATURE GENETICS, 1998, 20 (01) : 19 - 23
  • [8] Coupled two-way clustering analysis of gene microarray data
    Getz, G
    Levine, E
    Domany, E
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (22) : 12079 - 12084
  • [9] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [10] MEILA M, 2002, 418 UW DEP STAT