Cluster validation techniques for genome expression data

被引:193
作者
Bolshakova, N [1 ]
Azuaje, F
机构
[1] Trinity Coll Dublin, Dept Comp Sci, Dublin, Ireland
[2] Univ Ulster, Sch Comp & Math, Jordanstown BT37 0QB, Antrim, North Ireland
关键词
genome expression; clustering; cluster validation; genomic data mining;
D O I
10.1016/S0165-1684(02)00475-9
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Several clustering algorithms have been suggested to analyse genome expression data, but fewer solutions have been implemented to guide the design of clustering-based experiments and assess the quality of their outcomes. A cluster validity framework provides insights into the problem of predicting the correct the number of clusters. This paper presents several validation techniques for gene expression data analysis. Normalisation and validity aggregation strategies are proposed to improve the prediction about the number of relevant clusters. The results obtained indicate that this systematic evaluation approach may significantly support genome expression analyses for knowledge discovery applications. (C) 2002 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:825 / 833
页数:9
相关论文
共 25 条
  • [1] Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling
    Alizadeh, AA
    Eisen, MB
    Davis, RE
    Ma, C
    Lossos, IS
    Rosenwald, A
    Boldrick, JG
    Sabet, H
    Tran, T
    Yu, X
    Powell, JI
    Yang, LM
    Marti, GE
    Moore, T
    Hudson, J
    Lu, LS
    Lewis, DB
    Tibshirani, R
    Sherlock, G
    Chan, WC
    Greiner, TC
    Weisenburger, DD
    Armitage, JO
    Warnke, R
    Levy, R
    Wilson, W
    Grever, MR
    Byrd, JC
    Botstein, D
    Brown, PO
    Staudt, LM
    [J]. NATURE, 2000, 403 (6769) : 503 - 511
  • [2] Selection bias in gene extraction on the basis of microarray gene-expression data
    Ambroise, C
    McLachlan, GJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) : 6562 - 6566
  • [3] [Anonymous], 2001, P 3 IAPR TC 15 WORKS
  • [4] A cluster validity framework for genome expression data
    Azuaje, F
    [J]. BIOINFORMATICS, 2002, 18 (02) : 319 - 320
  • [5] A computational neural approach to support the discovery of gene function and classes of cancer
    Azuaje, F
    [J]. IEEE TRANSACTIONS ON BIOMEDICAL ENGINEERING, 2001, 48 (03) : 332 - 339
  • [6] AZUAJE F, 2002, IN PRESS UNDERSTANDI
  • [7] Cancer - Gene expression in diagnosis
    Berns, A
    [J]. NATURE, 2000, 403 (6769) : 491 - 492
  • [8] Some new indexes of cluster validity
    Bezdek, JC
    Pal, NR
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 1998, 28 (03): : 301 - 315
  • [9] Molecular classification of cutaneous malignant melanoma by gene expression profiling
    Bittner, M
    Meitzer, P
    Chen, Y
    Jiang, Y
    Seftor, E
    Hendrix, M
    Radmacher, M
    Simon, R
    Yakhini, Z
    Ben-Dor, A
    Sampas, N
    Dougherty, E
    Wang, E
    Marincola, F
    Gooden, C
    Lueders, J
    Glatfelter, A
    Pollock, P
    Carpten, J
    Gillanders, E
    Leja, D
    Dietrich, K
    Beaudry, C
    Berens, M
    Alberts, D
    Sondak, V
    Hayward, N
    Trent, J
    [J]. NATURE, 2000, 406 (6795) : 536 - 540
  • [10] BOLSHAKOVA N, 2003, UNPUB 4 ANN IEEE EMB