Between-group analysis of microarray data

被引:176
作者
Culhane, AC [1 ]
Perrière, G
Considine, EC
Cotter, TG
Higgins, DG
机构
[1] Univ Coll Cork, Dept Biochem, Cork, Ireland
[2] Univ Lyon 1, CNRS, UMR 5558, Lab Biometrie & Biol Evolut, F-69622 Villeurbanne, France
关键词
D O I
10.1093/bioinformatics/18.12.1600
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Most supervised classification methods are limited by the requirement for more cases than variables. In microarray data the number of variables (genes) far exceeds the number of cases (arrays), and thus filtering and pre-selection of genes is required. We describe the application of Between Group Analysis (BGA) to the analysis of microarray data. A feature of BGA is that it can be used when the number of variables (genes) exceeds the number of cases (arrays). BGA is based on carrying out an ordination of groups of samples, using a standard method such as Correspondence Analysis (COA), rather than an ordination of the individual microarray samples. As such, it can be viewed as a method of carrying out COA with grouped data. Results: We illustrate the power of the method using two cancer data sets. In both cases, we can quickly and accurately classify test samples from any number of specified a priori groups and identify the genes which characterize these groups. We obtained very high rates of correct classification, as determined by jack-knife or validation experiments with training and test sets. The results are comparable to those from other methods in terms of accuracy but the power and flexibility of BGA make it an especially attractive method for the analysis of microarray cancer data.
引用
收藏
页码:1600 / 1608
页数:9
相关论文
共 13 条
  • [1] High expression of the chemokine receptor CXCR4 predicts extramedullary organ infiltration in childhood acute lymphoblastic leukaemia
    Crazzolara, R
    Kreczy, A
    Mann, G
    Heitger, A
    Eibl, G
    Fink, FM
    Möhle, R
    Meister, B
    [J]. BRITISH JOURNAL OF HAEMATOLOGY, 2001, 115 (03) : 545 - 553
  • [2] DOLEDEC S, 1987, ACTA OECOL-OEC GEN, V8, P403
  • [3] Comparison of discrimination methods for the classification of tumors using gene expression data
    Dudoit, S
    Fridlyand, J
    Speed, TP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2002, 97 (457) : 77 - 87
  • [4] Correspondence analysis applied to microarray data
    Fellenberg, K
    Hauser, NC
    Brors, B
    Neutzner, A
    Hoheisel, JD
    Vingron, M
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (19) : 10781 - 10786
  • [5] Support vector machine classification and validation of cancer tissue samples using microarray expression data
    Furey, TS
    Cristianini, N
    Duffy, N
    Bednarski, DW
    Schummer, M
    Haussler, D
    [J]. BIOINFORMATICS, 2000, 16 (10) : 906 - 914
  • [6] Molecular classification of cancer: Class discovery and class prediction by gene expression monitoring
    Golub, TR
    Slonim, DK
    Tamayo, P
    Huard, C
    Gaasenbeek, M
    Mesirov, JP
    Coller, H
    Loh, ML
    Downing, JR
    Caligiuri, MA
    Bloomfield, CD
    Lander, ES
    [J]. SCIENCE, 1999, 286 (5439) : 531 - 537
  • [7] GRANT GR, 2001, METHODS MICROARRAY D, P37
  • [8] Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks
    Khan, J
    Wei, JS
    Ringnér, M
    Saal, LH
    Ladanyi, M
    Westermann, F
    Berthold, F
    Schwab, M
    Antonescu, CR
    Peterson, C
    Meltzer, PS
    [J]. NATURE MEDICINE, 2001, 7 (06) : 673 - 679
  • [9] Li W, 2001, METHODS MICROARRAY D, P137
  • [10] The role of TCL1 in human T-cell leukemia
    Pekarsky, Y
    Hallas, C
    Croce, CM
    [J]. ONCOGENE, 2001, 20 (40) : 5638 - 5643