Stability and aggregation of ranked gene lists

被引:121
作者
Boulesteix, Anne-Laure [1 ]
Slawski, Martin [1 ]
机构
[1] Univ Munich, Fac Med, D-80539 Munich, Germany
关键词
Univariate analysis; differential expression; top-list; ranking; variability; bootstrap; DIFFERENTIALLY EXPRESSED GENES; MICROARRAY DATA; BIOCONDUCTOR PACKAGE; T-TEST; REPRODUCIBILITY; CLASSIFICATION; METAANALYSIS; FRAMEWORK; SELECTION; RANKING;
D O I
10.1093/bib/bbp034
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Ranked gene lists are highly instable in the sense that similar measures of differential gene expression may yield very different rankings, and that a small change of the data set usually affects the obtained gene list considerably. Stability issues have long been under-considered in the literature, but they have grown to a hot topic in the last few years, perhaps as a consequence of the increasing skepticism on the reproducibility and clinical applicability of molecular research findings. In this article, we review existing approaches for the assessment of stability of ranked gene lists and the related problem of aggregation, give some practical recommendations, and warn against potential misuse of these methods. This overview is illustrated through an application to a recent leukemia data set using the freely available Bioconductor package GeneSelector.
引用
收藏
页码:556 / 568
页数:13
相关论文
共 61 条
  • [1] Gene prioritization through genomic data fusion
    Aerts, S
    Lambrechts, D
    Maity, S
    Van Loo, P
    Coessens, B
    De Smet, F
    Tranchevent, LC
    De Moor, B
    Marynen, P
    Hassan, B
    Carmeliet, P
    Moreau, Y
    [J]. NATURE BIOTECHNOLOGY, 2006, 24 (05) : 537 - 544
  • [2] Microarray data analysis: from disarray to consolidation and consensus
    Allison, DB
    Cui, XQ
    Page, GP
    Sabripour, M
    [J]. NATURE REVIEWS GENETICS, 2006, 7 (01) : 55 - 65
  • [3] [Anonymous], 2008, STABILITY SELECTION
  • [4] Development of biomarker classifiers from high-dimensional data
    Baek, Songjoon
    Tsai, Chen-An
    Chen, James J.
    [J]. BRIEFINGS IN BIOINFORMATICS, 2009, 10 (05) : 537 - 546
  • [5] A Bayesian framework for the analysis of microarray expression data: regularized t-test and statistical inferences of gene changes
    Baldi, P
    Long, AD
    [J]. BIOINFORMATICS, 2001, 17 (06) : 509 - 519
  • [6] Machine learning methods for predictive proteomics
    Barla, Annalisa
    Jurman, Giuseppe
    Riccadonna, Samantha
    Merler, Stefano
    Chierici, Marco
    Furlanello, Cesare
    [J]. BRIEFINGS IN BIOINFORMATICS, 2008, 9 (02) : 119 - 128
  • [7] The bootstrap in hypothesis testing
    Bickel, PJ
    Ren, JJ
    [J]. STATE OF THE ART IN PROBABILITY AND STATISTICS: FESTSCHRIFT FOR WILLEM R VAN ZWET, 2001, 36 : 91 - 112
  • [8] Boulesteix AL, 2008, CANCER INFORM, V6, P77
  • [9] BOULESTEIX AL, 2009, 58 U MUN DEP STAT
  • [10] Iterative Group Analysis (iGA): A simple tool to enhance sensitivity and facilitate interpretation of microarray experiments
    Breitling, R
    Amtmann, A
    Herzyk, P
    [J]. BMC BIOINFORMATICS, 2004, 5 (1)