Correcting the estimated level of differential expression for gene selection bias: Application to a microarray study

被引:0
|
作者
Bickel, David R. [1 ]
机构
[1] Univ Ottawa, Ottawa Inst Syst Biol, Dept Biochem Microbiol & Immunol, Ottawa, ON K1N 6N5, Canada
关键词
conditional bias; conditionally biased estimation; feature selection bias; shrinkage; empirical Bayes; gene rank; data resampling; transcriptional microarray; differential gene expression; fold change estimation; multiple comparisons; cross validation;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The level of differential gene expression may be defined as a fold change, a frequency of upregulation, or some other measure of the degree or extent of a difference in expression across groups of interest. On the basis of expression data for hundreds or thousands of genes, inferring which genes are differentially expressed or ranking genes in order of priority introduces a bias in estimates of their differential expression levels. A previous correction of this feature selection bias suffers from a lack of generality in the method of ranking genes, from requiring many biological replicates, and from unnecessarily overcompensating for the bias. For any method of ranking genes on the basis of gene expression measured for as few as three biological replicates, a simple leave-one-out algorithm corrects, with less overcompensation, the bias in estimates of the level of differential gene expression. In a microarray data set, the bias correction reduces estimates of the probability of upregulation or downregulation from 100% to as low as 60%, even for genes with estimated local false discovery rates close to 0. A simulation study quantifies both the advantage of smoothing estimates of bias before correction and the degree of overcompensation.
引用
收藏
页数:26
相关论文
共 50 条
  • [11] The Impact of Gene Selection on Imbalanced Microarray Expression Data
    Kamal, Abu H. M.
    Zhu, Xingquan
    Pandya, Abhijit S.
    Hsu, Sam
    Shoaib, Muhammad
    BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, PROCEEDINGS, 2009, 5462 : 259 - 269
  • [12] MIDGET:Detecting differential gene expression on microarray data
    Angelescu, Radu
    Dobrescu, Radu
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 211
  • [13] Differential analysis of DNA microarray gene expression data
    Hatfield, GW
    Hung, SP
    Baldi, P
    MOLECULAR MICROBIOLOGY, 2003, 47 (04) : 871 - 877
  • [14] Exploiting the Accumulated Evidence for Gene Selection in Microarray Gene Expression Data
    Prat-Masramon, Gabriel
    Belanche-Munoz, Lluis A.
    ECAI 2010 - 19TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2010, 215 : 989 - +
  • [15] Application of a priori established gene sets to discover biologically important differential expression in microarray data
    Bild, A
    Febbo, PG
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2005, 102 (43) : 15278 - 15279
  • [16] Gene Selection for Microarray Expression Data with Imbalanced Sample Distributions
    Kamal, Abu H. M.
    Zhu, Xingquan
    Narayanan, Ramaswamy
    2009 INTERNATIONAL JOINT CONFERENCE ON BIOINFORMATICS, SYSTEMS BIOLOGY AND INTELLIGENT COMPUTING, PROCEEDINGS, 2009, : 3 - +
  • [17] A hybrid feature selection approach for microarray gene expression data
    Tan, Feng
    Fu, Xuezheng
    Wang, Hao
    Zhang, Yanqing
    Bourgeois, Anu
    COMPUTATIONAL SCIENCE - ICCS 2006, PT 2, PROCEEDINGS, 2006, 3992 : 678 - 685
  • [18] Unsupervised selection of informative genes in microarray gene expression data
    Liaghat, Samaneh
    Mansoori, Eghbal G.
    INTERNATIONAL JOURNAL OF APPLIED PATTERN RECOGNITION, 2016, 3 (04) : 351 - 367
  • [19] Parsimonious Selection of Useful Genes in Microarray Gene Expression Data
    Gonzalez-Navarro, Felix F.
    Belanche-Munoz, Lluis A.
    SOFTWARE TOOLS AND ALGORITHMS FOR BIOLOGICAL SYSTEMS, 2011, 696 : 45 - 55
  • [20] Quality of feature selection based on microarray gene expression data
    Maciejewski, Henryk
    COMPUTATIONAL SCIENCE - ICCS 2008, PT 3, 2008, 5103 : 140 - 147