Correcting the estimated level of differential expression for gene selection bias: Application to a microarray study

被引:0
|
作者
Bickel, David R. [1 ]
机构
[1] Univ Ottawa, Ottawa Inst Syst Biol, Dept Biochem Microbiol & Immunol, Ottawa, ON K1N 6N5, Canada
关键词
conditional bias; conditionally biased estimation; feature selection bias; shrinkage; empirical Bayes; gene rank; data resampling; transcriptional microarray; differential gene expression; fold change estimation; multiple comparisons; cross validation;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The level of differential gene expression may be defined as a fold change, a frequency of upregulation, or some other measure of the degree or extent of a difference in expression across groups of interest. On the basis of expression data for hundreds or thousands of genes, inferring which genes are differentially expressed or ranking genes in order of priority introduces a bias in estimates of their differential expression levels. A previous correction of this feature selection bias suffers from a lack of generality in the method of ranking genes, from requiring many biological replicates, and from unnecessarily overcompensating for the bias. For any method of ranking genes on the basis of gene expression measured for as few as three biological replicates, a simple leave-one-out algorithm corrects, with less overcompensation, the bias in estimates of the level of differential gene expression. In a microarray data set, the bias correction reduces estimates of the probability of upregulation or downregulation from 100% to as low as 60%, even for genes with estimated local false discovery rates close to 0. A simulation study quantifies both the advantage of smoothing estimates of bias before correction and the degree of overcompensation.
引用
收藏
页数:26
相关论文
共 50 条
  • [31] Gene ontology driven feature selection from microarray gene expression data
    Qi, Jianlong
    Tang, Jian
    PROCEEDINGS OF THE 2006 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE IN BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2006, : 428 - +
  • [32] Differential gene expression of scopolamine-treated rat hippocampus-application of cDNA microarray technology
    Hsieh, MT
    Hsieh, CL
    Lin, LW
    Wu, CR
    Huang, GS
    LIFE SCIENCES, 2003, 73 (08) : 1007 - 1016
  • [33] A study of crossover operators for gene selection of microarray data
    Hernandez, Jose Crispin Hernandez
    Duval, Beatrice
    Hao, Jin-Kao
    ARTIFICIAL EVOLUTION, 2008, 4926 : 243 - 254
  • [34] Analysis of Differential Expression of microRNAs and Their Target Genes in Prostate Cancer: A Bioinformatics Study on Microarray Gene Expression Data
    Khorasani, Maryam
    Shahbazi, Shirin
    Hosseinkhan, Nazanin
    Mahdian, Reza
    INTERNATIONAL JOURNAL OF MOLECULAR AND CELLULAR MEDICINE, 2019, 8 (02)
  • [35] Subgroups in chronic lymphocytic leukaemia (CLL) - A microarray-based differential gene expression study
    Herold, T.
    Jurinovic, V
    Seiler, T.
    Mulaw, M.
    Mansmann, U.
    Hiddemann, W.
    Buske, C.
    Bohlander, S.
    ONKOLOGIE, 2010, 33 : 44 - 44
  • [36] Recall and bias of retrieving gene expression microarray datasets through PubMed identifiers
    Piwowar, Heather A.
    Chapman, Wendy W.
    JOURNAL OF BIOMEDICAL DISCOVERY AND COLLABORATION, 2010, 5 : 7 - 20
  • [37] Differential Expression Gene Selection Algorithms for Unbalanced Gene Datasets
    Xie J.-Y.
    Wang M.-Z.
    Zhou Y.
    Gao H.-C.
    Xu S.-Q.
    Jisuanji Xuebao/Chinese Journal of Computers, 2019, 42 (06): : 1232 - 1251
  • [38] A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis
    Lazar, Cosmin
    Taminau, Jonatan
    Meganck, Stijn
    Steenhoff, David
    Coletta, Alain
    Molter, Colin
    de Schaetzen, Virginie
    Duque, Robin
    Bersini, Hugues
    Nowe, Ann
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2012, 9 (04) : 1106 - 1119
  • [39] Fuzzy-granular gene selection from microarray expression data
    He, Yuanchen
    Tang, Yuchun
    Zhang, Yan-Qing
    Sunderraman, Rajshekhar
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 153 - 157
  • [40] Gene selection for tumor classification using microarray gone expression data
    Yendrapalli, K.
    Basnet, R.
    Mukkamala, S.
    Sung, A. H.
    WORLD CONGRESS ON ENGINEERING 2007, VOLS 1 AND 2, 2007, : 290 - +