Statistical tests for identification of differentially expressed genes in cDNA microarray experiments

被引:0
作者
Sreekumar, J. [1 ]
Jose, K. K. [2 ]
机构
[1] Cent Tuber Crops Res Inst, Thiruvananthapuram 695017, Kerala, India
[2] St Thomas Coll, Dept Stat, Pala 586574, Kottayam, India
来源
INDIAN JOURNAL OF BIOTECHNOLOGY | 2008年 / 7卷 / 04期
关键词
ANOVA; Bayesian inference; bioinformatics; differential gene expression; DNA microarrays; t-test;
D O I
暂无
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Microarrays experiments are becoming a common laboratory tool for monitoring expression level in cells for thousand of penes simultaneously. The new data promise to enhance fundamental understanding of life on a molecular level and may prove useful in medical diagnosis, treatment and drug design. The greatest challenge to array technology lies in the analysis of gene expression data to identify which genes are differentially expressed across tissue samples or experimental conditions. A simple fold change was used to test the differential expression of genes. Ordinary t-test and t-test approaches with minor variations are usually used in finding differentially expressed genes under two conditions. Analysis of variance (ANOVA) and mixed model ANOVA proved to be powerful under multiple conditions or several sources of variation. Since thousands of hypotheses are tested simultaneously there is increased chance of false positives and it becomes necessary to adjust for multiple testing when assessing statistical significance of findings. Bayesian variable selection and empirical Bayesian approaches offer yet another avenue.
引用
收藏
页码:423 / 436
页数:14
相关论文
共 66 条
  • [51] Storey J., 2001, ESTIMATING FALSE DIS
  • [52] A direct approach to false discovery rates
    Storey, JD
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2002, 64 : 479 - 498
  • [53] Storey JD, 2001, POSITIVE FALSE DISCO
  • [54] Storey John D, 2003, Methods Mol Biol, V224, P149
  • [55] Bayesian variable selection in clustering high-dimensional data
    Tadesse, MG
    Sha, N
    Vannucci, M
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2005, 100 (470) : 602 - 617
  • [56] Genome-wide expression profiling of mid-gestation placenta and embryo using a 15,000 mouse developmental cDNA microarray
    Tanaka, TS
    Jaradat, SA
    Lim, MK
    Kargul, GJ
    Wang, XH
    Grahovac, MJ
    Pantano, S
    Sano, Y
    Piao, Y
    Nagaraja, R
    Doi, H
    Wood, WH
    Becker, KG
    Ko, MSH
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2000, 97 (16) : 9127 - 9132
  • [57] Bayesian estimation of fold-changes in the analysis of gene expression: The PFOLD algorithm
    Theilhaber, J
    Bushnell, S
    Jackson, A
    Fuchs, R
    [J]. JOURNAL OF COMPUTATIONAL BIOLOGY, 2001, 8 (06) : 585 - 614
  • [58] An efficient and robust statistical modeling approach to discover differentially expressed genes using genomic expression profiles
    Thomas, JG
    Olson, JM
    Tapscott, SJ
    Zhao, LP
    [J]. GENOME RESEARCH, 2001, 11 (07) : 1227 - 1236
  • [59] Diagnosis of multiple cancer types by shrunken centroids of gene expression
    Tibshirani, R
    Hastie, T
    Narasimhan, B
    Chu, G
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (10) : 6567 - 6572
  • [60] Estimation of false discovery rates in multiple testing: Application to gene microarray data
    Tsai, CA
    Hsueh, HM
    Chen, JJ
    [J]. BIOMETRICS, 2003, 59 (04) : 1071 - 1081