A mixture model approach for the analysis of small exploratory microarray experiments

被引:4
作者
Muir, W. M. [1 ]
Rosa, G. J. M. [2 ]
Pittendrigh, B. R. [3 ]
Xu, Z. [4 ]
Rider, S. D. [5 ]
Fountain, M. [5 ]
Ogas, J. [5 ]
机构
[1] Purdue Univ, Dept Anim Sci, W Lafayette, IN 47907 USA
[2] Univ Wisconsin, Dept Dairy Sci, Madison, WI 53706 USA
[3] Purdue Univ, Dept Entomol, W Lafayette, IN 47907 USA
[4] Univ Calif Riverside, Dept Bot & Plant Sci, Riverside, CA 92521 USA
[5] Purdue Univ, Dept Biochem, W Lafayette, IN 47907 USA
基金
美国国家卫生研究院;
关键词
DIFFERENTIAL GENE-EXPRESSION; FALSE DISCOVERY RATE; STATISTICAL-METHODS; UNIFIED APPROACH; EM ALGORITHM; CLASSIFICATION; IDENTIFICATION; TESTS;
D O I
10.1016/j.csda.2008.06.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The microarray is an important and powerful tool for prescreening of genes for further research. However, alternative solutions are needed to increase power in small microarray experiments. Use of traditional parametric and even non-parametric tests for such small experiments lack power and have distributional problems. A mixture model is described that is performed directly on expression differences assuming that genes in alternative treatments are expressed or not in all combinations(i) not expressed in either condition, (ii) expressed only under the first condition, (iii) expressed Only under the second condition, and (iv) expressed under both conditions, giving rise to 4 possible clusters with two treatments. The approach is termed a Mean-Difference-Mixture-Model (MD-MM) method. Accuracy and power of the MD-MM was compared to other commonly used methods, using both simulations, microarray data, and quantitative real time PCR (qRT-PCR). The MD-MM was found to be generally superior to other methods in most situations. The advantage was greatest in situations where there were few replicates, poor signal to noise ratios, or non-homogeneous variances. Published by Elsevier B.V.
引用
收藏
页码:1566 / 1576
页数:11
相关论文
共 35 条
[1]   MIXTURE-MODELS, OUTLIERS, AND THE EM ALGORITHM [J].
AITKIN, M ;
WILSON, GT .
TECHNOMETRICS, 1980, 22 (03) :325-331
[2]  
AKAIKE H, 1974, IEEE T AUTOMATIC CON, V19, P16
[3]   Class discovery and classification of tumor samples using mixture modeling of gene expression data - a unified approach [J].
Alexandridis, R ;
Lin, SL ;
Irwin, M .
BIOINFORMATICS, 2004, 20 (16) :2545-2552
[4]   A mixture model approach for the analysis of microarray gene expression data [J].
Allison, DB ;
Gadbury, GL ;
Heo, MS ;
Fernández, JR ;
Lee, CK ;
Prolla, TA ;
Weindruch, R .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2002, 39 (01) :1-20
[5]   Reliability analysis of microarray data using fuzzy c-means and normal mixture modeling based classification methods [J].
Asyali, MH ;
Alci, M .
BIOINFORMATICS, 2005, 21 (05) :644-649
[6]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[8]   A Bayesian mixture model for differential gene expression [J].
Do, KA ;
Müller, P ;
Tang, F .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2005, 54 :627-644
[10]   Empirical Bayes analysis of a microarray experiment [J].
Efron, B ;
Tibshirani, R ;
Storey, JD ;
Tusher, V .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1151-1160