Bayesian Joint Analysis of Gene Expression Data and Gene Functional Annotations

Cited by: 1
Authors
Wang X. [1]
Chen M. [2]
Khodursky A.B. [3]
Xiao G. [2]
Affiliations
[1] Department of Statistical Science, Southern Methodist University, Dallas, TX
[2] Division of Biostatistics, Department of Clinical Sciences, The University of Texas Southwestern Medical Center at Dallas, Dallas, TX
[3] Department of Biochemistry, Molecular Biology and Biophysics, The University of Minnesota, St. Paul, MN
Funding
National Aeronautics and Space Administration (NASA); National Science Foundation (NSF)
Keywords
Bayesian hierarchical models; Co-expression; Differentially expressed genes; Down-regulated; Functional categories; Functional groups; Gene expression; Gene set enrichment; Joint modeling; Pathway analysis; Up-regulated;
DOI
10.1007/s12561-012-9065-6
Abstract
Identifying which genes and which gene sets are differentially expressed (DE) under two experimental conditions are two key questions in microarray analysis. Although closely related, the questions are not interchangeable, and each has its own importance in scientific discovery. Existing approaches address only one of the two. Further, most methods for detecting DE genes rely purely on expression data, without using information about gene functional grouping. Methods for detecting altered gene sets often use a two-step procedure: the first step conducts differential expression analysis using expression data only, and the second step takes the results of the first and tests whether DE genes are overrepresented in each predefined gene set. Such a sequential analysis may lose information, because the second step uses only summary results rather than the entire expression data. Here, we propose a Bayesian joint modeling approach that addresses the two questions in parallel: it incorporates functional annotations into the expression data analysis and simultaneously infers the enrichment of functional groups. Simulation results and analysis of experimental data obtained for E. coli show improved statistical power of our integrated approach in identifying both DE genes and altered gene sets, compared with conventional methods. © 2012 International Chinese Statistical Association.
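The conventional two-step procedure that the abstract contrasts with the joint model can be sketched as follows. This is only an illustration of the sequential baseline, not the authors' Bayesian method: the Welch t statistic with a fixed cutoff, the hypergeometric over-representation test, all function names, and the toy data are assumptions made for this example.

```python
import math
from statistics import mean, variance


def welch_t(x, y):
    """Welch t statistic comparing two samples of expression values."""
    se = math.sqrt(variance(x) / len(x) + variance(y) / len(y))
    return (mean(x) - mean(y)) / se


def call_de_genes(expr_a, expr_b, t_cutoff):
    """Step 1: flag genes whose |t| exceeds a cutoff (hypothetical rule)."""
    return {g for g in expr_a if abs(welch_t(expr_a[g], expr_b[g])) > t_cutoff}


def hypergeom_upper_tail(k, N, K, n):
    """P(X >= k) when n genes are drawn from N, of which K belong to the set."""
    total = math.comb(N, n)
    hits = sum(math.comb(K, i) * math.comb(N - K, n - i)
               for i in range(k, min(K, n) + 1))
    return hits / total


def enrichment_p(de_genes, gene_set, all_genes):
    """Step 2: over-representation test that sees only the DE list."""
    k = len(de_genes & gene_set)
    return hypergeom_upper_tail(k, len(all_genes),
                                len(gene_set & all_genes), len(de_genes))


# Toy data (made up): three replicates per gene under each condition.
expr_a = {"g1": [5.0, 5.2, 4.9], "g2": [5.1, 4.8, 5.0], "g3": [1.0, 1.2, 0.9],
          "g4": [1.1, 0.9, 1.0], "g5": [2.0, 2.1, 1.9], "g6": [3.0, 3.2, 2.9]}
expr_b = {"g1": [1.0, 1.1, 0.9], "g2": [1.2, 1.0, 0.9], "g3": [1.1, 1.0, 1.0],
          "g4": [1.0, 1.1, 0.9], "g5": [2.1, 1.9, 2.0], "g6": [3.1, 2.9, 3.0]}

de = call_de_genes(expr_a, expr_b, t_cutoff=4.0)
p = enrichment_p(de, {"g1", "g2", "g3"}, set(expr_a))
```

Note that `enrichment_p` receives only the set of flagged genes, never the expression values themselves. That is the information loss the abstract attributes to sequential analysis.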
Pages: 300-318
Page count: 18