Meta-analysis for pathway enrichment analysis when combining multiple genomic studies

Cited by: 68
Authors
Shen, Kui [1]
Tseng, George C. [1, 2, 3]
Affiliations
[1] Univ Pittsburgh, Sch Med, Dept Computat Biol, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Biostat, Pittsburgh, PA 15261 USA
[3] Univ Pittsburgh, Grad Sch Publ Hlth, Dept Human Genet, Pittsburgh, PA 15261 USA
Funding
National Institutes of Health (NIH), USA
Keywords
FALSE DISCOVERY RATE; GENE-EXPRESSION DATA; BREAST-CANCER; SIGNATURE; SETS;
DOI
10.1093/bioinformatics/btq148
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Motivation: Many pathway analysis (or gene set enrichment analysis) methods have been developed to identify enriched pathways under different biological states within a genomic study. As more and more microarray datasets accumulate, meta-analysis methods have also been developed to integrate information among multiple studies. Currently, most meta-analysis methods for combining genomic studies focus on biomarker detection and meta-analysis for pathway analysis has not been systematically pursued.

Results: We investigated two approaches of meta-analysis for pathway enrichment (MAPE) by combining statistical significance across studies at the gene level (MAPE_G) or at the pathway level (MAPE_P). Simulation results showed increased statistical power of meta-analysis approaches compared to a single study analysis and showed complementary advantages of MAPE_G and MAPE_P under different scenarios. We also developed an integrated method (MAPE_I) that incorporates advantages of both approaches. Comprehensive simulations and applications to real data on drug response of breast cancer cell lines and lung cancer tissues were evaluated to compare the performance of three MAPE variations. MAPE_P has the advantage of not requiring gene matching across studies. When MAPE_G and MAPE_P show complementary advantages, the hybrid version of MAPE_I is generally recommended.

Availability: http://www.biostat.pitt.edu/bioinfo/

Contact: ctseng@pitt.edu

Supplementary information: Supplementary data are available at Bioinformatics online.
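
The difference between combining significance at the gene level (MAPE_G) and at the pathway level (MAPE_P) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not state which combining statistic or enrichment test is used, so Fisher's combined probability test and a hypergeometric over-representation test are assumed here as stand-ins, and all function names, the `alpha` cutoff, and the toy data are hypothetical. MAPE_I is not sketched because the abstract does not describe how it integrates the two.

```python
import math

def fisher_combine(pvalues):
    """Fisher's method (assumed stand-in): -2*sum(ln p) ~ chi-square, 2k df."""
    k = len(pvalues)
    half = -sum(math.log(p) for p in pvalues)  # half of the Fisher statistic
    # Closed-form chi-square survival function for even degrees of freedom.
    tail = math.exp(-half) * sum(half ** i / math.factorial(i) for i in range(k))
    return min(1.0, tail)

def hypergeom_enrichment(n_total, n_sig, n_path, n_overlap):
    """P(X >= n_overlap) when n_sig genes are drawn from n_total,
    of which n_path belong to the pathway."""
    upper = min(n_sig, n_path)
    tail = sum(math.comb(n_path, x) * math.comb(n_total - n_path, n_sig - x)
               for x in range(n_overlap, upper + 1))
    return tail / math.comb(n_total, n_sig)

def pathway_pvalue(gene_p, pathway, alpha=0.05):
    """Over-representation p-value of one pathway given gene-level p-values."""
    genes = set(gene_p)
    sig = {g for g, p in gene_p.items() if p <= alpha}
    members = genes & set(pathway)
    return hypergeom_enrichment(len(genes), len(sig), len(members),
                                len(sig & members))

def mape_g(studies, pathway, alpha=0.05):
    """Gene-level sketch: combine each shared gene across studies,
    then test the pathway once on the combined gene p-values."""
    shared = set.intersection(*(set(s) for s in studies))
    combined = {g: fisher_combine([s[g] for s in studies]) for g in shared}
    return pathway_pvalue(combined, pathway, alpha)

def mape_p(studies, pathway, alpha=0.05):
    """Pathway-level sketch: test the pathway within each study,
    then combine the per-study pathway p-values."""
    return fisher_combine([pathway_pvalue(s, pathway, alpha) for s in studies])

# Toy gene-level p-values (hypothetical); note the gene sets differ by study.
study_a = {"g1": 0.01, "g2": 0.02, "g3": 0.60, "g4": 0.80}
study_b = {"g1": 0.03, "g2": 0.20, "g3": 0.50, "g5": 0.90}
pathway = {"g1", "g2"}

print("gene-level:", mape_g([study_a, study_b], pathway))
print("pathway-level:", mape_p([study_a, study_b], pathway))
```

In this sketch the gene-level function must first restrict to genes present in every study, while the pathway-level function uses each study's own gene list, which mirrors the abstract's remark that MAPE_P does not require gene matching across studies.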
Pages: 1316-1323
Page count: 8