Pathway analysis with next-generation sequencing data

Times cited: 4
Authors
Zhao, Jinying [1]
Zhu, Yun [1]
Boerwinkle, Eric [2]
Xiong, Momiao [2]
Affiliations
[1] Tulane Univ, Dept Epidemiol, Sch Publ Hlth & Trop Med, New Orleans, LA 70118 USA
[2] Univ Texas Hlth Sci Ctr Houston, Ctr Human Genet, Div Biostat, POB 20186, Houston, TX 77225 USA
Funding
National Institutes of Health (USA);
Keywords
SET ENRICHMENT ANALYSIS; THERAPEUTIC ANGIOGENESIS; CARDIOVASCULAR-DISEASE; RARE VARIANTS; GENE; ASSOCIATION; SNPS;
DOI
10.1038/ejhg.2014.121
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010 ; 081704 ;
Abstract
Although pathway analysis methods have been developed and successfully applied to association studies of common variants, statistical methods for pathway-based association analysis of rare variants are not yet well developed. Many investigators have observed highly inflated false-positive rates and low power in pathway-based tests of association for rare variants. These inflated false-positive rates and low true-positive rates are mainly due to the inability of current methods to account for gametic phase disequilibrium. To overcome these limitations, we develop a novel statistic based on smoothed functional principal component analysis (SFPCA) for pathway association tests with next-generation sequencing data. The statistic captures position-level variant information and accounts for gametic phase disequilibrium. Through intensive simulations, we demonstrate that the SFPCA-based statistic has correct type 1 error rates when testing pathway association with rare variants, common variants, or both. We also evaluate the power of the SFPCA-based statistic and 22 additional existing statistics, and find that the SFPCA-based statistic has much higher power than the other statistics in all scenarios considered. To further evaluate its performance, we apply the SFPCA-based statistic to pathway analysis of exome sequencing data from the early-onset myocardial infarction (EOMI) project. We identify three pathways significantly associated with EOMI after Bonferroni correction. In addition, our preliminary results show that the SFPCA-based statistic yields much smaller P-values for pathway association than other existing methods.
Pages: 507-515
Number of pages: 9