Pathway analysis with next-generation sequencing data

被引:4
|
作者
Zhao, Jinying [1 ]
Zhu, Yun [1 ]
Boerwinkle, Eric [2 ]
Xiong, Momiao [2 ]
机构
[1] Tulane Univ, Dept Epidemiol, Sch Publ Hlth & Trop Med, New Orleans, LA 70118 USA
[2] Univ Texas Hlth Sci Ctr Houston, Ctr Human Genet, Div Biostat, POB 20186, Houston, TX 77225 USA
基金
美国国家卫生研究院;
关键词
SET ENRICHMENT ANALYSIS; THERAPEUTIC ANGIOGENESIS; CARDIOVASCULAR-DISEASE; RARE VARIANTS; GENE; ASSOCIATION; SNPS;
D O I
10.1038/ejhg.2014.121
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Although pathway analysis methods have been developed and successfully applied to association studies of common variants, the statistical methods for pathway-based association analysis of rare variants have not been well developed. Many investigators observed highly inflated false-positive rates and low power in pathway-based tests of association of rare variants. The inflated false-positive rates and low true-positive rates of the current methods are mainly due to their lack of ability to account for gametic phase disequilibrium. To overcome these serious limitations, we develop a novel statistic that is based on the smoothed functional principal component analysis (SFPCA) for pathway association tests with next-generation sequencing data. The developed statistic has the ability to capture position-level variant information and account for gametic phase disequilibrium. By intensive simulations, we demonstrate that the SFPCA-based statistic for testing pathway association with either rare or common or both rare and common variants has the correct type 1 error rates. Also the power of the SFPCA-based statistic and 22 additional existing statistics are evaluated. We found that the SFPCA-based statistic has a much higher power than other existing statistics in all the scenarios considered. To further evaluate its performance, the SFPCA-based statistic is applied to pathway analysis of exome sequencing data in the early-onset myocardial infarction (EOMI) project. We identify three pathways significantly associated with EOMI after the Bonferroni correction. In addition, our preliminary results show that the SFPCA-based statistic has much smaller P-values to identify pathway association than other existing methods.
引用
收藏
页码:507 / 515
页数:9
相关论文
共 50 条
  • [41] Influence of Environmental Factors on Salivary Microbiota and Their Metabolic Pathway: Next-Generation Sequencing Approach
    Zhang, Wei
    Qi, Tao
    Yao, Lihe
    Wang, Wei
    Yu, Fanrong
    Yan, Yuqin
    Salama, El-Sayed
    Su, Shaochen
    Bai, Ming
    MICROBIAL ECOLOGY, 2023, 85 (01) : 317 - 329
  • [42] Are Next-Generation Sequencing Tools Ready for the Cloud?
    Celesti, Antonio
    Celesti, Fabrizio
    Fazio, Maria
    Bramanti, Placido
    Villari, Massimo
    TRENDS IN BIOTECHNOLOGY, 2017, 35 (06) : 486 - 489
  • [43] Analysis of Pilin Antigenic Variation in Neisseria meningitidis by Next-Generation Sequencing
    Xu, Jing
    Seifert, H. Steven
    JOURNAL OF BACTERIOLOGY, 2018, 200 (22)
  • [44] Harnessing virtual machines to simplify next-generation DNA sequencing analysis
    Nocq, Julie
    Celton, Magalie
    Gendron, Patrick
    Lemieux, Sebastien
    Wilhelm, Brian T.
    BIOINFORMATICS, 2013, 29 (17) : 2075 - 2083
  • [45] Mutational analysis of CFTR in the Ecuadorian population using next-generation sequencing
    Carlos Ruiz-Cabezas, Juan
    Barros, Francisco
    Sobrino, Beatriz
    Garcia, Gustavo
    Burgos, Ramiro
    Farhat, Carlos
    Castro, Antonella
    Munoz, Lenin
    Karina Zambrano, Ana
    Martinez, Mariela
    Montalvan, Martha
    Paz-y-Mino, Cesar
    GENE, 2019, 696 : 28 - 32
  • [46] Environmental Monitoring: Inferring the Diatom Index from Next-Generation Sequencing Data
    Visco, Joana Amorim
    Apotheloz-Perret-Gentil, Laure
    Cordonier, Arielle
    Esling, Philippe
    Pillet, Loic
    Pawlowski, Jan
    ENVIRONMENTAL SCIENCE & TECHNOLOGY, 2015, 49 (13) : 7597 - 7605
  • [47] Shared Segment Analysis and Next-Generation Sequencing Implicates the Retinoic Acid Signaling Pathway in Total Anomalous Pulmonary Venous Return (TAPVR)
    Nash, Dustin
    Arrington, Cammon B.
    Kennedy, Brett J.
    Yandell, Mark
    Wu, Wilfred
    Zhang, Wenying
    Ware, Stephanie
    Jorde, Lynn B.
    Gruber, Peter J.
    Yost, H. Joseph
    Bowles, Neil E.
    Bleyl, Steven B.
    PLOS ONE, 2015, 10 (06):
  • [48] Real-world data analysis of next-generation sequencing and corresponding clinical characteristics in thyroid tumor
    Chen, Xu-Feng
    He, Cong
    Yu, Peng-Cheng
    Ye, Wei-Dong
    Han, Pei-Zheng
    Hu, Jia-Qian
    Wang, Yu-Long
    ENDOCRINE CONNECTIONS, 2024, 13 (11)
  • [49] ShRangeSim: Simulation of Single Nucleotide Polymorphism Clusters in Next-Generation Sequencing Data
    Boenn, Markus
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2018, 25 (06) : 613 - 622
  • [50] VarWalker: Personalized Mutation Network Analysis of Putative Cancer Genes from Next-Generation Sequencing Data
    Jia, Peilin
    Zhao, Zhongming
    PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (02)