Pathway analysis with next-generation sequencing data

被引:4
|
作者
Zhao, Jinying [1 ]
Zhu, Yun [1 ]
Boerwinkle, Eric [2 ]
Xiong, Momiao [2 ]
机构
[1] Tulane Univ, Dept Epidemiol, Sch Publ Hlth & Trop Med, New Orleans, LA 70118 USA
[2] Univ Texas Hlth Sci Ctr Houston, Ctr Human Genet, Div Biostat, POB 20186, Houston, TX 77225 USA
基金
美国国家卫生研究院;
关键词
SET ENRICHMENT ANALYSIS; THERAPEUTIC ANGIOGENESIS; CARDIOVASCULAR-DISEASE; RARE VARIANTS; GENE; ASSOCIATION; SNPS;
D O I
10.1038/ejhg.2014.121
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Although pathway analysis methods have been developed and successfully applied to association studies of common variants, the statistical methods for pathway-based association analysis of rare variants have not been well developed. Many investigators observed highly inflated false-positive rates and low power in pathway-based tests of association of rare variants. The inflated false-positive rates and low true-positive rates of the current methods are mainly due to their lack of ability to account for gametic phase disequilibrium. To overcome these serious limitations, we develop a novel statistic that is based on the smoothed functional principal component analysis (SFPCA) for pathway association tests with next-generation sequencing data. The developed statistic has the ability to capture position-level variant information and account for gametic phase disequilibrium. By intensive simulations, we demonstrate that the SFPCA-based statistic for testing pathway association with either rare or common or both rare and common variants has the correct type 1 error rates. Also the power of the SFPCA-based statistic and 22 additional existing statistics are evaluated. We found that the SFPCA-based statistic has a much higher power than other existing statistics in all the scenarios considered. To further evaluate its performance, the SFPCA-based statistic is applied to pathway analysis of exome sequencing data in the early-onset myocardial infarction (EOMI) project. We identify three pathways significantly associated with EOMI after the Bonferroni correction. In addition, our preliminary results show that the SFPCA-based statistic has much smaller P-values to identify pathway association than other existing methods.
引用
收藏
页码:507 / 515
页数:9
相关论文
共 50 条
  • [21] Next-generation sequencing: recent applications to the analysis of colorectal cancer
    Del Vecchio, Filippo
    Mastroiaco, Valentina
    Di Marco, Antinisca
    Compagnoni, Chiara
    Capece, Daria
    Zazzeroni, Francesca
    Capalbo, Carlo
    Alesse, Edoardo
    Tessitore, Alessandra
    JOURNAL OF TRANSLATIONAL MEDICINE, 2017, 15
  • [22] Challenges in clinical interpretation of next-generation sequencing data: Advantages and Pitfalls
    Karakoyun, Hilal Keskin
    Sayar, Ceyhan
    Yararbas, Kanay
    RESULTS IN ENGINEERING, 2023, 20
  • [23] Improving the estimation of genetic distances from Next-Generation Sequencing data
    Vieira, Filipe G.
    Lassalle, Florent
    Korneliussen, Thorfinn S.
    Fumagalli, Matteo
    BIOLOGICAL JOURNAL OF THE LINNEAN SOCIETY, 2016, 117 (01) : 139 - 149
  • [24] Identification of recombination events in outbred species with next-generation sequencing data
    Tao, Shentong
    Wu, Jiyan
    Yao, Dan
    Chen, Yuhua
    Yang, Wenguo
    Tong, Chunfa
    BMC GENOMICS, 2018, 19
  • [25] Quantifying Population Genetic Differentiation from Next-Generation Sequencing Data
    Fumagalli, Matteo
    Vieira, Filipe G.
    Korneliussen, Thorfinn Sand
    Linderoth, Tyler
    Huerta-Sanchez, Emilia
    Albrechtsen, Anders
    Nielsen, Rasmus
    GENETICS, 2013, 195 (03) : 979 - +
  • [26] HomSI: a homozygous stretch identifier from next-generation sequencing data
    Gormez, Zeliha
    Bakir-Gungor, Burcu
    Sagiroglu, Mahmut Samil
    BIOINFORMATICS, 2014, 30 (03) : 445 - 447
  • [27] Targeted next-generation sequencing analysis in Italian patients with keratoconus
    Lombardo, Marco
    Camellin, Umberto
    Gioia, Raffaella
    Serrao, Sebastiano
    Scorcia, Vincenzo
    Roszkowska, Anna Maria
    Lombardo, Giuseppe
    Bertelli, Matteo
    Medori, Maria Chiara
    Fegatelli, Danilo Alunni
    Vestri, Annarita
    Mencucci, Rita
    Lomoriello, Domenico Schiano
    EYE, 2024, 38 (13) : 2610 - 2618
  • [28] HLAscan: genotyping of the HLA region using next-generation sequencing data
    Ka, Sojeong
    Lee, Sunho
    Hong, Jonghee
    Cho, Yangrae
    Sung, Joohon
    Kim, Han-Na
    Kim, Hyung-Lae
    Jung, Jongsun
    BMC BIOINFORMATICS, 2017, 18
  • [29] SNVer: a statistical tool for variant calling in analysis of pooled or individual next-generation sequencing data
    Wei, Zhi
    Wang, Wei
    Hu, Pingzhao
    Lyon, Gholson J.
    Hakonarson, Hakon
    NUCLEIC ACIDS RESEARCH, 2011, 39 (19)
  • [30] Identification and interaction analysis of molecular markers in myocardial infarction by bioinformatics and next-generation sequencing data analysis
    Vastrad, Basavaraj
    Vastrad, Chanabasayya
    EGYPTIAN JOURNAL OF MEDICAL HUMAN GENETICS, 2024, 25 (01)