Pathway analysis with next-generation sequencing data

被引:4
|
作者
Zhao, Jinying [1 ]
Zhu, Yun [1 ]
Boerwinkle, Eric [2 ]
Xiong, Momiao [2 ]
机构
[1] Tulane Univ, Dept Epidemiol, Sch Publ Hlth & Trop Med, New Orleans, LA 70118 USA
[2] Univ Texas Hlth Sci Ctr Houston, Ctr Human Genet, Div Biostat, POB 20186, Houston, TX 77225 USA
基金
美国国家卫生研究院;
关键词
SET ENRICHMENT ANALYSIS; THERAPEUTIC ANGIOGENESIS; CARDIOVASCULAR-DISEASE; RARE VARIANTS; GENE; ASSOCIATION; SNPS;
D O I
10.1038/ejhg.2014.121
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Although pathway analysis methods have been developed and successfully applied to association studies of common variants, the statistical methods for pathway-based association analysis of rare variants have not been well developed. Many investigators observed highly inflated false-positive rates and low power in pathway-based tests of association of rare variants. The inflated false-positive rates and low true-positive rates of the current methods are mainly due to their lack of ability to account for gametic phase disequilibrium. To overcome these serious limitations, we develop a novel statistic that is based on the smoothed functional principal component analysis (SFPCA) for pathway association tests with next-generation sequencing data. The developed statistic has the ability to capture position-level variant information and account for gametic phase disequilibrium. By intensive simulations, we demonstrate that the SFPCA-based statistic for testing pathway association with either rare or common or both rare and common variants has the correct type 1 error rates. Also the power of the SFPCA-based statistic and 22 additional existing statistics are evaluated. We found that the SFPCA-based statistic has a much higher power than other existing statistics in all the scenarios considered. To further evaluate its performance, the SFPCA-based statistic is applied to pathway analysis of exome sequencing data in the early-onset myocardial infarction (EOMI) project. We identify three pathways significantly associated with EOMI after the Bonferroni correction. In addition, our preliminary results show that the SFPCA-based statistic has much smaller P-values to identify pathway association than other existing methods.
引用
收藏
页码:507 / 515
页数:9
相关论文
共 50 条
  • [1] Applications and data analysis of next-generation sequencing
    Vogl, Ina
    Benet-Pages, Anna
    Eck, Sebastian H.
    Kuhn, Marius
    Vosberg, Sebastian
    Greif, Philipp A.
    Metzeler, Klaus H.
    Biskup, Saskia
    Mueller-Reible, Clemens
    Klein, Hanns-Georg
    LABORATORIUMSMEDIZIN-JOURNAL OF LABORATORY MEDICINE, 2013, 37 (06): : 305 - 315
  • [2] Seq2pathway: an R/Bioconductor package for pathway analysis of next-generation sequencing data
    Wang, Bin
    Cunningham, John M.
    Yang, Xinan
    BIOINFORMATICS, 2015, 31 (18) : 3043 - 3045
  • [3] Association Testing for Next-Generation Sequencing Data Using Score Statistics
    Skotte, Line
    Korneliussen, Thorfinn Sand
    Albrechtsen, Anders
    GENETIC EPIDEMIOLOGY, 2012, 36 (05) : 430 - 437
  • [4] Zseq: An Approach for Preprocessing Next-Generation Sequencing Data
    Alkhateeb, Abedalrhman
    Rueda, Luis
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2017, 24 (08) : 746 - 755
  • [5] Next-generation sequencing in dermatology
    King, Andrew D.
    Deirawan, Hany
    Klein, Paytra A.
    Dasgeb, Bahar
    Dumur, Catherine I.
    Mehregan, Darius R.
    FRONTIERS IN MEDICINE, 2023, 10
  • [6] Next-generation sequencing of patients with congenital anosmia
    Alkelai, Anna
    Olender, Tsviya
    Dode, Catherine
    Shushan, Sagit
    Tatarskyy, Pavel
    Furman-Haran, Edna
    Boyko, Valery
    Gross-Isseroff, Ruth
    Halvorsen, Matthew
    Greenbaum, Lior
    Milgrom, Roni
    Yamada, Kazuya
    Haneishi, Ayumi
    Blau, Ilan
    Lancet, Doron
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2017, 25 (12) : 1377 - 1387
  • [7] Using population data for assessing next-generation sequencing performance
    Houniet, Darren T.
    Rahman, Thahira J.
    Al Turki, Saeed
    Hurles, Matthew E.
    Xu, Yaobo
    Goodship, Judith
    Keavney, Bernard
    Santibanez Koref, Mauro
    BIOINFORMATICS, 2015, 31 (01) : 56 - 61
  • [8] sPIS: A Small RNA Illustration System for Plant Next-Generation Sequencing Data Analysis
    Tseng, Kuan-Chieh
    Chiang-Hsieh, Yi-Fan
    Pai, Hsuan
    Wu, Nai-Yun
    Zheng, Han-Qin
    Chow, Chi-Nga
    Lee, Tzong-Yi
    Chang, Song-Bin
    Lin, Na-Sheng
    Chang, Wen-Chi
    PLANT AND CELL PHYSIOLOGY, 2020, 61 (06) : 1204 - 1212
  • [9] Linear mixed models for association analysis of quantitative traits with next-generation sequencing data
    Chiu, Chi-yang
    Yuan, Fang
    Zhang, Bingsong
    Yuan, Ao
    Li, Xin
    Fang, Hong-Bin
    Lange, Kenneth
    Weeks, Daniel E.
    Wilson, Alexander F.
    Bailey-Wilson, Joan E.
    Musolf, Anthony M.
    Stambolian, Dwight
    Lakhal-Chaieb, M'Hamed Lajmi
    Cook, Richard J.
    McMahon, Francis J.
    Amos, Christopher I.
    Xiong, Momiao
    Fan, Ruzong
    GENETIC EPIDEMIOLOGY, 2019, 43 (02) : 189 - 206
  • [10] Analysis of population-specific pharmacogenomic variants using next-generation sequencing data
    Ahn, Eunyong
    Park, Taesung
    SCIENTIFIC REPORTS, 2017, 7