AP-SKAT: highly-efficient genome-wide rare variant association test

被引:5
|
作者
Hasegawa, Takanori [1 ]
Kojima, Kaname [1 ]
Kawai, Yosuke [1 ]
Misawa, Kazuharu [1 ]
Mimori, Takahiro [1 ]
Nagasaki, Masao [1 ]
机构
[1] Tohoku Univ, Dept Integrat Genom, Tohoku Med Megabank Org, Aoba Ku, 2-1 Seiryo Machi, Sendai, Miyagi, Japan
来源
BMC GENOMICS | 2016年 / 17卷
基金
英国惠康基金;
关键词
Genome wide association study; Multiple test; Rare variants;
D O I
10.1186/s12864-016-3094-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Genome-wide association studies have revealed associations between single-nucleotide polymorphisms (SNPs) and phenotypes such as disease symptoms and drug tolerance. To address the small sample size for rare variants, association studies tend to group gene or pathway level variants and evaluate the effect on the set of variants. One of such strategies, known as the sequential kernel association test (SKAT), is a widely used collapsing method. However, the reported p-values from SKAT tend to be biased because the asymptotic property of the statistic is used to calculate the p-value. Although this bias can be corrected by applying permutation procedures for the test statistics, the computational cost of obtaining p-values with high resolution is prohibitive. Results: To address this problem, we devise an adaptive SKAT procedure termed AP-SKAT that efficiently classifies significant SNP sets and ranks them according to the permuted p-values. Our procedure adaptively stops the permutation test when the significance level is outside some confidence interval of the estimated p-value for a binomial distribution. To evaluate the performance, we first compare the power and sample size calculation and the type I error rates estimate of SKAT, SKAT-O, and the proposed procedure using genotype data in the SKAT R package and from 1000 Genome Project. Through computational experiments using whole genome sequencing and SNP array data, we show that our proposed procedure is highly efficient and has comparable accuracy to the standard procedure. Conclusions: For several types of genetic data, the developed procedure could achieve competitive power and sample size under small and large sample size conditions with controlling considerable type I error rates, and estimate p-values of significant SNP sets that are consistent with those estimated by the standard permutation test within a realistic time. This demonstrates that the procedure is sufficiently powerful for recent whole genome sequencing and SNP array data with increasing numbers of phenotypes. Additionally, this procedure can be used in other association tests by employing alternative methods to calculate the statistics.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] AP-SKAT: highly-efficient genome-wide rare variant association test
    Takanori Hasegawa
    Kaname Kojima
    Yosuke Kawai
    Kazuharu Misawa
    Takahiro Mimori
    Masao Nagasaki
    BMC Genomics, 17
  • [2] Rare-Variant Studies to Complement Genome-Wide Association Studies
    Sazonovs, A.
    Barrett, J. C.
    ANNUAL REVIEW OF GENOMICS AND HUMAN GENETICS, VOL 19, 2018, 19 : 97 - 112
  • [3] Rare-variant genome-wide association studies: a new frontier in genetic analysis of complex traits
    Wagner, Michael J.
    PHARMACOGENOMICS, 2013, 14 (04) : 413 - 424
  • [4] Multi-SKAT: General framework to test for rare-variant association with multiple phenotypes
    Dutta, Diptavo
    Scott, Laura
    Boehnke, Michael
    Lee, Seunggeun
    GENETIC EPIDEMIOLOGY, 2019, 43 (01) : 4 - 23
  • [5] A Genome-Wide Association Study and Rare Variant Analysis for Dupuytren Disease in a North American Population
    Grandizio, Louis C.
    Smelser, Diane T.
    Haley, Jeremy S.
    Delma, Stephanie
    Klena, Joel C.
    Carey, David J.
    JOURNAL OF HAND SURGERY-AMERICAN VOLUME, 2025, 50 (02): : 147 - 155
  • [6] An efficient genome-wide association test for multivariate phenotypes based on the Fisher combination function
    James J. Yang
    Jia Li
    L. Keoki Williams
    Anne Buu
    BMC Bioinformatics, 17
  • [7] An efficient genome-wide association test for multivariate phenotypes based on the Fisher combination function
    Yang, James J.
    Li, Jia
    Williams, L. Keoki
    Buu, Anne
    BMC BIOINFORMATICS, 2016, 17
  • [8] Efficient verification for outsourced genome-wide association studies
    Wang, Xinyue
    Jiang, Xiaoqian
    Vaidya, Jaideep
    JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 117
  • [9] A genome-wide association study of bronchodilator response in Latinos implicates rare variants
    Drake, Katherine A.
    Torgerson, Dara G.
    Gignoux, Christopher R.
    Galanter, Joshua M.
    Roth, Lindsey A.
    Huntsman, Scott
    Eng, Celeste
    Oh, Sam S.
    Yee, Sook Wah
    Lin, Lawrence
    Bustamante, Carlos D.
    Moreno-Estrada, Andres
    Sandoval, Karla
    Davis, Adam
    Borrell, Luisa N.
    Farber, Harold J.
    Kumar, Rajesh
    Avila, Pedro C.
    Brigino-Buenaventura, Emerita
    Chapela, Rocio
    Ford, Jean G.
    LeNoir, Michael A.
    Lurmann, Fred
    Meade, Kelley
    Serebrisky, Denise
    Thyne, Shannon
    Rodriguez-Cintron, William
    Sen, Saunak
    Rodriguez-Santana, Jose R.
    Hernandez, Ryan D.
    Giacomini, Kathleen M.
    Burchard, Esteban G.
    JOURNAL OF ALLERGY AND CLINICAL IMMUNOLOGY, 2014, 133 (02) : 370 - +
  • [10] A genome-wide association study of a coronary artery disease risk variant
    Ji-Young Lee
    Bok-Soo Lee
    Dong-Jik Shin
    Kyung Woo Park
    Young-Ah Shin
    Kwang Joong Kim
    Lyong Heo
    Ji Young Lee
    Yun Kyoung Kim
    Young Jin Kim
    Chang Bum Hong
    Sang-Hak Lee
    Dankyu Yoon
    Hyo Jung Ku
    Il-Young Oh
    Bong-Jo Kim
    Juyoung Lee
    Seon-Joo Park
    Jimin Kim
    Hye-kyung Kawk
    Jong-Eun Lee
    Hye-kyung Park
    Jae-Eun Lee
    Hye-young Nam
    Hyun-young Park
    Chol Shin
    Mitsuhiro Yokota
    Hiroyuki Asano
    Masahiro Nakatochi
    Tatsuaki Matsubara
    Hidetoshi Kitajima
    Ken Yamamoto
    Hyung-Lae Kim
    Bok-Ghee Han
    Myeong-Chan Cho
    Yangsoo Jang
    Hyo-Soo Kim
    Jeong Euy Park
    Jong-Young Lee
    Journal of Human Genetics, 2013, 58 : 120 - 126