AP-SKAT: highly-efficient genome-wide rare variant association test

被引:5
|
作者
Hasegawa, Takanori [1 ]
Kojima, Kaname [1 ]
Kawai, Yosuke [1 ]
Misawa, Kazuharu [1 ]
Mimori, Takahiro [1 ]
Nagasaki, Masao [1 ]
机构
[1] Tohoku Univ, Dept Integrat Genom, Tohoku Med Megabank Org, Aoba Ku, 2-1 Seiryo Machi, Sendai, Miyagi, Japan
来源
BMC GENOMICS | 2016年 / 17卷
基金
英国惠康基金;
关键词
Genome wide association study; Multiple test; Rare variants;
D O I
10.1186/s12864-016-3094-3
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Genome-wide association studies have revealed associations between single-nucleotide polymorphisms (SNPs) and phenotypes such as disease symptoms and drug tolerance. To address the small sample size for rare variants, association studies tend to group gene or pathway level variants and evaluate the effect on the set of variants. One of such strategies, known as the sequential kernel association test (SKAT), is a widely used collapsing method. However, the reported p-values from SKAT tend to be biased because the asymptotic property of the statistic is used to calculate the p-value. Although this bias can be corrected by applying permutation procedures for the test statistics, the computational cost of obtaining p-values with high resolution is prohibitive. Results: To address this problem, we devise an adaptive SKAT procedure termed AP-SKAT that efficiently classifies significant SNP sets and ranks them according to the permuted p-values. Our procedure adaptively stops the permutation test when the significance level is outside some confidence interval of the estimated p-value for a binomial distribution. To evaluate the performance, we first compare the power and sample size calculation and the type I error rates estimate of SKAT, SKAT-O, and the proposed procedure using genotype data in the SKAT R package and from 1000 Genome Project. Through computational experiments using whole genome sequencing and SNP array data, we show that our proposed procedure is highly efficient and has comparable accuracy to the standard procedure. Conclusions: For several types of genetic data, the developed procedure could achieve competitive power and sample size under small and large sample size conditions with controlling considerable type I error rates, and estimate p-values of significant SNP sets that are consistent with those estimated by the standard permutation test within a realistic time. This demonstrates that the procedure is sufficiently powerful for recent whole genome sequencing and SNP array data with increasing numbers of phenotypes. Additionally, this procedure can be used in other association tests by employing alternative methods to calculate the statistics.
引用
收藏
页数:8
相关论文
共 50 条
  • [21] Genome-wide association study of pain sensitivity assessed by questionnaire and the cold pressor test
    Fontanillas, Pierre
    Kless, Achim
    Bothmer, John
    Tung, Joyce Y.
    PAIN, 2022, 163 (09) : 1763 - 1776
  • [22] Genome-wide association study of cognitive flexibility assessed by the Wisconsin Card Sorting Test
    Zhang, Huiping
    Zhou, Hang
    Lencz, Todd
    Farrer, Lindsay A.
    Kranzler, Henry R.
    Gelernter, Joel
    AMERICAN JOURNAL OF MEDICAL GENETICS PART B-NEUROPSYCHIATRIC GENETICS, 2018, 177 (05) : 511 - 519
  • [23] Associations of single nucleotide polymorphisms with mucinous colorectal cancer: genome-wide common variant and gene-based rare variant analyses
    Penney, Michelle E.
    Parfrey, Patrick S.
    Savas, Sevtap
    Yilmaz, Yildiz E.
    BIOMARKER RESEARCH, 2018, 6
  • [24] Associations of single nucleotide polymorphisms with mucinous colorectal cancer: genome-wide common variant and gene-based rare variant analyses
    Michelle E. Penney
    Patrick S. Parfrey
    Sevtap Savas
    Yildiz E. Yilmaz
    Biomarker Research, 6
  • [25] Rare Functional Variants in Genome-Wide Association Identified Candidate Genes for Nonsyndromic Clefts in the African Population
    Butali, Azeez
    Mossey, Peter
    Adeyemo, Wasiu
    Eshete, Mekonen
    Gaines, Lauren
    Braimah, Ramat
    Aregbesola, Babatunde
    Rigdon, Jennifer
    Emeka, Christian
    Olutayo, James
    Ogunlewe, Olugbenga
    Ladeinde, Akinola
    Abate, Fikre
    Hailu, Taye
    Mohammed, Ibrahim
    Gravem, Paul
    Deribew, Milliard
    Gesses, Mulualem
    Adeyemo, Adebowale
    Marazita, Mary
    Murray, Jeffrey
    AMERICAN JOURNAL OF MEDICAL GENETICS PART A, 2014, 164 (10) : 2567 - 2571
  • [26] Efficient Genome-Wide Association Testing of Gene-Environment Interaction in Case-Parent Trios
    Gauderman, W. James
    Thomas, Duncan C.
    Murcray, Cassandra E.
    Conti, David
    Li, Dalin
    Lewinger, Juan Pablo
    AMERICAN JOURNAL OF EPIDEMIOLOGY, 2010, 172 (01) : 116 - 122
  • [27] Efficient Adaptively Weighted Analysis of Secondary Phenotypes in Case-Control Genome-Wide Association Studies
    Li, Huilin
    Gail, Mitchell H.
    HUMAN HEREDITY, 2012, 73 (03) : 159 - 173
  • [28] Maliciously Secure and Efficient Large-Scale Genome-Wide Association Study With Multi-Party Computation
    Dong, Caiqin
    Weng, Jian
    Liu, Jia-Nan
    Yang, Anjia
    Liu, Zhiquan
    Yang, Yaxi
    Ma, Jianfeng
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2023, 20 (02) : 1243 - 1257
  • [29] Genome-wide association study toward efficient selection breeding of resistance to Vibrio alginolyticus in Pacific oyster, Crassostrea gigas
    Yang, Ben
    Zhai, Shangyu
    Zhang, Fuqiang
    Wang, Hebing
    Ren, Liting
    Li, Yongjing
    Li, Qi
    Liu, Shikai
    AQUACULTURE, 2022, 548
  • [30] The obesity-risk variant of FTO is inversely related with the So-Eum constitutional type: genome-wide association and replication analyses
    Cha, Seongwon
    Yu, Hyunjoo
    Park, Ah Yeon
    Oh, Soo A.
    Kim, Jong Yeol
    BMC COMPLEMENTARY AND ALTERNATIVE MEDICINE, 2015, 15