BGWAS: Bayesian variable selection in linear mixed models with nonlocal priors for genome-wide association studies

被引:4
作者
Williams, Jacob [1 ]
Xu, Shuangshuang [1 ]
Ferreira, Marco A. R. [1 ]
机构
[1] Virginia Tech, Dept Stat, Blacksburg, VA 24061 USA
基金
美国国家科学基金会;
关键词
GWAS; Bayesian; Model selection;
D O I
10.1186/s12859-023-05316-x
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundGenome-wide association studies (GWAS) seek to identify single nucleotide polymorphisms (SNPs) that cause observed phenotypes. However, with highly correlated SNPs, correlated observations, and the number of SNPs being two orders of magnitude larger than the number of observations, GWAS procedures often suffer from high false positive rates.ResultsWe propose BGWAS, a novel Bayesian variable selection method based on nonlocal priors for linear mixed models specifically tailored for genome-wide association studies. Our proposed method BGWAS uses a novel nonlocal prior for linear mixed models (LMMs). BGWAS has two steps: screening and model selection. The screening step scans through all the SNPs fitting one LMM for each SNP and then uses Bayesian false discovery control to select a set of candidate SNPs. After that, a model selection step searches through the space of LMMs that may have any number of SNPs from the candidate set. A simulation study shows that, when compared to popular GWAS procedures, BGWAS greatly reduces false positives while maintaining the same ability to detect true positive SNPs. We show the utility and flexibility of BGWAS with two case studies: a case study on salt stress in plants, and a case study on alcohol use disorder.ConclusionsBGWAS maintains and in some cases increases the recall of true SNPs while drastically lowering the number of false positives compared to popular SMA procedures.
引用
收藏
页数:20
相关论文
共 31 条
[21]  
Scrucca L, 2013, J STAT SOFTW, V53, P1
[22]   3D Strain helps relating LV function to LV and structure in athletes [J].
Stefani, Laura ;
De Luca, Alessio ;
Toncelli, Loira ;
Pedrizzetti, Gianni ;
Galanti, Giorgio .
CARDIOVASCULAR ULTRASOUND, 2014, 12
[23]   Underestimated Effect Sizes in GWAS: Fundamental Limitations of Single SNP Analysis for Dichotomous Phenotypes [J].
Stringer, Sven ;
Wray, Naomi R. ;
Kahn, Rene S. ;
Derks, Eske M. .
PLOS ONE, 2011, 6 (11)
[24]   Identification of novel risk loci with shared effects on alcoholism, heroin, and methamphetamine dependence [J].
Sun, Yan ;
Chang, Suhua ;
Liu, Zhen ;
Zhang, Libo ;
Wang, Fan ;
Yue, Weihua ;
Sun, Hongqiang ;
Ni, Zhaojun ;
Chang, Xiangwen ;
Zhang, Yibing ;
Chen, Yang ;
Liu, Jiqiang ;
Lu, Lin ;
Shi, Jie .
MOLECULAR PSYCHIATRY, 2021, 26 (04) :1152-1161
[25]   BICOSS: Bayesian iterative conditional stochastic search for GWAS [J].
Williams, Jacob ;
Ferreira, Marco A. R. ;
Ji, Tieming .
BMC BIOINFORMATICS, 2022, 23 (01)
[26]   Hyper Nonlocal Priors for Variable Selection in Generalized Linear Models [J].
Wu, Ho-Hsiang ;
Ferreira, Marco A. R. ;
Elkhouly, Mohamed ;
Ji, Tieming .
SANKHYA-SERIES A-MATHEMATICAL STATISTICS AND PROBABILITY, 2020, 82 (01) :147-185
[27]   Modeling allele-specific expression at the gene and SNP levels simultaneously by a Bayesian logistic mixed regression model [J].
Xie, Jing ;
Ji, Tieming ;
Ferreira, Marco A. R. ;
Li, Yahan ;
Patel, Bhaumik N. ;
Rivera, Rocio M. .
BMC BIOINFORMATICS, 2019, 20 (01)
[28]   A hybrid bayesian approach for genome-wide association studies on related individuals [J].
Yazdani, A. ;
Dunson, D. B. .
BIOINFORMATICS, 2015, 31 (24) :3890-3896
[29]   A unified mixed-model method for association mapping that accounts for multiple levels of relatedness [J].
Yu, JM ;
Pressoir, G ;
Briggs, WH ;
Bi, IV ;
Yamasaki, M ;
Doebley, JF ;
McMullen, MD ;
Gaut, BS ;
Nielsen, DM ;
Holland, JB ;
Kresovich, S ;
Buckler, ES .
NATURE GENETICS, 2006, 38 (02) :203-208
[30]   Mixed linear model approach adapted for genome-wide association studies [J].
Zhang, Zhiwu ;
Ersoz, Elhan ;
Lai, Chao-Qiang ;
Todhunter, Rory J. ;
Tiwari, Hemant K. ;
Gore, Michael A. ;
Bradbury, Peter J. ;
Yu, Jianming ;
Arnett, Donna K. ;
Ordovas, Jose M. ;
Buckler, Edward S. .
NATURE GENETICS, 2010, 42 (04) :355-U118