Simultaneous selection of multiple important single nucleotide polymorphisms in familial genome wide association studies data

Times cited: 0
Authors
Majumdar, Subhabrata [1 ,2 ]
Basu, Saonli [1 ]
McGue, Matt [1 ]
Chatterjee, Snigdhansu [1 ]
Affiliations
[1] Univ Minnesota Twin Cities, Minneapolis, MN 55455 USA
[2] AI Risk & Vulnerabil Alliance, Seattle, WA 98108 USA
Funding
National Science Foundation (USA);
Keywords
GWAS; TWIN; COMBINATION; VARIANTS; PEDIGREE; SLC6A4; TESTS; MODEL; RISK; LOCI;
DOI
10.1038/s41598-023-35379-y
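Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification code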
07 ; 0710 ; 09 ;
Abstract
We propose a fast, resampling-based variable selection technique for detecting relevant single nucleotide polymorphisms (SNPs) in a multi-marker mixed effect model. Due to computational complexity, current practice primarily involves testing the effect of one SNP at a time, commonly termed 'single SNP association analysis'. Joint modeling of genetic variants within a gene or pathway may have better power to detect associated genetic variants, especially those with weak effects. In this paper, we propose a computationally efficient model selection approach, based on the e-values framework, for single SNP detection in families while utilizing information on multiple SNPs simultaneously. To overcome the computational bottleneck of traditional model selection methods, our method trains a single model and utilizes a fast and scalable bootstrap procedure. We illustrate through numerical studies that our proposed method is more effective in detecting SNPs associated with a trait than either single-marker analysis using family data or model selection methods that ignore the familial dependency structure. Further, we perform a gene-level analysis of the Minnesota Center for Twin and Family Research (MCTFR) dataset using our method and detect several SNPs that have been implicated in alcohol consumption.
Pages: 13