An Improved Expectation-Maximization Bayesian Algorithm for GWAS

Cited by: 0
Authors
Zhang, Ganwen [1]
Zhao, Jianini [1]
Wang, Jieru [1]
Lin, Guo [1]
Li, Lin [1]
Ban, Fengfei [1]
Zhu, Meiting [1]
Wen, Yangjun [1]
Zhang, Jin [1]
Affiliation
[1] Nanjing Agr Univ, Coll Sci, Nanjing 210095, Peoples R China
Keywords
GWAS; Bayesian method; mixed linear model; candidate gene; GENOME-WIDE ASSOCIATION; QUANTITATIVE TRAIT LOCI; MODEL; POPULATION; REGRESSION; LINKAGE;
DOI
10.3390/math12131944
Chinese Library Classification
O1 [Mathematics];
Discipline classification codes
0701; 070101;
Abstract
Genome-wide association studies (GWASs) are flexible and comprehensive tools for identifying single nucleotide polymorphisms (SNPs) associated with complex traits or diseases. Whole-genome Bayesian models are an effective way of incorporating important prior information into modeling, and Bayesian methods have been widely used in association analysis. However, Bayesian analysis is often not feasible because of the high-throughput genotype data and large sample sizes involved. In this study, we propose a new Bayesian algorithm under the mixed linear model framework: the expectation and maximization BayesB Improved algorithm (emBBI). The emBBI algorithm first corrects for polygenic and environmental noise and reduces dimensions; it then estimates marker effects using emBayesB and tests them using the LOD test. We conducted two simulation experiments and analyzed a real dataset related to flowering time in Arabidopsis to validate the new algorithm. The results show that the emBBI algorithm is more flexible and accurate than established methods in simulation studies, and that it performs well under complex genetic backgrounds. The analysis of the real Arabidopsis dataset further illustrates the advantages of the emBBI algorithm for GWAS by detecting known genes. Furthermore, 12 candidate genes are identified in the neighborhood of the significant flowering-related quantitative trait nucleotides (QTNs) in Arabidopsis. We also performed enrichment analysis and tissue expression analysis of the candidate genes, which will help us better understand the genetic basis of flowering-related traits in Arabidopsis.
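The abstract mentions a LOD test for marker effects without giving its form. As a rough illustration only, the sketch below computes a LOD score for a single marker under a plain linear model with Gaussian errors, where the log10 likelihood ratio reduces to (n/2) * log10(RSS0/RSS1). This is an assumption made for illustration, not the emBBI procedure: the function name `lod_score`, the intercept-only null model, and the least-squares fit are all hypothetical choices, and the mixed-model correction and emBayesB estimation described in the abstract are not reproduced here.

```python
import numpy as np

def lod_score(y, x):
    """LOD score for one marker under the simple model y = mu + x*b + e.

    Illustrative only: a generic likelihood-ratio LOD, not the emBBI test.
    """
    y = np.asarray(y, dtype=float)
    x = np.asarray(x, dtype=float)
    n = y.size
    # Null model: intercept only, so the residuals are deviations from the mean.
    rss0 = np.sum((y - y.mean()) ** 2)
    # Alternative model: intercept plus marker effect, fitted by least squares.
    design = np.column_stack([np.ones(n), x])
    beta, *_ = np.linalg.lstsq(design, y, rcond=None)
    rss1 = np.sum((y - design @ beta) ** 2)
    # With Gaussian errors and ML variance estimates, the log10 likelihood
    # ratio of the two models is (n/2) * log10(RSS0 / RSS1).
    return 0.5 * n * np.log10(rss0 / rss1)
```

A marker would then be declared significant when its LOD exceeds a chosen threshold; the threshold itself is not stated in this record.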
Pages: 14