GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction

被引:653
作者
Wang, Jiabo [1 ,2 ]
Zhang, Zhiwu [2 ]
机构
[1] Southwest Minzu Univ, Key Lab Qinghai Tibetan Plateau Anim Genet Resourc, Sichuan Prov & Minist Educ, Chengdu 610041, Peoples R China
[2] Washington State Univ, Dept Crop & Soil Sci, Pullman, WA 99164 USA
基金
美国食品与农业研究所; 美国国家科学基金会;
关键词
GWAS; Genomic selection; Software; R; GAPIT; WIDE ASSOCIATION; POPULATION-STRUCTURE; MODEL APPROACH;
D O I
10.1016/j.gpb.2021.08.005
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genome-wide association study (GWAS) and genomic prediction/selection (GP/GS) are the two essential enterprises in genomic research. Due to the great magnitude and complexity of genomic and phenotypic data, analytical methods and their associated software packages are frequently advanced. GAPIT is a widely-used genomic association and prediction integrated tool as an R package. The first version was released to the public in 2012 with the implementation of the general linear model (GLM), mixed linear model (MLM), compressed MLM (CMLM), and genomic best linear unbiased prediction (gBLUP). The second version was released in 2016 with several new implementations, including enriched CMLM (ECMLM) and settlement of MLMs under progressively exclusive relationship (SUPER). All the GWAS methods are based on the single-locus test. For the first time, in the current release of GAPIT, version 3 implemented three multi-locus test methods, including multiple loci mixed model (MLMM), fixed and random model circulating probability unification (FarmCPU), and Bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK). Ad-ditionally, two GP/GS methods were implemented based on CMLM (named compressed BLUP; cBLUP) and SUPER (named SUPER BLUP; sBLUP). These new implementations not only boost statistical power for GWAS and prediction accuracy for GP/GS, but also improve computing speed and increase the capacity to analyze big genomic data. Here, we document the current upgrade of GAPIT by describing the selection of the recently developed methods, their im-plementations, and potential impact. All documents, including source code, user manual, demo data, and tutorials, are freely available at the GAPIT website (http://zzlab.net/GAPIT).
引用
收藏
页码:629 / 640
页数:12
相关论文
共 35 条
[1]   GenABEL: an R library for genome-wide association analysis [J].
Aulchenko, Yurii S. ;
Ripke, Stephan ;
Isaacs, Aaron ;
Van Duijn, Cornelia M. .
BIOINFORMATICS, 2007, 23 (10) :1294-1296
[3]   TASSEL: software for association mapping of complex traits in diverse samples [J].
Bradbury, Peter J. ;
Zhang, Zhiwu ;
Kroon, Dallas E. ;
Casstevens, Terry M. ;
Ramdoss, Yogesh ;
Buckler, Edward S. .
BIOINFORMATICS, 2007, 23 (19) :2633-2635
[4]   The Genetic Architecture of Maize Flowering Time [J].
Buckler, Edward S. ;
Holland, James B. ;
Bradbury, Peter J. ;
Acharya, Charlotte B. ;
Brown, Patrick J. ;
Browne, Chris ;
Ersoz, Elhan ;
Flint-Garcia, Sherry ;
Garcia, Arturo ;
Glaubitz, Jeffrey C. ;
Goodman, Major M. ;
Harjes, Carlos ;
Guill, Kate ;
Kroon, Dallas E. ;
Larsson, Sara ;
Lepak, Nicholas K. ;
Li, Huihui ;
Mitchell, Sharon E. ;
Pressoir, Gael ;
Peiffer, Jason A. ;
Rosas, Marco Oropeza ;
Rocheford, Torbert R. ;
Cinta Romay, M. ;
Romero, Susan ;
Salvo, Stella ;
Sanchez Villeda, Hector ;
da Silva, H. Sofia ;
Sun, Qi ;
Tian, Feng ;
Upadyayula, Narasimham ;
Ware, Doreen ;
Yates, Heather ;
Yu, Jianming ;
Zhang, Zhiwu ;
Kresovich, Stephen ;
McMullen, Michael D. .
SCIENCE, 2009, 325 (5941) :714-718
[5]   iPat: intelligent prediction and association tool for genomic research [J].
Chen, Chunpeng James ;
Zhang, Zhiwu .
BIOINFORMATICS, 2018, 34 (11) :1925-1927
[6]   Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP [J].
Endelman, Jeffrey B. .
PLANT GENOME, 2011, 4 (03) :250-255
[7]   Extension of the bayesian alphabet for genomic selection [J].
Habier, David ;
Fernando, Rohan L. ;
Kizilkaya, Kadir ;
Garrick, Dorian J. .
BMC BIOINFORMATICS, 2011, 12
[8]   BLINK: a package for the next level of genome-wide association studies with both individuals and markers in the millions [J].
Huang, Meng ;
Liu, Xiaolei ;
Zhou, Yao ;
Summers, Ryan M. ;
Zhang, Zhiwu .
GIGASCIENCE, 2019, 8 (02)
[9]   Efficient control of population structure in model organism association mapping [J].
Kang, Hyun Min ;
Zaitlen, Noah A. ;
Wade, Claire M. ;
Kirby, Andrew ;
Heckerman, David ;
Daly, Mark J. ;
Eskin, Eleazar .
GENETICS, 2008, 178 (03) :1709-1723
[10]   Variance component model to account for sample structure in genome-wide association studies [J].
Kang, Hyun Min ;
Sul, Jae Hoon ;
Service, Susan K. ;
Zaitlen, Noah A. ;
Kong, Sit-yee ;
Freimer, Nelson B. ;
Sabatti, Chiara ;
Eskin, Eleazar .
NATURE GENETICS, 2010, 42 (04) :348-U110