GAPIT Version 3: Boosting Power and Accuracy for Genomic Association and Prediction

被引:509
作者
Wang, Jiabo [1 ,2 ]
Zhang, Zhiwu [2 ]
机构
[1] Southwest Minzu Univ, Key Lab Qinghai Tibetan Plateau Anim Genet Resourc, Sichuan Prov & Minist Educ, Chengdu 610041, Peoples R China
[2] Washington State Univ, Dept Crop & Soil Sci, Pullman, WA 99164 USA
基金
美国国家科学基金会; 美国食品与农业研究所;
关键词
GWAS; Genomic selection; Software; R; GAPIT; WIDE ASSOCIATION; POPULATION-STRUCTURE; MODEL APPROACH;
D O I
10.1016/j.gpb.2021.08.005
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Genome-wide association study (GWAS) and genomic prediction/selection (GP/GS) are the two essential enterprises in genomic research. Due to the great magnitude and complexity of genomic and phenotypic data, analytical methods and their associated software packages are frequently advanced. GAPIT is a widely-used genomic association and prediction integrated tool as an R package. The first version was released to the public in 2012 with the implementation of the general linear model (GLM), mixed linear model (MLM), compressed MLM (CMLM), and genomic best linear unbiased prediction (gBLUP). The second version was released in 2016 with several new implementations, including enriched CMLM (ECMLM) and settlement of MLMs under progressively exclusive relationship (SUPER). All the GWAS methods are based on the single-locus test. For the first time, in the current release of GAPIT, version 3 implemented three multi-locus test methods, including multiple loci mixed model (MLMM), fixed and random model circulating probability unification (FarmCPU), and Bayesian-information and linkage-disequilibrium iteratively nested keyway (BLINK). Ad-ditionally, two GP/GS methods were implemented based on CMLM (named compressed BLUP; cBLUP) and SUPER (named SUPER BLUP; sBLUP). These new implementations not only boost statistical power for GWAS and prediction accuracy for GP/GS, but also improve computing speed and increase the capacity to analyze big genomic data. Here, we document the current upgrade of GAPIT by describing the selection of the recently developed methods, their im-plementations, and potential impact. All documents, including source code, user manual, demo data, and tutorials, are freely available at the GAPIT website (http://zzlab.net/GAPIT).
引用
收藏
页码:629 / 640
页数:12
相关论文
共 35 条
  • [1] GenABEL: an R library for genome-wide association analysis
    Aulchenko, Yurii S.
    Ripke, Stephan
    Isaacs, Aaron
    Van Duijn, Cornelia M.
    [J]. BIOINFORMATICS, 2007, 23 (10) : 1294 - 1296
  • [3] TASSEL: software for association mapping of complex traits in diverse samples
    Bradbury, Peter J.
    Zhang, Zhiwu
    Kroon, Dallas E.
    Casstevens, Terry M.
    Ramdoss, Yogesh
    Buckler, Edward S.
    [J]. BIOINFORMATICS, 2007, 23 (19) : 2633 - 2635
  • [4] The Genetic Architecture of Maize Flowering Time
    Buckler, Edward S.
    Holland, James B.
    Bradbury, Peter J.
    Acharya, Charlotte B.
    Brown, Patrick J.
    Browne, Chris
    Ersoz, Elhan
    Flint-Garcia, Sherry
    Garcia, Arturo
    Glaubitz, Jeffrey C.
    Goodman, Major M.
    Harjes, Carlos
    Guill, Kate
    Kroon, Dallas E.
    Larsson, Sara
    Lepak, Nicholas K.
    Li, Huihui
    Mitchell, Sharon E.
    Pressoir, Gael
    Peiffer, Jason A.
    Rosas, Marco Oropeza
    Rocheford, Torbert R.
    Cinta Romay, M.
    Romero, Susan
    Salvo, Stella
    Sanchez Villeda, Hector
    da Silva, H. Sofia
    Sun, Qi
    Tian, Feng
    Upadyayula, Narasimham
    Ware, Doreen
    Yates, Heather
    Yu, Jianming
    Zhang, Zhiwu
    Kresovich, Stephen
    McMullen, Michael D.
    [J]. SCIENCE, 2009, 325 (5941) : 714 - 718
  • [5] iPat: intelligent prediction and association tool for genomic research
    Chen, Chunpeng James
    Zhang, Zhiwu
    [J]. BIOINFORMATICS, 2018, 34 (11) : 1925 - 1927
  • [6] Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP
    Endelman, Jeffrey B.
    [J]. PLANT GENOME, 2011, 4 (03): : 250 - 255
  • [7] Extension of the bayesian alphabet for genomic selection
    Habier, David
    Fernando, Rohan L.
    Kizilkaya, Kadir
    Garrick, Dorian J.
    [J]. BMC BIOINFORMATICS, 2011, 12
  • [8] BLINK: a package for the next level of genome-wide association studies with both individuals and markers in the millions
    Huang, Meng
    Liu, Xiaolei
    Zhou, Yao
    Summers, Ryan M.
    Zhang, Zhiwu
    [J]. GIGASCIENCE, 2019, 8 (02):
  • [9] Efficient control of population structure in model organism association mapping
    Kang, Hyun Min
    Zaitlen, Noah A.
    Wade, Claire M.
    Kirby, Andrew
    Heckerman, David
    Daly, Mark J.
    Eskin, Eleazar
    [J]. GENETICS, 2008, 178 (03) : 1709 - 1723
  • [10] Variance component model to account for sample structure in genome-wide association studies
    Kang, Hyun Min
    Sul, Jae Hoon
    Service, Susan K.
    Zaitlen, Noah A.
    Kong, Sit-yee
    Freimer, Nelson B.
    Sabatti, Chiara
    Eskin, Eleazar
    [J]. NATURE GENETICS, 2010, 42 (04) : 348 - U110