GAPIT Version 2: An Enhanced Integrated Tool for Genomic Association and Prediction

被引:294
作者
Tang, You [1 ]
Liu, Xiaolei [2 ,3 ,4 ]
Wang, Jiabo [5 ,6 ]
Li, Meng [7 ]
Wang, Qishan [8 ]
Tian, Feng [9 ]
Su, Zhongbin [1 ]
Pan, Yuchun [8 ]
Liu, Di [6 ]
Lipka, Alexander E. [10 ]
Buckler, Edward S. [4 ,11 ]
Zhang, Zhiwu [5 ,12 ]
机构
[1] Northeast Agr Univ, Coll Elect & Informat, Harbin, Peoples R China
[2] Huazhong Agr Univ, Minist Educ, Key Lab Agr Anim Genet Breeding & Reprod, Wuhan, Peoples R China
[3] Huazhong Agr Univ, Coll Anim Sci & Technol, Wuhan, Peoples R China
[4] Cornell Univ, Inst Genom Divers, Ithaca, NY 14853 USA
[5] North East Agr Univ, Dept Anim Sci & Technol, Harbin, Peoples R China
[6] Heilongjiang Acad Agr Sci, Inst Anim Husb, Harbin, Peoples R China
[7] Nanjing Agr Univ, Coll Hort, Nanjing 210095, Jiangsu, Peoples R China
[8] Shanghai Jiao Tong Univ, Sch Agr & Biol, Shanghai, Peoples R China
[9] China Agr Univ, Natl Maize Improvement Ctr China, Beijing, Peoples R China
[10] Univ Illinois, Dept Crop Sci, Urbana, IL 61801 USA
[11] ARS, USDA, Ithaca, NY 14853 USA
[12] Washington State Univ, Dept Crop & Soil Sci, Pullman, WA 99164 USA
基金
美国国家科学基金会; 中国博士后科学基金; 中国国家自然科学基金;
关键词
LINEAR MIXED MODELS; WIDE ASSOCIATION; POPULATION-STRUCTURE; SELECTION;
D O I
10.3835/plantgenome2015.11.0120
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Most human diseases and agriculturally important traits are complex. Dissecting their genetic architecture requires continued development of innovative and powerful statistical methods. Corresponding advances in computing tools are critical to efficiently use these statistical innovations and to enhance and accelerate biomedical and agricultural research and applications. The genome association and prediction integrated tool (GAPIT) was first released in 2012 and became widely used for genome-wide association studies (GWAS) and genomic prediction. The GAPIT implemented computationally efficient statistical methods, including the compressed mixed linear model (CMLM) and genomic prediction by using genomic best linear unbiased prediction (gBLUP). New state-of-the-art statistical methods have now been implemented in a new, enhanced version of GAPIT. These methods include factored spectrally transformed linear mixed models (FaST-LMM), enriched CMLM (ECMLM), FaST-LMM-Select, and settlement of mixed linear models under progressively exclusive relationship (SUPER). The genomic prediction methods implemented in this new release of the GAPIT include gBLUP based on CMLM, ECMLM, and SUPER. Additionally, the GAPIT was updated to improve its existing output display features and to add new data display and evaluation functions, including new graphing options and capabilities, phenotype simulation, power analysis, and cross-validation. These enhancements make the GAPIT a valuable resource for determining appropriate experimental designs and performing GWAS and genomic prediction. The enhanced R-based GAPIT software package uses state-of-the-art methods to conduct GWAS and genomic prediction. The GAPIT also provides new functions for developing experimental designs and creating publication-ready tabular summaries and graphs to improve the efficiency and application of genomic research.
引用
收藏
页数:9
相关论文
共 27 条
[1]   Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines [J].
Atwell, Susanna ;
Huang, Yu S. ;
Vilhjalmsson, Bjarni J. ;
Willems, Glenda ;
Horton, Matthew ;
Li, Yan ;
Meng, Dazhe ;
Platt, Alexander ;
Tarone, Aaron M. ;
Hu, Tina T. ;
Jiang, Rong ;
Muliyati, N. Wayan ;
Zhang, Xu ;
Amer, Muhammad Ali ;
Baxter, Ivan ;
Brachi, Benjamin ;
Chory, Joanne ;
Dean, Caroline ;
Debieu, Marilyne ;
de Meaux, Juliette ;
Ecker, Joseph R. ;
Faure, Nathalie ;
Kniskern, Joel M. ;
Jones, Jonathan D. G. ;
Michael, Todd ;
Nemri, Adnane ;
Roux, Fabrice ;
Salt, David E. ;
Tang, Chunlao ;
Todesco, Marco ;
Traw, M. Brian ;
Weigel, Detlef ;
Marjoram, Paul ;
Borevitz, Justin O. ;
Bergelson, Joy ;
Nordborg, Magnus .
NATURE, 2010, 465 (7298) :627-631
[2]   TASSEL: software for association mapping of complex traits in diverse samples [J].
Bradbury, Peter J. ;
Zhang, Zhiwu ;
Kroon, Dallas E. ;
Casstevens, Terry M. ;
Ramdoss, Yogesh ;
Buckler, Edward S. .
BIOINFORMATICS, 2007, 23 (19) :2633-2635
[3]   Prioritizing GWAS Results: A Review of Statistical Methods and Recommendations for Their Application [J].
Cantor, Rita M. ;
Lange, Kenneth ;
Sinsheimer, Janet S. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (01) :6-22
[4]   Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP [J].
Endelman, Jeffrey B. .
PLANT GENOME, 2011, 4 (03) :250-255
[5]   Inference of population structure using multilocus genotype data: dominant markers and null alleles [J].
Falush, Daniel ;
Stephens, Matthew ;
Pritchard, Jonathan K. .
MOLECULAR ECOLOGY NOTES, 2007, 7 (04) :574-578
[6]   Towards sequence-based genomic selection of cattle [J].
Georges, Michel .
NATURE GENETICS, 2014, 46 (08) :807-809
[7]  
Groth Detlef, 2013, Methods Mol Biol, V930, P527, DOI 10.1007/978-1-62703-059-5_22
[8]   Efficient control of population structure in model organism association mapping [J].
Kang, Hyun Min ;
Zaitlen, Noah A. ;
Wade, Claire M. ;
Kirby, Andrew ;
Heckerman, David ;
Daly, Mark J. ;
Eskin, Eleazar .
GENETICS, 2008, 178 (03) :1709-1723
[9]   Variance component model to account for sample structure in genome-wide association studies [J].
Kang, Hyun Min ;
Sul, Jae Hoon ;
Service, Susan K. ;
Zaitlen, Noah A. ;
Kong, Sit-yee ;
Freimer, Nelson B. ;
Sabatti, Chiara ;
Eskin, Eleazar .
NATURE GENETICS, 2010, 42 (04) :348-U110
[10]   A mixed-model approach for genome-wide association studies of correlated traits in structured populations [J].
Korte, Arthur ;
Vilhjalmsson, Bjarni J. ;
Segura, Vincent ;
Platt, Alexander ;
Long, Quan ;
Nordborg, Magnus .
NATURE GENETICS, 2012, 44 (09) :1066-+