An Improved Genotyping by Sequencing (GBS) Approach Offering Increased Versatility and Efficiency of SNP Discovery and Genotyping

被引:345
作者
Sonah, Humira [1 ,2 ]
Bastien, Maxime [1 ,2 ]
Iquira, Elmer [1 ,2 ]
Tardivel, Aurelie [1 ,2 ]
Legare, Gaetan [2 ]
Boyle, Brian [2 ]
Normandeau, Eric [2 ,3 ]
Laroche, Jerome [2 ]
Larose, Stephane [2 ]
Jean, Martine [1 ,2 ]
Belzile, Francois [1 ,2 ]
机构
[1] Univ Laval, Dept Phytol, Quebec City, PQ G1K 7P4, Canada
[2] Univ Laval, Inst Biol Integrat & Syst, Quebec City, PQ, Canada
[3] Univ Laval, Dept Biol, Quebec City, PQ G1K 7P4, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
SINGLE-NUCLEOTIDE POLYMORPHISMS; GENOME;
D O I
10.1371/journal.pone.0054603
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Highly parallel SNP genotyping platforms have been developed for some important crop species, but these platforms typically carry a high cost per sample for first-time or small-scale users. In contrast, recently developed genotyping by sequencing (GBS) approaches offer a highly cost effective alternative for simultaneous SNP discovery and genotyping. In the present investigation, we have explored the use of GBS in soybean. In addition to developing a novel analysis pipeline to call SNPs and indels from the resulting sequence reads, we have devised a modified library preparation protocol to alter the degree of complexity reduction. We used a set of eight diverse soybean genotypes to conduct a pilot scale test of the protocol and pipeline. Using ApeKI for GBS library preparation and sequencing on an Illumina GAIIx machine, we obtained 5.5 M reads and these were processed using our pipeline. A total of 10,120 high quality SNPs were obtained and the distribution of these SNPs mirrored closely the distribution of gene-rich regions in the soybean genome. A total of 39.5% of the SNPs were present in genic regions and 52.5% of these were located in the coding sequence. Validation of over 400 genotypes at a set of randomly selected SNPs using Sanger sequencing showed a 98% success rate. We then explored the use of selective primers to achieve a greater complexity reduction during GBS library preparation. The number of SNP calls could be increased by almost 40% and their depth of coverage was more than doubled, thus opening the door to an increase in the throughput and a significant decrease in the per sample cost. The approach to obtain high quality SNPs developed here will be helpful for marker assisted genomics as well as assessment of available genetic resources for effective utilisation in a wide number of species.
引用
收藏
页数:9
相关论文
共 20 条
[1]   Advances in molecular marker techniques and their applications in plant sciences [J].
Agarwal, Milee ;
Shrivastava, Neeta ;
Padh, Harish .
PLANT CELL REPORTS, 2008, 27 (04) :617-631
[2]   A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3 [J].
Cingolani, Pablo ;
Platts, Adrian ;
Wang, Le Lily ;
Coon, Melissa ;
Tung Nguyen ;
Wang, Luan ;
Land, Susan J. ;
Lu, Xiangyi ;
Ruden, Douglas M. .
FLY, 2012, 6 (02) :80-92
[3]   Genome-wide genetic marker discovery and genotyping using next-generation sequencing [J].
Davey, John W. ;
Hohenlohe, Paul A. ;
Etter, Paul D. ;
Boone, Jason Q. ;
Catchen, Julian M. ;
Blaxter, Mark L. .
NATURE REVIEWS GENETICS, 2011, 12 (07) :499-510
[4]   A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species [J].
Elshire, Robert J. ;
Glaubitz, Jeffrey C. ;
Sun, Qi ;
Poland, Jesse A. ;
Kawamoto, Ken ;
Buckler, Edward S. ;
Mitchell, Sharon E. .
PLOS ONE, 2011, 6 (05)
[5]   The Composition and Origins of Genomic Variation among Individuals of the Soybean Reference Cultivar Williams 82 [J].
Haun, William J. ;
Hyten, David L. ;
Xu, Wayne W. ;
Gerhardt, Daniel J. ;
Albert, Thomas J. ;
Richmond, Todd ;
Jeddeloh, Jeffrey A. ;
Jia, Gaofeng ;
Springer, Nathan M. ;
Vance, Carroll P. ;
Stupar, Robert M. .
PLANT PHYSIOLOGY, 2011, 155 (02) :645-655
[6]   High-throughput genotyping with the GoldenGate assay in the complex genome of soybean [J].
Hyten, David L. ;
Song, Qijian ;
Choi, Ik-Young ;
Yoon, Mun-Sup ;
Specht, James E. ;
Matukumalli, Lakshmi K. ;
Nelson, Randall L. ;
Shoemaker, Randy C. ;
Young, Nevin D. ;
Cregan, Perry B. .
THEORETICAL AND APPLIED GENETICS, 2008, 116 (07) :945-952
[7]   High-throughput SNP discovery through deep resequencing of a reduced representation library to anchor and orient scaffolds in the soybean whole genome sequence [J].
Hyten, David L. ;
Cannon, Steven B. ;
Song, Qijian ;
Weeks, Nathan ;
Fickus, Edward W. ;
Shoemaker, Randy C. ;
Specht, James E. ;
Farmer, Andrew D. ;
May, Gregory D. ;
Cregan, Perry B. .
BMC GENOMICS, 2010, 11
[8]   Resequencing of 31 wild and cultivated soybean genomes identifies patterns of genetic diversity and selection [J].
Lam, Hon-Ming ;
Xu, Xun ;
Liu, Xin ;
Chen, Wenbin ;
Yang, Guohua ;
Wong, Fuk-Ling ;
Li, Man-Wah ;
He, Weiming ;
Qin, Nan ;
Wang, Bo ;
Li, Jun ;
Jian, Min ;
Wang, Jian ;
Shao, Guihua ;
Wang, Jun ;
Sun, Samuel Sai-Ming ;
Zhang, Gengyun .
NATURE GENETICS, 2010, 42 (12) :1053-U41
[9]   A physical, genetic and functional sequence assembly of the barley genome [J].
Mayer, Klaus F. X. ;
Waugh, Robbie ;
Langridge, Peter ;
Close, Timothy J. ;
Wise, Roger P. ;
Graner, Andreas ;
Matsumoto, Takashi ;
Sato, Kazuhiro ;
Schulman, Alan ;
Muehlbauer, Gary J. ;
Stein, Nils ;
Ariyadasa, Ruvini ;
Schulte, Daniela ;
Poursarebani, Naser ;
Zhou, Ruonan ;
Steuernagel, Burkhard ;
Mascher, Martin ;
Scholz, Uwe ;
Shi, Bujun ;
Langridge, Peter ;
Madishetty, Kavitha ;
Svensson, Jan T. ;
Bhat, Prasanna ;
Moscou, Matthew ;
Resnik, Josh ;
Close, Timothy J. ;
Muehlbauer, Gary J. ;
Hedley, Pete ;
Liu, Hui ;
Morris, Jenny ;
Waugh, Robbie ;
Frenkel, Zeev ;
Korol, Avraham ;
Berges, Helene ;
Graner, Andreas ;
Stein, Nils ;
Steuernagel, Burkhard ;
Taudien, Stefan ;
Groth, Marco ;
Felder, Marius ;
Lonardi, Stefano ;
Duma, Denisa ;
Alpert, Matthew ;
Cordero, Francesa ;
Beccuti, Marco ;
Ciardo, Gianfranco ;
Ma, Yaqin ;
Wanamaker, Steve ;
Stein, Nils ;
Close, Timothy J. .
NATURE, 2012, 491 (7426) :711-+
[10]   Sequencing technologies and genome sequencing [J].
Pareek, Chandra Shekhar ;
Smoczynski, Rafal ;
Tretyn, Andrzej .
JOURNAL OF APPLIED GENETICS, 2011, 52 (04) :413-435