Nucleotide polymorphism and copy number variant detection using exome capture and next-generation sequencing in the polyploid grass Panicum virgatum

被引:27
作者
Evans, Joseph [1 ,2 ]
Kim, Jeongwoon [1 ,2 ]
Childs, Kevin L. [1 ,2 ]
Vaillancourt, Brieanne [1 ,2 ]
Crisovan, Emily [1 ,2 ]
Nandety, Aruna [3 ,4 ]
Gerhardt, Daniel J. [5 ]
Richmond, Todd A. [5 ]
Jeddeloh, Jeffrey A. [5 ]
Kaeppler, Shawn M. [3 ,6 ]
Casler, Michael D. [3 ,4 ]
Buell, C. Robin [1 ,2 ]
机构
[1] Michigan State Univ, Great Lakes Bioenergy Res Ctr, Dept Energy, E Lansing, MI 48824 USA
[2] Michigan State Univ, Dept Plant Biol, E Lansing, MI 48824 USA
[3] Univ Wisconsin, Dept Energy, Great Lakes Bioenergy Res Ctr, Madison, WI 53706 USA
[4] Univ Wisconsin, USDA ARS, US Dairy Forage Res Ctr, Madison, WI 53706 USA
[5] Roche NimbleGen, Madison, WI 53719 USA
[6] Univ Wisconsin, Dept Agron, Madison, WI 53706 USA
关键词
exome; switchgrass; polyploidy; copy number variant; presence/absence variant; Panicum virgatum; GENOME; GENES; DIVERSITY; BARLEY; IDENTIFICATION; DUPLICATION; RESISTANCE; ALIGNMENT; LOCATION; RESOURCE;
D O I
10.1111/tpj.12601
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Switchgrass (Panicum virgatum) is a polyploid, outcrossing grass species native to North America and has recently been recognized as a potential biofuel feedstock crop. Significant phenotypic variation including ploidy is present across the two primary ecotypes of switchgrass, referred to as upland and lowland switchgrass. The tetraploid switchgrass genome is approximately 1400Mbp, split between two subgenomes, with significant repetitive sequence content limiting the efficiency of re-sequencing approaches for determining genome diversity. To characterize genetic diversity in upland and lowland switchgrass as a first step in linking genotype to phenotype, we designed an exome capture probe set based on transcript assemblies that represent approximately 50Mb of annotated switchgrass exome sequences. We then evaluated and optimized the probe set using solid phase comparative genome hybridization and liquid phase exome capture followed by next-generation sequencing. Using the optimized probe set, we assessed variation in the exomes of eight switchgrass genotypes representing tetraploid lowland and octoploid upland cultivars to benchmark our exome capture probe set design. We identified ample variation in the switchgrass genome including 1395501 single nucleotide polymorphisms (SNPs), 8173 putative copy number variants and 3336 presence/absence variants. While the majority of the SNPs (84%) detected was bi-allelic, a substantial number was tri-allelic with limited occurrence of tetra-allelic polymorphisms consistent with the heterozygous and polyploid nature of the switchgrass genome. Collectively, these data demonstrate the efficacy of exome capture for discovery of genome variation in a polyploid species with a large, repetitive and heterozygous genome.
引用
收藏
页码:993 / 1008
页数:16
相关论文
共 70 条
[1]  
[Anonymous], PLANT GENOME
[2]  
[Anonymous], BIOENERG RES
[3]  
[Anonymous], 2005, PHYLIP (phylogeny inference package) version 3.6
[4]   Whole exome capture in solution with 3 Gbp of data [J].
Bainbridge, Matthew N. ;
Wang, Min ;
Burgess, Daniel L. ;
Kovar, Christie ;
Rodesch, Matthew J. ;
D'Ascenzo, Mark ;
Kitzman, Jacob ;
Wu, Yuan-Qing ;
Newsham, Irene ;
Richmond, Todd A. ;
Jeddeloh, Jeffrey A. ;
Muzny, Donna ;
Albert, Thomas J. ;
Gibbs, Richard A. .
GENOME BIOLOGY, 2010, 11 (06)
[5]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[6]   Reference genome sequence of the model plant Setaria [J].
Bennetzen, Jeffrey L. ;
Schmutz, Jeremy ;
Wang, Hao ;
Percifield, Ryan ;
Hawkins, Jennifer ;
Pontaroli, Ana C. ;
Estep, Matt ;
Feng, Liang ;
Vaughn, Justin N. ;
Grimwood, Jane ;
Jenkins, Jerry ;
Barry, Kerrie ;
Lindquist, Erika ;
Hellsten, Uffe ;
Deshpande, Shweta ;
Wang, Xuewen ;
Wu, Xiaomei ;
Mitros, Therese ;
Triplett, Jimmy ;
Yang, Xiaohan ;
Ye, Chu-Yu ;
Mauro-Herrera, Margarita ;
Wang, Lin ;
Li, Pinghua ;
Sharma, Manoj ;
Sharma, Rita ;
Ronald, Pamela C. ;
Panaud, Olivier ;
Kellogg, Elizabeth A. ;
Brutnell, Thomas P. ;
Doust, Andrew N. ;
Tuskan, Gerald A. ;
Rokhsar, Daniel ;
Devos, Katrien M. .
NATURE BIOTECHNOLOGY, 2012, 30 (06) :555-+
[7]   Phenotypic and Genomic Analyses of a Fast Neutron Mutant Population Resource in Soybean [J].
Bolon, Yung-Tsi ;
Haun, William J. ;
Xu, Wayne W. ;
Grant, David ;
Stacey, Minviluz G. ;
Nelson, Rex T. ;
Gerhardt, Daniel J. ;
Jeddeloh, Jeffrey A. ;
Stacey, Gary ;
Muehlbauer, Gary J. ;
Orf, James H. ;
Naeve, Seth L. ;
Stupar, Robert M. ;
Vance, Carroll P. .
PLANT PHYSIOLOGY, 2011, 156 (01) :240-253
[8]  
Casler MD., 2012, Handbook of bioenergy crop plants, P563, DOI DOI 10.1201/B11711-25
[9]   The Switchgrass Genome: Tools and Strategies [J].
Casler, Michael D. ;
Tobias, Christian M. ;
Kaeppler, Shawn M. ;
Buell, C. Robin ;
Wang, Zeng-Yu ;
Cao, Peijian ;
Schmutz, Jeremy ;
Ronald, Pamela .
PLANT GENOME, 2011, 4 (03) :273-282
[10]   α-synuclein locus duplication as a cause of familial Parkinson's disease [J].
Chartier-Harlin, MC ;
Kachergus, J ;
Roumier, C ;
Mouroux, V ;
Douay, X ;
Lincoln, S ;
Levecque, C ;
Larvor, L ;
Andrieux, J ;
Hulihan, M ;
Waucquier, N ;
Defebvre, L ;
Amouyel, P ;
Farrer, M ;
Destée, A .
LANCET, 2004, 364 (9440) :1167-1169