Design and evaluation of a sequence capture system for genome-wide SNP genotyping in highly heterozygous plant genomes: a case study with a keystone Neotropical hardwood tree genome

被引:7
作者
Silva-Junior, Orzenil Bonfim [1 ,2 ]
Grattapaglia, Dario [1 ,2 ]
Novaes, Evandro [3 ]
Collevatti, Rosane G. [4 ]
机构
[1] EPqB, EMBRAPA Recursos Genet & Biotecnol, BR-70770910 Brasilia, DF, Brazil
[2] Univ Catolica Brasilia, Programa Ciencias Genom & Biotecnol, SGAN 916 Modulo B, BR-70790160 Brasilia, DF, Brazil
[3] Univ Fed Lavras, Dept Biol, BR-37200000 Lavras, MG, Brazil
[4] Univ Fed Goias, Inst Ciencias Biol, Lab Genet & Biodiversidade, BR-74001970 Goiania, Go, Brazil
关键词
target enrichment; sequence capture; SNPs; Handroanthus; SHORT READ ALIGNMENT; POLYMORPHISM; DIVERSITY; POPULATIONS; ENRICHMENT; RESOURCES; DISCOVERY; SELECTION; RAD; MAP;
D O I
10.1093/dnares/dsy023
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Targeted sequence capture coupled to high-throughput sequencing has become a powerful method for the study of genome-wide sequence variation. Following our recent development of a genome assembly for the Pink Ipe tree (Handroanthus impetiginosus), a widely distributed Neotropical timber species, we now report the development of a set of 24,751 capture probes for single-nucleotide polymorphisms (SNPs) characterization and genotyping across 18,216 distinct loci, sampling more than 10 Mbp of the species genome. This system identifies nearly 200,000 SNPs located inside or in close proximity to almost 14,000 annotated protein-coding genes, generating quality genotypic data in populations spanning wide geographic distances across the species native range. To provide recommendations for future developments of similar systems for highly heterozygous plant genomes we investigated issues such as probe design, sequencing coverage and bioinformatics, including the evaluation of the capture efficiency and a reassessment of the technical reproducibility of the assay for SNPs recall and genotyping precision. Our results highlight the value of a detailed probe screening on a preliminary genome assembly to produce reliable data for downstream genetic studies. This work should inspire and assist the development of similar genomic resources for other orphan crops and forest trees with highly heterozygous genomes.
引用
收藏
页码:535 / 545
页数:11
相关论文
共 58 条
[1]   High-Resolution Linkage Map and Chromosome-Scale Genome Assembly for Cassava (Manihot esculenta Crantz) from 10 Populations [J].
Alaba, Oluwafemi A. ;
Bredeson, Jessen V. ;
Egesi, Chiedozie N. ;
Esuma, Williams ;
Ezenwaka, Lydia ;
Ferguson, Morag E. ;
Ha, Cindy M. ;
Hall, Megan ;
Herselman, Liezel ;
Ikpan, Andrew ;
Kafiriti, Elly ;
Kanju, Edward ;
Kapinga, Fortunus ;
Karugu, Arthur ;
Kawuki, Robert ;
Kimata, Bernadetha ;
Kimurto, Paul ;
Kulakow, Peter ;
Kulembeka, Heneriko ;
Kusolwa, Paul ;
Lyons, Jessica B. ;
Masumba, Esther ;
van de Merwe, Albe ;
Mkamilo, Geoffrey ;
Myburg, Alexander A. ;
Nwaogu, Ahamefule ;
Nzuki, Inosters ;
Olasanmi, Bunmi ;
Okogbenin, Emmanuel ;
Onyegbule, Onyeyirichi ;
Owuoche, James ;
Pariyo, Anthony ;
Prochnik, Simon E. ;
Rabbi, Ismail Y. ;
Rokhsar, Daniel S. ;
Rounsley, Steve ;
Salum, Kasele ;
Shuaibu, Kahya S. ;
Sichalwe, Caroline ;
Stephen, Mary .
G3-GENES GENOMES GENETICS, 2015, 5 (01) :133-144
[2]   RADseq underestimates diversity and introduces genealogical biases due to nonrandom haplotype sampling [J].
Arnold, B. ;
Corbett-Detig, R. B. ;
Hartl, D. ;
Bomblies, K. .
MOLECULAR ECOLOGY, 2013, 22 (11) :3179-3190
[3]   Rapid SNP Discovery and Genetic Mapping Using Sequenced RAD Markers [J].
Baird, Nathan A. ;
Etter, Paul D. ;
Atwood, Tressa S. ;
Currey, Mark C. ;
Shiver, Anthony L. ;
Lewis, Zachary A. ;
Selker, Eric U. ;
Cresko, William A. ;
Johnson, Eric A. .
PLOS ONE, 2008, 3 (10)
[4]   A representation of a compressed de Bruijn graph for pan-genome analysis that enables search [J].
Beller, Timo ;
Ohlebusch, Enno .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2016, 11
[5]   Second-generation PLINK: rising to the challenge of larger and richer datasets [J].
Chang, Christopher C. ;
Chow, Carson C. ;
Tellier, Laurent C. A. M. ;
Vattikuti, Shashaank ;
Purcell, Shaun M. ;
Lee, James J. .
GIGASCIENCE, 2015, 4
[6]   A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3 [J].
Cingolani, Pablo ;
Platts, Adrian ;
Wang, Le Lily ;
Coon, Melissa ;
Tung Nguyen ;
Wang, Luan ;
Land, Susan J. ;
Lu, Xiangyi ;
Ruden, Douglas M. .
FLY, 2012, 6 (02) :80-92
[7]   A coupled phylogeographical and species distribution modelling approach recovers the demographical history of a Neotropical seasonally dry forest tree species [J].
Collevatti, Rosane G. ;
Terribile, Levi Carina ;
Lima-Ribeiro, Matheus S. ;
Nabout, Joao C. ;
Oliveira, Guilhermede ;
Rangel, Thiago F. ;
Rabelo, Suelen G. ;
Diniz-Filho, Jose A. F. .
MOLECULAR ECOLOGY, 2012, 21 (23) :5845-5863
[8]   The next generation of target capture technologies - large DNA fragment enrichment and sequencing determines regional genomic variation of high complexity [J].
Dapprich, Johannes ;
Ferriola, Deborah ;
Mackiewicz, Kate ;
Clark, Peter M. ;
Rappaport, Eric ;
D'Arcy, Monica ;
Sasson, Ariella ;
Gai, Xiaowu ;
Schug, Jonathan ;
Kaestner, Klaus H. ;
Monos, Dimitri .
BMC GENOMICS, 2016, 17
[9]   Development of Genetic Markers in Eucalyptus Species by Target Enrichment and Exome Sequencing [J].
Dasgupta, Modhumita Ghosh ;
Dharanishanthi, Veeramuthu ;
Agarwal, Ishangi ;
Krutovsky, Konstantin V. .
PLOS ONE, 2015, 10 (01)
[10]   Genome-wide genetic marker discovery and genotyping using next-generation sequencing [J].
Davey, John W. ;
Hohenlohe, Paul A. ;
Etter, Paul D. ;
Boone, Jason Q. ;
Catchen, Julian M. ;
Blaxter, Mark L. .
NATURE REVIEWS GENETICS, 2011, 12 (07) :499-510