A two-tier bioinformatic pipeline to develop probes for target capture of nuclear loci with applications in Melastomataceae

Times cited: 24
Authors
Jantzen, Johanna R. [1, 2]
Amarasinghe, Prabha [1, 2]
Folk, Ryan A. [3]
Reginato, Marcelo [4, 5]
Michelangeli, Fabian A. [4]
Soltis, Douglas E. [1, 2]
Cellinese, Nico [1, 2]
Soltis, Pamela S. [2]
Affiliations
[1] Univ Florida, Dept Biol, Gainesville, FL 32611 USA
[2] Univ Florida, Florida Museum Nat Hist, Gainesville, FL 32611 USA
[3] Mississippi State Univ, Dept Biol Sci, Starkville, MS 39762 USA
[4] New York Bot Garden, Inst Systemat Bot, Bronx, NY 10458 USA
[5] Univ Fed Rio Grande do Sul, BR-90090060 Porto Alegre, RS, Brazil
Source
APPLICATIONS IN PLANT SCIENCES | 2020 / Vol. 8 / Issue 05
Funding
Natural Sciences and Engineering Research Council of Canada; US National Science Foundation
Keywords
HybPiper; MarkerMiner; Memecylon; phylogenomics; target capture; Tibouchina; ARABIDOPSIS INFORMATION RESOURCE; TRIBE MICONIEAE MELASTOMATACEAE; HISTORICAL BIOGEOGRAPHY; PHYLOGENY; ALIGNMENT; CLASSIFICATION; MELASTOMEAE; WORLD; GENE;
DOI
10.1002/aps3.11345
Chinese Library Classification number
Q94 [Botany]
Subject classification code
071001
Abstract
Premise: Putatively single-copy nuclear (SCN) loci, which are identified using genomic resources of closely related species, are ideal for phylogenomic inference. However, suitable genomic resources are not available for many clades, including Melastomataceae. We introduce a versatile approach to identify SCN loci for clades with few genomic resources and use it to develop probes for target enrichment in the distantly related Memecylon and Tibouchina (Melastomataceae).
Methods: We present a two-tiered pipeline. First, we identified putatively SCN loci using MarkerMiner and transcriptomes from distantly related species in Melastomataceae. Published loci and genes of functional significance were then added (384 total loci). Second, using HybPiper, we retrieved 689 homologous template sequences for these loci using genome-skimming data from within the focal clades.
Results: We sequenced 193 loci common to Memecylon and Tibouchina. Probes designed from 56 template sequences successfully targeted sequences in both clades. Probes designed from genome-skimming data within a focal clade were more successful than probes designed from other sources.
Discussion: Our pipeline successfully identified and targeted SCN loci in Memecylon and Tibouchina, enabling phylogenomic studies in both clades and potentially across Melastomataceae. This pipeline could be easily applied to other clades with few genomic resources.
Pages: 14