Chloroplast Genomes of Two Species of Cypripedium: Expanded Genome Size and Proliferation of AT-Biased Repeat Sequences

Cited by: 45
Authors
Guo, Yan-Yan [1]
Yang, Jia-Xing [1]
Li, Hong-Kun [1]
Zhao, Hu-Sheng [1]
Affiliations
[1] Henan Agr Univ, Coll Plant Protect, Zhengzhou, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
plastome expansion; repeat sequence; hybrid assembly; AT-biased base composition; long-read sequencing; palindromic repeat; inversion; PLASTID GENOME; MOLECULAR PHYLOGENY; INVERTED REPEAT; EVOLUTION; ORCHIDACEAE; REARRANGEMENT; DIVERGENT; NUCLEAR;
DOI
10.3389/fpls.2021.609729
Chinese Library Classification number
Q94 [Botany]
Discipline classification code
071001
Abstract
The size of the chloroplast genome (plastome) of autotrophic angiosperms is generally conserved. However, the chloroplast genomes of some lineages are greatly expanded, which can make assembling them from short-read sequencing data more challenging. Here, we present the sequencing, assembly, and annotation of the chloroplast genomes of Cypripedium tibeticum and Cypripedium subtropicum. We assembled the chloroplast genomes of the two species de novo from a combination of short-read Illumina data and long-read PacBio data. The plastomes of the two species are characterized by expanded genome size, proliferated AT-rich repeat sequences, low GC content and gene density, and low substitution rates in the coding genes. The plastomes of C. tibeticum (197,815 bp) and C. subtropicum (212,668 bp) are substantially larger than those of the three species sequenced in previous studies. The plastome of C. subtropicum is the longest reported in Orchidaceae to date. Despite the increase in genome size, the gene order and gene number of the plastomes are conserved, with the exception of an approximately 75 kb inversion in the large single copy (LSC) region shared by the two species. The most striking feature is the record-setting low GC content of C. subtropicum (28.2%). Moreover, the plastome expansion of the two species is strongly correlated with the proliferation of AT-biased non-coding regions: the non-coding content of C. subtropicum exceeds 57%. The genus provides a typical example of plastome expansion induced by the expansion of non-coding regions. Considering the pros and cons of different sequencing technologies, we recommend hybrid assembly combining long and short reads for sequencing plastomes with AT-biased base composition.
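As a rough illustration of the quantities quoted in the abstract, the sketch below shows one common way to compute GC content and the size difference between the two plastomes. The function name `gc_content` and the toy sequence are hypothetical and are not taken from the paper; only the two genome lengths come from the abstract.

```python
def gc_content(seq: str) -> float:
    """Return the GC fraction of a DNA sequence.

    Assumption: symbols other than A, C, G, T (e.g. N) are ignored.
    """
    seq = seq.upper()
    gc = seq.count("G") + seq.count("C")
    at = seq.count("A") + seq.count("T")
    if gc + at == 0:
        raise ValueError("sequence contains no A, C, G, or T bases")
    return gc / (gc + at)


# Plastome lengths reported in the abstract (bp).
C_TIBETICUM_BP = 197_815
C_SUBTROPICUM_BP = 212_668

# How much longer the C. subtropicum plastome is than that of C. tibeticum.
size_difference_bp = C_SUBTROPICUM_BP - C_TIBETICUM_BP

# Made-up AT-rich toy sequence, for illustration only.
toy_sequence = "ATATTAATGCATTATAAT"

if __name__ == "__main__":
    print(f"size difference: {size_difference_bp} bp")
    print(f"toy GC content: {gc_content(toy_sequence):.3f}")
```

Applied to a full plastome sequence, the same function would return a value that can be compared with the 28.2% reported for C. subtropicum.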
Pages: 12