Chromosome-Level Genome Assembly of the Rare and Endangered Tropical Plant Speranskia yunnanensis (Euphorbiaceae)

被引:2
作者
Guofang, Yuan [1 ]
Shufang, Tan [1 ]
Dandan, Wang [2 ,3 ]
Yongzhi, Yang [2 ,3 ]
Bin, Tian [1 ,4 ]
机构
[1] Southwest Forestry Univ, Key Lab Forest Resources Conservat & Utilizat Sou, Minist Educ, Kunming, Yunnan, Peoples R China
[2] Lanzhou Univ, Inst Innovat Ecol, State Key Lab Grassland Agroecosyst, Lanzhou, Peoples R China
[3] Lanzhou Univ, Sch Life Sci, Lanzhou, Peoples R China
[4] Chinese Acad Sci, Kunming Inst Bot, CAS Key Lab Plant Divers & Biogeog, Kunming, Yunnan, Peoples R China
关键词
genome assembly; chromosome-level genome; phylogenetic relationships; evolution; Speranskia yunnanensis; GENERATION; PROVIDES; DATABASE; PROGRAM; SYSTEM; GENES;
D O I
10.3389/fgene.2021.755564
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Speranskia yunnanensis S. M. Hwang is an endangered shrub narrowly distributed in tropical regions, and its populations are gradually shrinking. We assembled and annotated the genome of S. yunnanensis at the chromosome level by combining Nanopore sequencing, Illumina HiSeq sequencing and Hi-C technology. The final genome assembly was similar to 417.65 Mb, with a contig N50 value of 12.52 Mb, and 408.62 Mb (97.84%) of which could be grouped into seven pseudochromosomes. Approximately 69.11% of the assembly was identified as repetitive elements, and 25,467 protein-coding genes were annotated. Based on the 1,517 single-copy orthologous genes, and 751 expanded and 1,645 contracted gene families among the 16,389 gene families in S. yunnanensis, a phylogenetic tree was further built. The high-quality, annotated, and chromosome-level genome of S. yunnanensis will present an important source of data for future research on the evolution of Euphorbiaceae genomes, and provide genomic resources toward studies on speciation, local adaptation, as well as conservation genomics of the ecologically important genus Speranskia.
引用
收藏
页数:6
相关论文
共 35 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[3]   Repbase Update, a database of repetitive elements in eukaryotic genomes [J].
Bao, Weidong ;
Kojima, Kenji K. ;
Kohany, Oleksiy .
MOBILE DNA, 2015, 6
[4]   Tandem repeats finder: a program to analyze DNA sequences [J].
Benson, G .
NUCLEIC ACIDS RESEARCH, 1999, 27 (02) :573-580
[5]   Trimmomatic: a flexible trimmer for Illumina sequence data [J].
Bolger, Anthony M. ;
Lohse, Marc ;
Usadel, Bjoern .
BIOINFORMATICS, 2014, 30 (15) :2114-2120
[6]   Finding the genes in genomic DNA [J].
Burge, CB ;
Karlin, S .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1998, 8 (03) :346-354
[7]  
Chen Nansheng, 2004, Curr Protoc Bioinformatics, VChapter 4, DOI 10.1002/0471250953.bi0410s05
[8]   fastp: an ultra-fast all-in-one FASTQ preprocessor [J].
Chen, Shifu ;
Zhou, Yanqing ;
Chen, Yaru ;
Gu, Jia .
BIOINFORMATICS, 2018, 34 (17) :884-890
[9]   De novo assembly of the Aedes aegypti genome using Hi-C yields chromosome-length scaffolds [J].
Dudchenko, Olga ;
Batra, Sanjit S. ;
Omer, Arina D. ;
Nyquist, Sarah K. ;
Hoeger, Marie ;
Durand, Neva C. ;
Shamim, Muhammad S. ;
Machol, Ido ;
Lander, Eric S. ;
Aiden, Aviva Presser ;
Aiden, Erez Lieberman .
SCIENCE, 2017, 356 (6333) :92-95
[10]   Juicebox Provides a Visualization System for Hi-C Contact Maps with Unlimited Zoom [J].
Durand, Neva C. ;
Robinson, James T. ;
Shamim, Muhammad S. ;
Machol, Ido ;
Mesirov, Jill P. ;
Lander, Eric S. ;
Aiden, Erez Lieberman .
CELL SYSTEMS, 2016, 3 (01) :99-101