Improved Reference Genome for Cyclotella cryptica CCMP332, a Model for Cell Wall Morphogenesis, Salinity Adaptation, and Lipid Production in Diatoms (Bacillariophyta)

Times cited: 14
Authors
Roberts, Wade R. [1]
Downey, Kala M. [1]
Ruck, Elizabeth C. [1]
Traller, Jesse C. [2]
Alverson, Andrew J. [1]
Affiliations
[1] Univ Arkansas, Dept Biol Sci, SCEN 601, Fayetteville, AR 72701 USA
[2] Global Algae Innovat, San Diego, CA USA
Funding
National Science Foundation (USA)
Keywords
algal biofuels; horizontal gene transfer; lipids; nanopore; transposable elements; DE-NOVO IDENTIFICATION; ANNOTATION; EVOLUTION; FAMILIES; EFFICIENT; ALIGNMENT; DYNAMICS; GROWTH; MAKER; SIZE
DOI
10.1534/g3.120.401408
Chinese Library Classification (CLC) number
Q3 [Genetics]
Discipline classification codes
071007; 090102
Abstract
The diatom, Cyclotella cryptica, is a well-established model species for physiological studies and biotechnology applications of diatoms. To further facilitate its use as a model diatom, we report an improved reference genome assembly and annotation for C. cryptica strain CCMP332. We used a combination of long- and short-read sequencing to assemble a high-quality and contaminant-free genome. The genome is 171 Mb in size and consists of 662 scaffolds with a scaffold N50 of 494 kb. This represents a 176-fold decrease in scaffold number and 41-fold increase in scaffold N50 compared to the previous assembly. The genome contains 21,250 predicted genes, 75% of which were assigned putative functions. Repetitive DNA comprises 59% of the genome, and an improved classification of repetitive elements indicated that a historically steady accumulation of transposable elements has contributed to the relatively large size of the C. cryptica genome. The high-quality C. cryptica genome will serve as a valuable reference for ecological, genetic, and biotechnology studies of diatoms.
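The abstract summarizes assembly contiguity with the scaffold N50 statistic. As a minimal sketch of how that statistic is commonly defined (the largest length L such that scaffolds of length L or longer contain at least half of the assembled bases), the Python function below computes N50 from a list of scaffold lengths. The function name `n50` and the toy lengths are illustrative assumptions; they are not taken from the paper or from the authors' pipeline.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold lengths.

    Hypothetical helper for illustration only: walks the scaffolds from
    longest to shortest and returns the length at which the running sum
    first reaches at least half of the total assembly size.
    """
    if not lengths:
        raise ValueError("at least one scaffold length is required")
    total = sum(lengths)
    running = 0
    for length in sorted(lengths, reverse=True):
        running += length
        # Compare 2 * running with total to avoid fractional halves.
        if 2 * running >= total:
            return length


# Toy scaffold lengths (arbitrary units), not real C. cryptica data.
toy_lengths = [80, 70, 50, 40, 30, 20]
toy_n50 = n50(toy_lengths)
print(toy_n50)
```

In the toy example the total is 290, so the running sum passes the halfway mark on the second-longest scaffold. A higher N50 at a similar total size means the same bases sit in fewer, longer scaffolds, which is the sense in which the abstract pairs the increase in N50 with the decrease in scaffold number.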
Pages: 2965-2974
Page count: 10