Deep sequencing for de novo construction of a marine fish (Sparus aurata) transcriptome database with a large coverage of protein-coding transcripts

被引:67
作者
Calduch-Giner, Josep A. [1 ]
Bermejo-Nogales, Azucena [1 ]
Benedito-Palos, Laura [1 ]
Estensoro, Itziar [2 ]
Ballester-Lozano, Gabriel [1 ]
Sitja-Bobadilla, Ariadna [2 ]
Perez-Sanchez, Jaume [1 ]
机构
[1] CSIC, Inst Aquaculture Torre Sal, Dept Marine Species Biol Culture & Pathol, Nutrigen & Fish Growth Endocrinol Grp, Castellon de La Plana, Spain
[2] CSIC, Inst Aquaculture Torre Sal, Dept Marine Species Biol Culture & Pathol, Fish Pathol Grp, Castellon de La Plana, Spain
关键词
Sparus aurata; Next-generation sequencing; De novo assembly; Transcriptome; Database; GILTHEAD SEA BREAM; TAGS ESTS; EXPRESSION; MICROARRAY; RNA; EXPOSURE; GENOMICS; GENES; BASS; L;
D O I
10.1186/1471-2164-14-178
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: The gilthead sea bream (Sparus aurata) is the main fish species cultured in the Mediterranean area and constitutes an interesting model of research. Nevertheless, transcriptomic and genomic data are still scarce for this highly valuable species. A transcriptome database was constructed by de novo assembly of gilthead sea bream sequences derived from public repositories of mRNA and collections of expressed sequence tags together with new high-quality reads from five cDNA 454 normalized libraries of skeletal muscle (1), intestine (1), head kidney (2) and blood (1). Results: Sequencing of the new 454 normalized libraries produced 2,945,914 high-quality reads and the de novo global assembly yielded 125,263 unique sequences with an average length of 727 nt. Blast analysis directed to protein and nucleotide databases annotated 63,880 sequences encoding for 21,384 gene descriptions, that were curated for redundancies and frameshifting at the homopolymer regions of open reading frames, and hosted at http://www.nutrigroup-iats.org/seabreamdb. Among the annotated gene descriptions, 16,177 were mapped in the Ingenuity Pathway Analysis (IPA) database, and 10,899 were eligible for functional analysis with a representation in 341 out of 372 IPA canonical pathways. The high representation of randomly selected stickleback transcripts by Blast search in the nucleotide gilthead sea bream database evidenced its high coverage of protein-coding transcripts. Conclusions: The newly assembled gilthead sea bream transcriptome represents a progress in genomic resources for this species, as it probably contains more than 75% of actively transcribed genes, constituting a valuable tool to assist studies on functional genomics and future genome projects.
引用
收藏
页数:11
相关论文
共 43 条
[1]   Annotated Expressed Sequence Tags (ESTs) from pre-smolt Atlantic salmon (Salmo salar) in a searchable data resource [J].
Adzhubei, Alexei A. ;
Vlasova, Anna V. ;
Hagen-Larsen, Heidi ;
Ruden, Torgeir A. ;
Laerdahl, Jon K. ;
Hoyheim, Bjorn .
BMC GENOMICS, 2007, 8
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
[Anonymous], CURR PROTOC BIOINFOR
[4]  
[Anonymous], FRONT GENET
[5]   Next-generation DNA sequencing techniques [J].
Ansorge, Wilhelm J. .
NEW BIOTECHNOLOGY, 2009, 25 (04) :195-203
[6]   Characteristics of 454 pyrosequencing data-enabling realistic simulation with flowsim [J].
Balzer, Susanne ;
Malde, Ketil ;
Lanzen, Anders ;
Sharma, Animesh ;
Jonassen, Inge .
BIOINFORMATICS, 2010, 26 (18) :i420-i425
[7]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[8]   Dietary vegetable oils do not alter the intestine transcriptome of gilthead sea bream (Sparus aurata), but modulate the transcriptomic response to infection with Enteromyxum leei [J].
Calduch-Giner, Josep A. ;
Sitja-Bobadilla, Ariadna ;
Davey, Grace C. ;
Cairns, Michael T. ;
Kaushik, Sadasivam ;
Perez-Sanchez, Jaume .
BMC GENOMICS, 2012, 13
[9]   Use of microarray technology to assess the time course of liver stress response after confinement exposure in gilthead sea bream (Sparus aurata L.) [J].
Calduch-Giner, Josep A. ;
Davey, Grace ;
Saera-Vila, Alfonso ;
Houeix, Benoit ;
Talbot, Anita ;
Prunet, Patrick ;
Cairns, Michael T. ;
Perez-Sanchez, Jaume .
BMC GENOMICS, 2010, 11
[10]   Using the miraEST assembler for reliable and automated mRNA transcript assembly and SNP detection in sequenced ESTs [J].
Chevreux, B ;
Pfisterer, T ;
Drescher, B ;
Driesel, AJ ;
Müller, WEG ;
Wetter, T ;
Suhai, S .
GENOME RESEARCH, 2004, 14 (06) :1147-1159