Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)

被引:29
作者
Andreassen, Rune [1 ,2 ,3 ]
Lunner, Sigbjorn [1 ]
Hoyheim, Bjorn [1 ,2 ]
机构
[1] Norwegian Sch Vet Sci, BasAM Genet, NO-0033 Oslo, Norway
[2] CIGENE Ctr Integrat Genet, As, Norway
[3] Oslo Univ Coll, Fac Hlth Sci, Oslo, Norway
来源
BMC GENOMICS | 2009年 / 10卷
关键词
MESSENGER-RNAS; LINKAGE MAP; POLYADENYLATION SIGNAL; DUPLICATED LOCI; RAINBOW-TROUT; GENES; TOOL; SNP; IDENTIFICATION; ANNOTATION;
D O I
10.1186/1471-2164-10-502
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to be helpful in order to distinguish between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full-length of the cDNA insert has been determined has been small. Results: High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion: This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This suggests that the remaining cDNA libraries generated by SGP represent a valuable cCDS FLIc source. The conservation of 7-mers in 3'UTRs indicates that these motifs are functionally important. Identity between some of these 7-mers and miRNA target sequences suggests that they are miRNA targets in Salmo salar transcripts as well.
引用
收藏
页数:11
相关论文
共 49 条
  • [1] Annotated Expressed Sequence Tags (ESTs) from pre-smolt Atlantic salmon (Salmo salar) in a searchable data resource
    Adzhubei, Alexei A.
    Vlasova, Anna V.
    Hagen-Larsen, Heidi
    Ruden, Torgeir A.
    Laerdahl, Jon K.
    Hoyheim, Bjorn
    [J]. BMC GENOMICS, 2007, 8
  • [2] Allendorf FW, 1997, GENETICS, V145, P1083
  • [3] [Anonymous], TETRAPLOIDY EVOLUTIO
  • [4] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [5] GENE DUPLICATION IN TETRAPLOID FISH - MODEL FOR GENE SILENCING AT UNLINKED DUPLICATED LOCI
    BAILEY, GS
    POULTER, RTM
    STOCKWELL, PA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1978, 75 (11) : 5575 - 5579
  • [6] Patterns of variant polyadenylation signal usage in human genes
    Beaudoing, E
    Freier, S
    Wyatt, JR
    Claverie, JM
    Gautheret, D
    [J]. GENOME RESEARCH, 2000, 10 (07) : 1001 - 1010
  • [7] Principles of MicroRNA-target recognition
    Brennecke, J
    Stark, A
    Russell, RB
    Cohen, SM
    [J]. PLOS BIOLOGY, 2005, 3 (03): : 404 - 418
  • [8] Whole genome sequence comparisons and "full-length" cDNA sequences:: A combined approach to evaluate and improve Arabidopsis genome annotation
    Castelli, V
    Aury, JM
    Jaillon, O
    Wincker, P
    Clepet, C
    Menard, M
    Cruaud, C
    Quétier, F
    Scarpelli, C
    Schächter, V
    Temple, G
    Caboche, M
    Weissenbach, J
    Salanoubat, M
    [J]. GENOME RESEARCH, 2004, 14 (03) : 406 - 413
  • [9] Blast2GO:: a universal tool for annotation, visualization and analysis in functional genomics research
    Conesa, A
    Götz, S
    García-Gómez, JM
    Terol, J
    Talón, M
    Robles, M
    [J]. BIOINFORMATICS, 2005, 21 (18) : 3674 - 3676
  • [10] WebLogo: A sequence logo generator
    Crooks, GE
    Hon, G
    Chandonia, JM
    Brenner, SE
    [J]. GENOME RESEARCH, 2004, 14 (06) : 1188 - 1190