Representation and high-quality annotation of the Physcomitrella patens transcriptome demonstrates a high proportion of proteins involved in metabolism in mosses

被引:67
作者
Lang, D
Eisinger, J
Reski, R
Rensing, SA
机构
[1] Univ Freiburg, Fac Biol, D-79104 Freiburg, Germany
[2] Univ Freiburg, Chair Comp Architecture, Fac Sci Appl, D-79110 Freiburg, Germany
关键词
Physcomitrella patens; moss; transcriptome; annotation; gene ontology;
D O I
10.1055/s-2005-837578
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
To gain insight into the transcriptome of the well-used plant model system Physcomitrella patens, several EST sequencing projects have been undertaken. We have clustered, assembled, and annotated all publicly available EST and CDS sequences in order to represent the transcriptome of this non-seed plant. Here, we present our fully annotated knowledge resource for the Physcomitrella patens transcriptome, integrating annotation from the production process of the clustered sequences and from a high-quality annotation pipeline developed during this study. Each transcript is represented as an entity containing full annotations and GO term associations. The whole production, filtering, clustering, and annotation process is being modelled and results in seven datasets, representing the annotated Physcomitrella transcriptome from different perspectives. We were able to annotate 63.4% of the 26123 virtual transcripts. The transcript archetype, as covered by our clustered data, is compared to a compilation based on all available Physcomitrella full length CDS. The distribution of the gene ontology annotations (GOA) for the virtual transcriptome of Physcomitrella patens demonstrates consistency in the ratios of the core molecular functions among the plant GOA. However, the metabolism subcategory is over-represented in bryophytes as compared to seed plants. This observation can be taken as an indicator for the wealth of alternative metabolic pathways in moss in comparison to spermatophytes. All resources presented in this study have been made available to the scientific community through a suite of user-friendly web interfaces via www. cosmoss.org and form the basis for assembly and annotation of the moss genome, which will be sequenced in 2005.
引用
收藏
页码:238 / 250
页数:13
相关论文
共 60 条
[41]  
RENSING SA, 2005, IN PRESS BMC GENOMIC
[42]  
RENSING SA, 2002, TRANSCRIPTOME MOSS P
[43]  
RENSING SA, 2003, P GERM C BIOINF 2003, P117
[44]  
Reski R, 1998, BOT ACTA, V111, P1
[45]   Molecular genetics of Physcomitrella [J].
Reski, R .
PLANTA, 1999, 208 (03) :301-309
[46]   The Arabidopsis Information Resource (TAIR):: a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community [J].
Rhee, SY ;
Beavis, W ;
Berardini, TZ ;
Chen, GH ;
Dixon, D ;
Doyle, A ;
Garcia-Hernandez, M ;
Huala, E ;
Lander, G ;
Montoya, M ;
Miller, N ;
Mueller, LA ;
Mundodi, S ;
Reiser, L ;
Tacklind, J ;
Weems, DC ;
Wu, YH ;
Xu, I ;
Yoo, D ;
Yoon, J ;
Zhang, PF .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :224-228
[47]   Two RpoT genes of Physcomitrella patens encode phage-type RNA polymerases with dual targeting to mitochondria and plastids [J].
Richter, U ;
Kiessling, J ;
Hedtke, B ;
Decker, E ;
Reski, R ;
Börner, T ;
Weihe, A .
GENE, 2002, 290 (1-2) :95-105
[48]   Mapping of the Physcomitrella patens proteome [J].
Sarnighausen, E ;
Wurtz, V ;
Heintz, D ;
Van Dorsselaer, A ;
Reski, R .
PHYTOCHEMISTRY, 2004, 65 (11) :1589-1607
[49]  
Schuler GD, 1996, METHOD ENZYMOL, V266, P141
[50]  
SCHWEEN G, 2005, IN PRESS PLANT BIOL