Collembase: a repository for springtail genomics and soil quality assessment

被引:45
作者
Timmermans, Martijn J. T. N. [1 ]
de Boer, Muriel E. [1 ]
Nota, Benjamin [1 ]
de Boer, Tjalf E. [1 ]
Marien, Janine [1 ]
Klein-Lankhorst, Rene M. [2 ]
van Straalen, Nico M. [1 ]
Roelofs, Dick [1 ]
机构
[1] Vrije Univ Amsterdam, Inst Ecol Sci, Dept Anim Ecol, NL-1081 HV Amsterdam, Netherlands
[2] PRI Greenomics, NL-6708 PB Wageningen, Netherlands
关键词
D O I
10.1186/1471-2164-8-341
中图分类号
Q81 [生物工程学(生物技术)]; Q93 [微生物学];
学科分类号
071005 ; 0836 ; 090102 ; 100705 ;
摘要
Background: Environmental quality assessment is traditionally based on responses of reproduction and survival of indicator organisms. For soil assessment the springtail Folsomia candida (Collembola) is an accepted standard test organism. We argue that environmental quality assessment using gene expression profiles of indicator organisms exposed to test substrates is more sensitive, more toxicant specific and significantly faster than current risk assessment methods. To apply this species as a genomic model for soil quality testing we conducted an EST sequencing project and developed an online database. Description: Collembase is a web-accessible database comprising springtail (F. candida) genomic data. Presently, the database contains information on 8686 ESTs that are assembled into 5952 unique gene objects. Of those gene objects similar to 40% showed homology to other protein sequences available in GenBank (blastx analysis; non-redundant (nr) database; expect-value < 10(-5)). Software was applied to infer protein sequences. The putative peptides, which had an average length of 115 amino-acids (ranging between 23 and 440) were annotated with Gene Ontology (GO) terms. In total 1025 peptides (similar to 17% of the gene objects) were assigned at least one GO term (expect-value < 10(-25)). Within Collembase searches can be conducted based on BLAST and GO annotation, cluster name or using a BLAST server. The system furthermore enables easy sequence retrieval for functional genomic and Quantitative-PCR experiments. Sequences are submitted to GenBank (Accession numbers: EV473060-EV481745). Conclusion: Collembase http://www.collembase.org is a resource of sequence data on the springtail F. candida. The information within the database will be linked to a custom made microarray, based on the Agilent platform, which can be applied for soil quality testing. In addition, Collembase supplies information that is valuable for related scientific disciplines such as molecular ecology, ecogenomics, molecular evolution and phylogenetics.
引用
收藏
页数:10
相关论文
共 42 条
[1]  
[Anonymous], 1999, 11267 ISO
[2]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[3]   A wing expressed sequence tag resource for Bicyclus anynana butterflies, an evo-devo model [J].
Beldade, Patricia ;
Rudd, Stephen ;
Gruber, Jonathan D. ;
Long, Anthony D. .
BMC GENOMICS, 2006, 7 (1)
[4]   The genomic revolution: What does it mean for risk assessment? [J].
Bishop, WE ;
Clarke, DP ;
Travis, CC .
RISK ANALYSIS, 2001, 21 (06) :983-987
[5]   Habitat modification and refuge from sublethal stress drive a marine plant-herbivore association [J].
Burnaford, JL .
ECOLOGY, 2004, 85 (10) :2837-2849
[6]   Phylogenetic analysis of mitochondrial protein coding genes confirms the reciprocal paraphyly of Hexapoda and Crustacea [J].
Carapelli, Antonio ;
Lio, Pietro ;
Nardi, Francesco ;
van der Wath, Elizabeth ;
Frati, Francesco .
BMC EVOLUTIONARY BIOLOGY, 2007, 7 (Suppl 2)
[7]   Diversity of bacteria associated with Collembola - a cultivation-independent survey based on PCR-amplified 16S rRNA genes [J].
Czarnetzki, AB ;
Tebbe, CC .
FEMS MICROBIOLOGY ECOLOGY, 2004, 49 (02) :217-227
[8]   Suppression subtractive hybridization: A method for generating differentially regulated or tissue-specific cDNA probes and libraries [J].
Diatchenko, L ;
Lau, YFC ;
Campbell, AP ;
Chenchik, A ;
Moqadam, F ;
Huang, B ;
Lukyanov, S ;
Lukyanov, K ;
Gurskaya, N ;
Sverdlov, ED ;
Siebert, PD .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (12) :6025-6030
[9]   Base-calling of automated sequencer traces using phred.: II.: Error probabilities [J].
Ewing, B ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :186-194
[10]   Base-calling of automated sequencer traces using phred.: I.: Accuracy assessment [J].
Ewing, B ;
Hillier, L ;
Wendl, MC ;
Green, P .
GENOME RESEARCH, 1998, 8 (03) :175-185