Proteome Analysis Database: online application of InterPro and CluSTr for the functional classification of proteins in whole genomes

Cited by: 71
Authors
Apweiler, R [1]
Biswas, M [1]
Fleischmann, W [1]
Kanapin, A [1]
Karavidopoulou, Y [1]
Kersey, P [1]
Kriventseva, EV [1]
Mittard, V [1]
Mulder, N [1]
Phan, I [1]
Zdobnov, E [1]
Affiliation
[1] EMBL Outstation, European Bioinformatics Institute, Wellcome Trust Genome Campus, Cambridge CB10 1SD, England
DOI
10.1093/nar/29.1.44
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
The SWISS-PROT group at EBI has developed the Proteome Analysis Database utilising existing resources and providing comparative analysis of the predicted protein coding sequences of the complete genomes of bacteria, archaea and eukaryotes (http://www.ebi.ac.uk/proteome/). The two main projects used, InterPro and CluSTr, give a new perspective on families, domains and sites and cover 31-67% (InterPro statistics) of the proteins from each of the complete genomes. CluSTr covers the three complete eukaryotic genomes and the incomplete human genome data. The Proteome Analysis Database is accompanied by a program that has been designed to carry out InterPro proteome comparisons for any one proteome against any other one or more of the proteomes in the database.
Pages: 44-48
Page count: 5
Related papers
19 in total
[1]   The InterPro database, an integrated documentation resource for protein families, domains and functional sites [J].
Apweiler, R ;
Attwood, TK ;
Bairoch, A ;
Bateman, A ;
Birney, E ;
Biswas, M ;
Bucher, P ;
Cerutti, T ;
Corpet, F ;
Croning, MDR ;
Durbin, R ;
Falquet, L ;
Fleischmann, W ;
Gouzy, J ;
Hermjakob, H ;
Hulo, N ;
Jonassen, I ;
Kahn, D ;
Kanapin, A ;
Karavidopoulou, Y ;
Lopez, R ;
Marx, B ;
Mulder, NJ ;
Oinn, TM ;
Pagni, M ;
Servant, F ;
Sigrist, CJA ;
Zdobnov, EM .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :37-40
[2]   Protein sequence databases [J].
Apweiler, R .
ADVANCES IN PROTEIN CHEMISTRY, VOL 54, 2000, 54 :31-71
[3]   Gene Ontology: tool for the unification of biology [J].
Ashburner, M ;
Ball, CA ;
Blake, JA ;
Botstein, D ;
Butler, H ;
Cherry, JM ;
Davis, AP ;
Dolinski, K ;
Dwight, SS ;
Eppig, JT ;
Harris, MA ;
Hill, DP ;
Issel-Tarver, L ;
Kasarskis, A ;
Lewis, S ;
Matese, JC ;
Richardson, JE ;
Ringwald, M ;
Rubin, GM ;
Sherlock, G .
NATURE GENETICS, 2000, 25 (01) :25-29
[4]   PRINTS-S: the database formerly known as PRINTS [J].
Attwood, TK ;
Croning, MDR ;
Flower, DR ;
Lewis, AP ;
Mabey, JE ;
Scordis, P ;
Selley, JN ;
Wright, W .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :225-227
[5]   The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000 [J].
Bairoch, A ;
Apweiler, R .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :45-48
[6]  
Bateman A, 2004, NUCLEIC ACIDS RES, V32, pD138, DOI [10.1093/nar/gkp985, 10.1093/nar/gkh121, 10.1093/nar/gkr1065]
[7]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[8]   The PDB data uniformity project [J].
Bhat, TN ;
Bourne, P ;
Feng, ZK ;
Gilliland, G ;
Jain, S ;
Ravichandran, V ;
Schneider, B ;
Schneider, K ;
Thanki, N ;
Weissig, H ;
Westbrook, J ;
Berman, HM .
NUCLEIC ACIDS RESEARCH, 2001, 29 (01) :214-218
[9]   Significance of Z-value statistics of Smith-Waterman scores for protein alignments [J].
Comet, JP ;
Aude, JC ;
Glémet, E ;
Risler, JL ;
Hénaut, A ;
Slonimski, PP ;
Codani, JJ .
COMPUTERS & CHEMISTRY, 1999, 23 (3-4) :317-331
[10]   ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisons [J].
Corpet, F ;
Servant, F ;
Gouzy, J ;
Kahn, D .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :267-269