Survey of current protein family databases and their application in comparative, structural and functional genomics

Cited by: 9
Authors
Redfern, O [1]
Grant, A [1]
Maibaum, M [1]
Orengo, C [1]
Affiliations
[1] UCL, Dept Biochem & Mol Biol, London WC1E 6BT, England
Source
JOURNAL OF CHROMATOGRAPHY B-ANALYTICAL TECHNOLOGIES IN THE BIOMEDICAL AND LIFE SCIENCES | 2005, Vol. 815, Issue 1-2
Keywords
databases; proteomics; genomics; CATH; structural classifications;
DOI
10.1016/j.jchromb.2004.11.010
Chinese Library Classification number
Q5 [Biochemistry];
Discipline classification code
071010 ; 081704 ;
Abstract
The last two decades have witnessed significant expansions in the databases storing information on the sequences and structures of proteins. This has led to the creation of many excellent protein family resources, which classify proteins according to their evolutionary relationship. These have allowed extensive insights into evolution and particularly how protein function mutates and evolves over time. Such analyses have greatly assisted the inheritance of functional annotations between experimentally characterised and uncharacterised genes. Moreover, the development of bioinformatics tools acts as a companion to the new technologies emerging in biology, such as transcriptomics and proteomics. The latter enable researchers to analyse gene expression profiles and interactions on a genome-wide scale, generating vast datasets of proteins, many of which include experimentally uncharacterised proteins. Protein family/function databases can be used to help interpret this data and allow us to benefit more fully from these technologies. This review aims to summarise the most popular sequence- and structure-based protein family databases. We also cover their application to comparative genomics and the functional annotation of the genomes. © 2004 Elsevier B.V. All rights reserved.
Pages: 97-107
Page count: 11