Distant homology recognition using structural classification of proteins

被引:0
作者
Murzin, AG
Bateman, A
机构
[1] MRC Ctr, Ctr Prot Engn, Cambridge CB2 2QH, England
[2] MRC, Mol Biol Lab, Cambridge CB2 2QH, England
关键词
CASP; fold recognition; SCOP; superfamily; structure prediction;
D O I
暂无
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Protein structure prediction is arguably the biggest unsolved problem of structural biology, The notion of the number of naturally occurring different protein folds being limited allows partial solution of this problem by the use of fold recognition methods, which "thread" the sequence in question through a library of known protein folds, The fold recognition methods were thought to be superior to the distant homology recognition methods when there is no significant sequence similarity to known structures, We show here that the Structural Classification of Proteins (SCOP) database, organizing all known protein folds according their structural and evolutionary relationships, can be effectively used to enhance the sensitivity of the distant homology recognition methods to rival the "threading" methods, In the CASP2 experiment, our approach correctly assigned into existing SCOP superfamilies all of the six "fold recognition" targets we attempted, For each of the six targets, we correctly predicted the homologous protein with a very similar structure; often, it was the most similar structure, We correctly predicted local alignments of the sequence features that we found to be characteristic for the protein superfamily containing a given target, Our global alignments, extended manually from these local alignments, also appeared to be rather accurate. (C) 1998 Wiley-Liss, Inc.
引用
收藏
页码:105 / 112
页数:8
相关论文
共 34 条
[1]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[2]   THE SWISS-PROT PROTEIN-SEQUENCE DATA-BANK [J].
BAIROCH, A ;
BOECKMANN, B .
NUCLEIC ACIDS RESEARCH, 1991, 19 :2247-2248
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[4]   CONSTRUCTION OF VALIDATED, NONREDUNDANT COMPOSITE PROTEIN-SEQUENCE DATABASES [J].
BLEASBY, AJ ;
WOOTTON, JC .
PROTEIN ENGINEERING, 1990, 3 (03) :153-159
[5]   Crystal structure of a fungal elicitor secreted by Phytophthora cryptogea, a member of a novel class of plant necrotic proteins [J].
Boissy, G ;
deLaFortelle, E ;
Kahn, R ;
Huet, JC ;
Bricogne, G ;
Pernollet, JC ;
Brunie, S .
STRUCTURE, 1996, 4 (12) :1429-1439
[6]   H-1 AND N-15 RESONANCE ASSIGNMENT AND SECONDARY STRUCTURE OF CAPSICEIN, AN ALPHA-ELICITIN, DETERMINED BY 3-DIMENSIONAL HETERONUCLEAR NMR [J].
BOUAZIZ, S ;
VANHEIJENOORT, C ;
HUET, JC ;
PERNOLLET, JC ;
GUITTET, E .
BIOCHEMISTRY, 1994, 33 (27) :8188-8197
[7]   A PROTEIN CATALYTIC FRAMEWORK WITH AN N-TERMINAL NUCLEOPHILE IS CAPABLE OF SELF-ACTIVATION [J].
BRANNIGAN, JA ;
DODSON, G ;
DUGGLEBY, HJ ;
MOODY, PCE ;
SMITH, JL ;
TOMCHICK, DR ;
MURZIN, AG .
NATURE, 1995, 378 (6555) :416-419
[8]   THREE-DIMENSIONAL STRUCTURE OF O-ACETYLSERINE SULFHYDRYLASE FROM SALMONELLA TYPHIMURIUM [J].
Burkhard, P. ;
Hohenester, E. ;
Rao, G. S. J. ;
Cook, P. F. ;
Jansonius, J. N. .
ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 1996, 52 :C125-C126
[9]   The solution structure of the S1 RNA binding domain: A member of an ancient nucleic acid-binding fold [J].
Bycroft, M ;
Hubbard, TJP ;
Proctor, M ;
Freund, SMV ;
Murzin, AG .
CELL, 1997, 88 (02) :235-242
[10]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544