Structural Characteristics of Novel Protein Folds

被引:49
作者
Fernandez-Fuentes, Narcis [1 ]
Dybas, Joseph M. [2 ]
Fiser, Andras [2 ]
机构
[1] Univ Leeds, Leeds Inst Mol Med, Sect Expt Therapeut, St Jamess Univ Hosp, Leeds, W Yorkshire, England
[2] Albert Einstein Coll Med, Dept Syst & Computat Biol, Dept Biochem, Bronx, NY 10467 USA
关键词
STRUCTURE PREDICTION; SECONDARY STRUCTURE; CLASSIFICATION; FAMILIES; DATABASE; SCOP; GENOMICS; PROGRESS; MOTIFS; NUMBER;
D O I
10.1371/journal.pcbi.1000750
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Folds are the basic building blocks of protein structures. Understanding the emergence of novel protein folds is an important step towards understanding the rules governing the evolution of protein structure and function and for developing tools for protein structure modeling and design. We explored the frequency of occurrences of an exhaustively classified library of supersecondary structural elements (Smotifs), in protein structures, in order to identify features that would define a fold as novel compared to previously known structures. We found that a surprisingly small set of Smotifs is sufficient to describe all known folds. Furthermore, novel folds do not require novel Smotifs, but rather are a new combination of existing ones. Novel folds can be typified by the inclusion of a relatively higher number of rarely occurring Smotifs in their structures and, to a lesser extent, by a novel topological combination of commonly occurring Smotifs. When investigating the structural features of Smotifs, we found that the top 10% of most frequent ones have a higher fraction of internal contacts, while some of the most rare motifs are larger, and contain a longer loop region.
引用
收藏
页数:11
相关论文
共 44 条
[1]   A galaxy of folds [J].
Alva, Vikram ;
Remmert, Michael ;
Biegert, Andreas ;
Lupas, Andrei N. ;
Soeding, Johannes .
PROTEIN SCIENCE, 2010, 19 (01) :124-130
[2]   Data growth and its impact on the SCOP database: new developments [J].
Andreeva, Antonina ;
Howorth, Dave ;
Chandonia, John-Marc ;
Brenner, Steven E. ;
Hubbard, Tim J. P. ;
Chothia, Cyrus ;
Murzin, Alexey G. .
NUCLEIC ACIDS RESEARCH, 2008, 36 :D419-D425
[3]   Closed loops of nearly standard size: common basic element of protein structure [J].
Berezovsky, IN ;
Grosberg, AY ;
Trifonov, EN .
FEBS LETTERS, 2000, 466 (2-3) :283-286
[4]  
Boutonnet NS, 1998, PROTEINS, V30, P193, DOI 10.1002/(SICI)1097-0134(19980201)30:2<193::AID-PROT9>3.0.CO
[5]  
2-O
[6]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544
[7]   Common evolutionary origin of swapped-hairpin and double-psi β barrels [J].
Coles, Murray ;
Hulko, Michael ;
Djuranovic, Sergej ;
Truffault, Vincent ;
Koretke, Kristin ;
Martin, Joerg ;
Lupas, Andrei N. .
STRUCTURE, 2006, 14 (10) :1489-1498
[8]   Macromolecular modeling with Rosetta [J].
Das, Rhiju ;
Baker, David .
ANNUAL REVIEW OF BIOCHEMISTRY, 2008, 77 :363-382
[9]   PSI-2: Structural Genomics to Cover Protein Domain Family Space [J].
Dessailly, Benoit H. ;
Nair, Rajesh ;
Jaroszewski, Lukasz ;
Fajardo, J. Eduardo ;
Kouranov, Andrei ;
Lee, David ;
Fiser, Andras ;
Godzik, Adam ;
Rost, Burkhard ;
Orengo, Christine .
STRUCTURE, 2009, 17 (06) :869-881
[10]   Saturating representation of loop conformational fragments in structure databanks [J].
Fernandez-Fuentes, Narcis ;
Fiser, Andras .
BMC STRUCTURAL BIOLOGY, 2006, 6