Atomic Interaction Networks in the Core of Protein Domains and Their Native Folds

被引:35
作者
Soundararajan, Venkataramanan [1 ]
Raman, Rahul
Raguram, S.
Sasisekharan, V.
Sasisekharan, Ram
机构
[1] MIT, Koch Inst Integrat Canc Res, Harvard Mit Div Hlth Sci & Technol, Cambridge, MA 02139 USA
关键词
STRUCTURE PREDICTION; SECONDARY STRUCTURE; SEQUENCE; EVOLUTION; RECOGNITION; DATABASE; PACKING; YOPM;
D O I
10.1371/journal.pone.0009391
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Vastly divergent sequences populate a majority of protein folds. In the quest to identify features that are conserved within protein domains belonging to the same fold, we set out to examine the entire protein universe on a fold-by-fold basis. We report that the atomic interaction network in the solvent-unexposed core of protein domains are fold-conserved, extraordinary sequence divergence notwithstanding. Further, we find that this feature, termed protein core atomic interaction network (or PCAIN) is significantly distinguishable across different folds, thus appearing to be "signature'' of a domain's native fold. As part of this study, we computed the PCAINs for 8698 representative protein domains from families across the 1018 known protein folds to construct our seed database and an automated framework was developed for PCAIN-based characterization of the protein fold universe. A test set of randomly selected domains that are not in the seed database was classified with over 97% accuracy, independent of sequence divergence. As an application of this novel fold signature, a PCAIN-based scoring scheme was developed for comparative (homology-based) structure prediction, with 1-2 angstroms (mean 1.61A) C-alpha RMSD generally observed between computed structures and reference crystal structures. Our results are consistent across the full spectrum of test domains including those from recent CASP experiments and most notably in the 'twilight' and 'midnight' zones wherein <30% and <10% target-template sequence identity prevails (mean twilight RMSD of 1.69A). We further demonstrate the utility of the PCAIN protocol to derive biological insight into protein structure-function relationships, by modeling the structure of the YopM effector novel E3 ligase (NEL) domain from plague-causative bacterium Yersinia Pestis and discussing its implications for host adaptive and innate immune modulation by the pathogen. Considering the several high-throughput, sequence-identity-independent applications demonstrated in this work, we suggest that the PCAIN is a fundamental fold feature that could be a valuable addition to the arsenal of protein modeling and analysis tools.
引用
收藏
页数:13
相关论文
共 51 条
[1]   The design and characterization of two proteins with 88% sequence identity but different structure and function [J].
Alexander, Patrick A. ;
He, Yanan ;
Chen, Yihong ;
Orban, John ;
Bryan, Philip N. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2007, 104 (29) :11963-11968
[2]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[3]   Energetics of protein folding [J].
Baldwin, Robert L. .
JOURNAL OF MOLECULAR BIOLOGY, 2007, 371 (02) :283-301
[4]  
Bartoli Lisa, 2008, V413, P199
[5]   Salmonella Type III Secretion Effector SlrP Is an E3 Ubiquitin Ligase for Mammalian Thioredoxin [J].
Bernal-Bayard, Joaquin ;
Ramos-Morales, Francisco .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2009, 284 (40) :27587-27595
[6]   Structural determinants of the rate of protein evolution in yeast [J].
Bloom, Jesse D. ;
Drummond, D. Allan ;
Arnold, Frances H. ;
Wilke, Claus O. .
MOLECULAR BIOLOGY AND EVOLUTION, 2006, 23 (09) :1751-1761
[7]   Ab initio protein structure prediction: Progress and prospects [J].
Bonneau, R ;
Baker, D .
ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 2001, 30 :173-189
[8]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[9]   Protein folding mediated by solvation:: Water expulsion and formation of the hydrophobic core occur after the structural collapse [J].
Cheung, MS ;
García, AE ;
Onuchic, JN .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2002, 99 (02) :685-690
[10]   Evolution of protein structural classes and protein sequence families [J].
Choi, In-Geol ;
Kim, Sung-Hou .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (38) :14056-14061