Native secondary structure topology has near minimum contact energy among all possible geometrically constrained topologies

被引:14
作者
Sun, Weitao [1 ,2 ]
He, Jing [1 ]
机构
[1] New Mexico State Univ, Dept Comp Sci, Las Cruces, NM 88003 USA
[2] Tsinghua Univ, Zhou Pei Yuan Ctr Appl Math, Beijing 100084, Peoples R China
关键词
secondary structure; topology; contact energy; electron microscopy; protein structure prediction; 3-DIMENSIONAL STRUCTURES; INFORMATICS APPROACH; RESIDUE POTENTIALS; PROTEIN TOPOLOGY; PEAK DETECTION; BETA-SHEETS; RESOLUTION; PREDICTION; IDENTIFICATION; SIMULATION;
D O I
10.1002/prot.22427
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Secondary structure topology in this article refers to the order and the direction of the secondary structures, such as helices and strands, with respect to the protein sequence. Even when the locations of the secondary structure C alpha atoms are known, there are still (N!2(N)) (M!2(M)) different possible topologies for a protein with N helices and M strands. This work explored the question if the native topology is likely to be identified among a large set of all possible geometrically constrained topologies through an evaluation of the residue contact energy formed by the secondary structures, instead of the entire chain. We developed a contact pair specific and distance specific multiwell function based on the statistical characterization of the side chain distances of 413 proteins in the Protein Data Bank. The multiwell function has specific parameters to each of the 210 pairs of residue contacts. We illustrated a general mathematical method to extend a single well function to a multiwell function to represent the statistical data. We have performed a mutation analysis using 50 proteins to generate all the possible geometrically constrained topologies of the secondary structures. The result shows that the native topology is within the top 25% of the list ranked by the effective contact energies of the secondary structures for all the 50 proteins, and is within the top 5% for 34 proteins. As an application, the method was used to derive the structure of the skeletons from a low resolution density map that can be obtained through electron cryomicroscopy.
引用
收藏
页码:159 / 173
页数:15
相关论文
共 57 条
[1]  
[Anonymous], 1992, ATLAS PROTEIN SIDE C
[2]   Inter-residue potentials in globular proteins and the dominance of highly specific hydrophilic interactions at close separation [J].
Bahar, I ;
Jernigan, RL .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 266 (01) :195-214
[3]   Ab initio modeling of the herpesvirus VP26 core domain assessed by CryoEM density [J].
Baker, Matthew L. ;
Jiang, Wen ;
Wedemeyer, William J. ;
Rixon, Frazer J. ;
Baker, David ;
Chiu, Wah .
PLOS COMPUTATIONAL BIOLOGY, 2006, 2 (10) :1313-1324
[4]   Identification of secondary structure elements in intermediate-resolution density maps [J].
Baker, Matthew L. ;
Ju, Tao ;
Chiu, Wah .
STRUCTURE, 2007, 15 (01) :7-19
[5]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[6]   A new representation for protein secondary structure prediction based on frequent patterns [J].
Birzele, Fabian ;
Kramer, Stefan .
BIOINFORMATICS, 2006, 22 (21) :2628-2634
[7]   A FULLY AUTOMATED CHROMATOGRAPHIC PEAK DETECTION AND TREATMENT SOFTWARE FOR MULTIUSER MULTITASK COMPUTERS [J].
CARDOT, PJP ;
TROLLIARD, P ;
TEMBELY, S .
JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 1990, 8 (8-12) :755-759
[9]   CONFORMATIONS OF FOLDED PROTEINS IN RESTRICTED SPACES [J].
COVELL, DG ;
JERNIGAN, RL .
BIOCHEMISTRY, 1990, 29 (13) :3287-3294
[10]  
Dal Palo A, 2006, Comput Syst Bioinformatics Conf, P89