Quantitative analysis of the conservation of the tertiary structure of protein segments

被引:9
作者
Ruan, Jishou
Chen, Ke
Tuszynski, Jack A.
Kurgan, Lukasz A. [1 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2V4, Canada
[2] Nankai Univ, Chern Inst Math, Coll Math Sci, Tianjin 300071, Peoples R China
[3] Nankai Univ, LPMC, Tianjin 300071, Peoples R China
[4] Univ Alberta, Dept Phys, Edmonton, AB T6G 2J1, Canada
[5] Cross Canc Inst, Dept Expt Oncol, Edmonton, AB T6G 2J1, Canada
基金
中国国家自然科学基金; 加拿大自然科学与工程研究理事会;
关键词
multi-structural segments; protein structure; protein structure comparison; protein structure conservation; URMSD;
D O I
10.1007/s10930-006-9016-5
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The publication of the crystallographic structure of calmodulin protein has offered an example leading us to believe that it is possible for many protein sequence segments to exhibit multiple 3D structures referred to as multi-structural segments. To this end, this paper presents statistical analysis of uniqueness of the 3D-structure of all possible protein sequence segments stored in the Protein Data Bank (PDB, Jan. of 2003, release 103) that occur at least twice and whose lengths are greater than 10 amino acids (AAs). We refined the set of segments by choosing only those that are not parts of longer segments, which resulted in 9297 segments called a sponge set. By adding 8197 signature segments, which occur uniquely in the PDB, into the sponge set we have generated a benchmark set. Statistical analysis of the sponge set demonstrates that rotating, missing and disarranging operations described in the text, result in the segments becoming multistructural. It turns out that missing segments do not exhibit a change of shape in the 3D-structure of a multi-structural segment. We use the root mean square distance for unit vector sequence (URMSD) as an improved measure to describe the characteristics of hinge rotations, missing, and disarranging segments. We estimated the rate of occurrence for rotating and disarranging segments in the sponge set and divided it by the number of sequences in the benchmark set which is found to be less than 0.85%. Since two of the structure changing operations concern negligible number of segment and the third one is found not to have impact on the structure, we conclude that the 3D-structure of proteins is conserved statistically for more than 98% of the segments. At the same time, the remaining 2% of the sequences may pose problems for the sequence alignment based structure prediction methods.
引用
收藏
页码:301 / 315
页数:15
相关论文
共 29 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]   PREDICTIVE MODELING OF THE 3-D STRUCTURE OF INTERLEUKIN-13 [J].
BAMBOROUGH, P ;
DUNCAN, D ;
RICHARDS, WG .
PROTEIN ENGINEERING, 1994, 7 (09) :1077-1082
[3]   The domain-swapped dimer of cyanovirin-N is in a metastable folded state: Reconciliation of X-ray and NMR structures [J].
Barrientos, LG ;
Louis, JM ;
Botos, I ;
Mori, T ;
Han, ZZ ;
O'Keefe, BR ;
Boyd, MR ;
Wlodawer, A ;
Gronenborn, AM .
STRUCTURE, 2002, 10 (05) :673-686
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]  
Brody SS, 1999, PROTEINS, V37, P485, DOI 10.1002/(SICI)1097-0134(19991115)37:3<485::AID-PROT15>3.0.CO
[6]  
2-G
[7]   Prediction of three dimensional structure of calmodulin [J].
Chen, Ke ;
Ruan, Jishou ;
Kurgan, Lukasz A. .
PROTEIN JOURNAL, 2006, 25 (01) :57-70
[8]   Fast detection of common geometric substructure in proteins [J].
Chew, LP ;
Huttenlocher, D ;
Kedem, K ;
Kleinberg, J .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1999, 6 (3-4) :313-325
[9]   Structure and functional implications of the polymerase active site region in a complex of HIV-1 RT with a double-stranded DNA template-primer and an antibody Fab fragment at 2.8 Å resolution [J].
Ding, JP ;
Das, K ;
Hsiou, Y ;
Sarafianos, SG ;
Clark, AD ;
Jacobo-Molina, A ;
Tantillo, C ;
Hughes, SH ;
Arnold, E .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 284 (04) :1095-1111
[10]   Structural basis for the activation of anthrax adenylyl cyclase exotoxin by calmodulin [J].
Drum, CL ;
Yan, SZ ;
Bard, J ;
Shen, YQ ;
Lu, D ;
Soelaiman, S ;
Grabarek, Z ;
Bohm, A ;
Tang, WJ .
NATURE, 2002, 415 (6870) :396-402