A New Secondary Structure Assignment Algorithm Using Cα Backbone Fragments

被引:16
作者
Cao, Chen [1 ,2 ]
Wang, Guishen [1 ,2 ]
Liu, An [3 ]
Xu, Shutan [1 ,2 ]
Wang, Lincong [1 ,2 ]
Zou, Shuxue [1 ,2 ]
机构
[1] Jilin Univ, Coll Comp Sci & Technol, Changchun 130012, Peoples R China
[2] Jilin Univ, Minist Educ, Key Lab Symbol Computat & Knowledge Engn, Changchun 130012, Peoples R China
[3] Jilin Univ, Coll Pharmaceut Sci, Changchun 130012, Peoples R China
基金
中国国家自然科学基金;
关键词
secondary structure assignment; protein; C-alpha backbone fragment; outlier detection; cluster; PROTEIN COORDINATE DATA; STRUCTURE PREDICTION; MEMBRANE-PROTEINS; DATA-BANK; IDENTIFICATION; CLASSIFICATION; DEFINITION; FEATURES; PROGRAM; HELICES;
D O I
10.3390/ijms17030333
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The assignment of secondary structure elements in proteins is a key step in the analysis of their structures and functions. We have developed an algorithm, SACF (secondary structure assignment based on C-alpha fragments), for secondary structure element (SSE) assignment based on the alignment of C-alpha backbone fragments with central poses derived by clustering known SSE fragments. The assignment algorithm consists of three steps: First, the outlier fragments on known SSEs are detected. Next, the remaining fragments are clustered to obtain the central fragments for each cluster. Finally, the central fragments are used as a template to make assignments. Following a large-scale comparison of 11 secondary structure assignment methods, SACF, KAKSI and PROSS are found to have similar agreement with DSSP, while PCASSO agrees with DSSP best. SACF and PCASSO show preference to reducing residues in N and C cap regions, whereas KAKSI, P-SEA and SEGNO tend to add residues to the terminals when DSSP assignment is taken as standard. Moreover, our algorithm is able to assign subtle helices (3(10)-helix, pi-helix and left-handed helix) and make uniform assignments, as well as to detect rare SSEs in beta-sheets or long helices as outlier fragments from other programs. The structural uniformity should be useful for protein structure classification and prediction, while outlier fragments underlie the structure-function relationship.
引用
收藏
页数:16
相关论文
共 44 条
[1]   The worldwide Protein Data Bank (wwPDB): ensuring a single, uniform archive of PDB data [J].
Berman, Helen ;
Henrick, Kim ;
Nakamura, Haruki ;
Markley, John L. .
NUCLEIC ACIDS RESEARCH, 2007, 35 :D301-D303
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   An Algorithm for Protein Helix Assignment Using Helix Geometry [J].
Cao, Chen ;
Xu, Shutan ;
Wang, Lincong .
PLOS ONE, 2015, 10 (07)
[4]   COMPARISON OF 3 ALGORITHMS FOR THE ASSIGNMENT OF SECONDARY STRUCTURE IN PROTEINS - THE ADVANTAGES OF A CONSENSUS ASSIGNMENT [J].
COLLOCH, N ;
ETCHEBEST, C ;
THOREAU, E ;
HENRISSAT, B ;
MORNON, JP .
PROTEIN ENGINEERING, 1993, 6 (04) :377-382
[5]   Secondary structure assignment that accurately reflects physical and evolutionary characteristics [J].
Cubellis, MV ;
Cailliez, F ;
Lovell, SC .
BMC BIOINFORMATICS, 2005, 6 (Suppl 4)
[6]   Occurrence, conformational features and amino acid propensities for the π-helix [J].
Fodje, MN ;
Al-Karadaghi, S .
PROTEIN ENGINEERING, 2002, 15 (05) :353-358
[7]   Knowledge-based protein secondary structure assignment [J].
Frishman, D ;
Argos, P .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (04) :566-579
[8]   A survey of outlier detection methodologies [J].
Hodge V.J. ;
Austin J. .
Artificial Intelligence Review, 2004, 22 (2) :85-126
[9]   Update on protein structure prediction: Results of the 1995 IRBM workshop [J].
Hubbard, T ;
Tramontano, A ;
Barton, G ;
Jones, D ;
Sippl, M ;
Valencia, A ;
Lesk, A ;
Moult, J ;
Rost, B ;
Sander, C ;
Schneider, R ;
Lahm, A ;
Leplae, R ;
Buta, C ;
Eisenstein, M ;
Fjellstrom, O ;
Floeckner, H ;
Grossmann, JG ;
Hansen, J ;
Citterich, MH ;
Joergensen, FS ;
MarchlerBauer, A ;
Osuna, J ;
Park, J ;
Reinhardt, A ;
dePouplana, LR ;
RojoDominguez, A ;
Saudek, V ;
Sinclair, J ;
Sturrock, S ;
Venclovas, C ;
Vinals, C .
FOLDING & DESIGN, 1996, 1 (03) :R55-R63
[10]  
Hutchinson EG, 1996, PROTEIN SCI, V5, P212