Development of a TSR-Based Method for Protein 3-D Structural Comparison With Its Applications to Protein Classification and Motif Discovery

Times cited: 11
Authors
Kondra, Sarika [1]
Sarkar, Titli [1]
Raghavan, Vijay [1]
Xu, Wu [2]
Affiliations
[1] Univ Louisiana Lafayette, Ctr Adv Comp Studies, Lafayette, LA 70504 USA
[2] Univ Louisiana Lafayette, Dept Chem, Lafayette, LA 70504 USA
Keywords
protein structure comparison; triangular spatial relationship; structure motifs; protein classification; protein structure and function relation; protein secondary structure; molecular dynamics simulation; protein conformational change; STRUCTURE ALIGNMENT; SECONDARY-STRUCTURE; MOLECULAR-DYNAMICS; DATABASE; RECOGNITION; SIMILARITY; SCOP; CATH; PARAMETERS; RESOLUTION;
DOI
10.3389/fchem.2020.602291
Chinese Library Classification number
O6 [Chemistry];
Subject classification code
0703;
Abstract
Development of protein 3-D structural comparison methods is important in understanding protein functions. At the same time, developing such a method is very challenging. In the last 40 years, ever since the development of the first automated structural method, approximately 200 papers have been published using different representations of structures. The existing methods can be divided into five categories: sequence-, distance-, secondary structure-, geometry-, and network-based structural comparisons. Each has its uniqueness, but also limitations. We have developed a novel method where the 3-D structure of a protein is modeled using the concept of Triangular Spatial Relationship (TSR), where triangles are constructed with the C-alpha atoms of a protein as vertices. Every triangle is represented using an integer, which we denote as a "key." A key is computed using the length, angle, and vertex labels based on a rule-based formula, which ensures assignment of the same key to identical TSRs across proteins. A structure is thereby represented by a vector of integers. Our method is able to accurately quantify similarity of structure or substructure by matching numbers of identical keys between two proteins. The uniqueness of our method includes: (i) a unique way to represent structures to avoid performing structural superimposition; (ii) use of triangles to represent substructures, as the triangle is the simplest primitive to capture shape; (iii) complex structure comparison is achieved by matching integers corresponding to multiple TSRs. Every substructure of one protein is compared to every other substructure in a different protein. The method is used in the studies of proteases and kinases because they play essential roles in cell signaling, and a majority of these constitute drug targets. The new motifs or substructures we identified specifically for proteases and kinases provide a deeper insight into their structural relations. Furthermore, the method provides a unique way to study protein conformational changes. In addition, the results from CATH and SCOP data sets clearly demonstrate that our method can distinguish alpha helices from beta pleated sheets and vice versa. Our method has the potential to be developed into a powerful tool for efficient structure-BLAST search and comparison, just as BLAST is for sequence search and alignment.
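The key-based comparison described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the rule-based formula, so everything specific here is an assumption made for illustration: the bin widths, the choice of the longest edge and the largest interior angle as the geometric features, the integer packing in `triangle_key`, and the multiset Jaccard score in `similarity`. The function names are hypothetical and are not taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations

# Assumed discretization; the paper's actual bins are not stated in the abstract.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
LABEL = {aa: i for i, aa in enumerate(AMINO_ACIDS)}  # vertex label per residue type
N_LABELS = len(AMINO_ACIDS)
N_ANGLE_BINS = 12    # largest interior angle, 60..180 degrees, 10-degree bins
N_LENGTH_BINS = 20   # longest edge, 2-angstrom bins, capped at the last bin


def triangle_key(r1, r2, r3):
    """Pack one C-alpha triangle into a single integer.

    Each residue is (one_letter_code, (x, y, z)). Only order-independent
    quantities are used, so any vertex ordering yields the same key.
    Assumes the three coordinates are distinct.
    """
    l1, l2, l3 = sorted(LABEL[r[0]] for r in (r1, r2, r3))
    a, b, c = sorted(
        math.dist(p[1], q[1]) for p, q in combinations((r1, r2, r3), 2)
    )
    # Largest interior angle lies opposite the longest edge c (law of cosines).
    cos_c = max(-1.0, min(1.0, (a * a + b * b - c * c) / (2.0 * a * b)))
    angle = math.degrees(math.acos(cos_c))
    angle_bin = min(int((angle - 60.0) / 10.0), N_ANGLE_BINS - 1)
    length_bin = min(int(c / 2.0), N_LENGTH_BINS - 1)
    labels = (l1 * N_LABELS + l2) * N_LABELS + l3
    return (labels * N_ANGLE_BINS + angle_bin) * N_LENGTH_BINS + length_bin


def protein_keys(residues):
    """Represent a structure as a multiset of keys over all residue triples."""
    return Counter(triangle_key(*tri) for tri in combinations(residues, 3))


def similarity(keys_a, keys_b):
    """Multiset Jaccard score: shared key occurrences / total key occurrences."""
    total = sum((keys_a | keys_b).values())
    return sum((keys_a & keys_b).values()) / total if total else 0.0


if __name__ == "__main__":
    # Toy coordinates, not taken from any real structure.
    p = [("A", (0.0, 0.0, 0.0)), ("G", (3.8, 0.0, 0.0)),
         ("S", (5.0, 3.5, 0.0)), ("L", (1.0, 4.0, 2.0))]
    print(similarity(protein_keys(p), protein_keys(p)))
```

Because two structures are compared only through their integer keys, no superimposition step is needed, which is the property the abstract emphasizes. Enumerating all residue triples grows cubically with chain length, so a practical implementation would likely need a distance cutoff or similar pruning; that is also an assumption, not something the abstract states.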
Pages: 28