Development of a TSR-Based Method for Protein 3-D Structural Comparison With Its Applications to Protein Classification and Motif Discovery

被引:11
作者
Kondra, Sarika [1 ]
Sarkar, Titli [1 ]
Raghavan, Vijay [1 ]
Xu, Wu [2 ]
机构
[1] Univ Louisiana Lafayette, Ctr Adv Comp Studies, Lafayette, LA 70504 USA
[2] Univ Louisiana Lafayette, Dept Chem, Lafayette, LA 70504 USA
关键词
protein structure comparison; triangular spatial relationship; structure motifs; protein classification; protein structure and function relation; protein secondary structure; molecular dynamics simulation; protein conformational change; STRUCTURE ALIGNMENT; SECONDARY-STRUCTURE; MOLECULAR-DYNAMICS; DATABASE; RECOGNITION; SIMILARITY; SCOP; CATH; PARAMETERS; RESOLUTION;
D O I
10.3389/fchem.2020.602291
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Development of protein 3-D structural comparison methods is important in understanding protein functions. At the same time, developing such a method is very challenging. In the last 40 years, ever since the development of the first automated structural method, similar to 200 papers were published using different representations of structures. The existing methods can be divided into five categories: sequence-, distance-, secondary structure-, geometry-based, and network-based structural comparisons. Each has its uniqueness, but also limitations. We have developed a novel method where the 3-D structure of a protein is modeled using the concept of Triangular Spatial Relationship (TSR), where triangles are constructed with the C-alpha atoms of a protein as vertices. Every triangle is represented using an integer, which we denote as "key," A key is computed using the length, angle, and vertex labels based on a rule-based formula, which ensures assignment of the same key to identical TSRs across proteins. A structure is thereby represented by a vector of integers. Our method is able to accurately quantify similarity of structure or substructure by matching numbers of identical keys between two proteins. The uniqueness of our method includes: (i) a unique way to represent structures to avoid performing structural superimposition; (ii) use of triangles to represent substructures as it is the simplest primitive to capture shape; (iii) complex structure comparison is achieved by matching integers corresponding to multiple TSRs. Every substructure of one protein is compared to every other substructure in a different protein. The method is used in the studies of proteases and kinases because they play essential roles in cell signaling, and a majority of these constitute drug targets. The new motifs or substructures we identified specifically for proteases and kinases provide a deeper insight into their structural relations. Furthermore, the method provides a unique way to study protein conformational changes. In addition, the results from CATH and SCOP data sets clearly demonstrate that our method can distinguish alpha helices from beta pleated sheets and vice versa. Our method has the potential to be developed into a powerful tool for efficient structure-BLAST search and comparison, just as BLAST is for sequence search and alignment.
引用
收藏
页数:28
相关论文
共 113 条
[11]   FINDING ALL CLIQUES OF AN UNDIRECTED GRAPH [H] [J].
BRON, C ;
KERBOSCH, J .
COMMUNICATIONS OF THE ACM, 1973, 16 (09) :575-577
[12]   DISSECTING THE CATALYTIC TRIAD OF A SERINE PROTEASE [J].
CARTER, P ;
WELLS, JA .
NATURE, 1988, 332 (6164) :564-568
[13]   The Amber biomolecular simulation programs [J].
Case, DA ;
Cheatham, TE ;
Darden, T ;
Gohlke, H ;
Luo, R ;
Merz, KM ;
Onufriev, A ;
Simmerling, C ;
Wang, B ;
Woods, RJ .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2005, 26 (16) :1668-1688
[14]   Protein kinases - the major drug targets of the twenty-first century? [J].
Cohen, P .
NATURE REVIEWS DRUG DISCOVERY, 2002, 1 (04) :309-315
[15]  
de Brevern AG, 2000, PROTEINS, V41, P271, DOI 10.1002/1097-0134(20001115)41:3<271::AID-PROT10>3.0.CO
[16]  
2-Z
[17]   RASMOT-3D PRO: a 3D motif search webserver [J].
Debret, Gaelle ;
Martel, Arnaud ;
Cuniasse, Philippe .
NUCLEIC ACIDS RESEARCH, 2009, 37 :W459-W464
[18]   The development of imatinib as a therapeutic agent for chronic myeloid leukemia [J].
Deininger, M ;
Buchdunger, E ;
Druker, BJ .
BLOOD, 2005, 105 (07) :2640-2653
[19]   Multi-class protein fold recognition using support vector machines and neural networks [J].
Ding, CHQ ;
Dubchak, I .
BIOINFORMATICS, 2001, 17 (04) :349-358
[20]   Catalytic triads and their relatives [J].
Dodson, G ;
Wlodawer, A .
TRENDS IN BIOCHEMICAL SCIENCES, 1998, 23 (09) :347-352