Biological insights from topology independent comparison of protein 3D structures

被引:37
作者
Nguyen, Minh N. [1 ]
Madhusudhan, M. S. [1 ,2 ,3 ]
机构
[1] Bioinformat Inst, Singapore 138671, Singapore
[2] Natl Univ Singapore, Dept Biol Sci, Singapore 117548, Singapore
[3] Nanyang Technol Univ, Sch Biol Sci, Singapore, Singapore
关键词
RNA TERTIARY STRUCTURES; STRUCTURE ALIGNMENT; TOPOFIT METHOD; WEB SERVER; DATABASE; SEQUENCE; MODEL; CLASSIFICATION; INTERFACES; COMPLEX;
D O I
10.1093/nar/gkr348
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Comparing and classifying the three-dimensional (3D) structures of proteins is of crucial importance to molecular biology, from helping to determine the function of a protein to determining its evolutionary relationships. Traditionally, 3D structures are classified into groups of families that closely resemble the grouping according to their primary sequence. However, significant structural similarities exist at multiple levels between proteins that belong to these different structural families. In this study, we propose a new algorithm, CLICK, to capture such similarities. The method optimally superimposes a pair of protein structures independent of topology. Amino acid residues are represented by the Cartesian coordinates of a representative point (usually the C-alpha atom), side chain solvent accessibility, and secondary structure. Structural comparison is effected by matching cliques of points. CLICK was extensively benchmarked for alignment accuracy on four different sets: (i) 9537 pair-wise alignments between two structures with the same topology; (ii) 64 alignments from set (i) that were considered to constitute difficult alignment cases; (iii) 199 pair-wise alignments between proteins with similar structure but different topology; and (iv) 1275 pair-wise alignments of RNA structures. The accuracy of CLICK alignments was measured by the average structure overlap score and compared with other alignment methods, including HOMSTRAD, MUSTANG, Geometric Hashing, SALIGN, DALI, GANGSTA(+), FATCAT, ARTS and SARA. On average, CLICK produces pair-wise alignments that are either comparable or statistically significantly more accurate than all of these other methods. We have used CLICK to uncover relationships between (previously) unrelated proteins. These new biological insights include: (i) detecting hinge regions in proteins where domain or sub-domains show flexibility; (ii) discovering similar small molecule binding sites from proteins of different folds and (iii) discovering topological variants of known structural/sequence motifs. Our method can generally be applied to compare any pair of molecular structures represented in Cartesian coordinates as exemplified by the RNA structure superimposition benchmark.
引用
收藏
页数:16
相关论文
共 50 条
[11]   SWORD2: hierarchical analysis of protein 3D structures [J].
Cretin, Gabriel ;
Galochkina, Tatiana ;
Vander Meersche, Yann ;
de Brevern, Alexandre G. ;
Postic, Guillaume ;
Gelly, Jean-Christophe .
NUCLEIC ACIDS RESEARCH, 2022, 50 (W1) :W732-W738
[12]   GIANT: pattern analysis of molecular interactions in 3D structures of protein-small ligand complexes [J].
Kasahara, Kota ;
Kinoshita, Kengo .
BMC BIOINFORMATICS, 2014, 15
[13]   Predicting 3D protein structures in light of evolution [J].
Bershtein, Shimon ;
Kleiner, Daniel ;
Mishmar, Dan .
NATURE ECOLOGY & EVOLUTION, 2021, 5 (09) :1195-1198
[14]   Similarity Comparison of 3D Protein Structure Based on Riemannian Manifold [J].
Zhou Fengli ;
Lin Xiaoli .
INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT II, 2017, 10362 :387-397
[15]   Persistent homology reveals strong phylogenetic signal in 3D protein structures [J].
Bou Dagher, Lea ;
Madern, Dominique ;
Malbos, Philippe ;
Brochier-Armanet, Celine .
PNAS NEXUS, 2024, 3 (04)
[16]   Protein contact maps: A binary depiction of protein 3D structures [J].
Emerson, Isaac Arnold ;
Amala, Arumugam .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2017, 465 :782-791
[17]   ROC analysis: applications to the classification of biological sequences and 3D structures [J].
Sonego, Paolo ;
Kocsor, Andras ;
Pongor, Sandor .
BRIEFINGS IN BIOINFORMATICS, 2008, 9 (03) :198-209
[18]   SHEEP: A TOOL FOR DESCRIPTION OF β-SHEETS IN PROTEIN 3D STRUCTURES [J].
Aksianov, Evgeniy ;
Alexeevski, Andrei .
JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2012, 10 (02)
[19]   Geometrical comparison of two protein structures using Wigner-D functions [J].
Fathi, S. M. Saberi ;
White, Diana T. ;
Tuszynski, Jack A. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2014, 82 (10) :2756-2769
[20]   3D protein classification using topological, geometrical and biological information [J].
Tsatsaias, V. ;
Daras, P. ;
Strintzis, M. G. .
2007 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-7, 2007, :3333-+