Structural search and retrieval using a tableau representation of protein folding patterns

被引:23
|
作者
Konagurthu, Arun S. [1 ,2 ]
Stuckey, Peter J. [3 ,4 ]
Lesk, Arthur M. [1 ,2 ]
机构
[1] Penn State Univ, Dept Biochem & Mol Biol, University Pk, PA 16802 USA
[2] Penn State Univ, Huck Inst Genom Proteom & Bioinformat, University Pk, PA 16802 USA
[3] Univ Melbourne, Dept Comp Sci & Software Engn, Melbourne, Vic 3010, Australia
[4] Univ Melbourne, NICTA Victoria Labs, Melbourne, Vic 3010, Australia
关键词
D O I
10.1093/bioinformatics/btm641
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Comparison and classification of folding patterns from a database of protein structures is crucial to understand the principles of protein architecture, evolution and function. Current search methods for proteins with similar folding patterns are slow and computationally intensive. The sharp growth in the number of known protein structures poses severe challenges for methods of structural comparison. There is a need for methods that can search the database of structures accurately and rapidly. We provide several methods to search for similar folding patterns using a concise tableau representation of proteins that encodes the relative geometry of secondary structural elements. Our first approach allows the extraction of identical and very closely-related protein folding patterns in constant-time (per hit). Next, we address the hard computational problem of extraction of maximally-similar subtableaux, when comparing two tableaux. We solve the problem using Quadratic and Linear integer programming formulations and demonstrate their power to identify subtle structural similarities, especially when protein structures significantly diverge. Finally, we describe a rapid and accurate method for comparing a query structure against a database of protein domains, TableauSearch. TableauSearch is rapid enough to search the entire structural database in seconds on a standard desktop computer. Our analysis of TableauSearch on many queries shows that the method is very accurate in identifying similarities of folding patterns, even between distantly related proteins.
引用
收藏
页码:645 / 651
页数:7
相关论文
共 50 条
  • [41] Protein structural codes and nucleation sites for protein folding
    Jiang Fan
    Li Nan
    CHINESE PHYSICS, 2007, 16 (02): : 392 - 404
  • [42] On Representing Protein Folding Patterns Using Non-Linear Parametric Curves
    Kasarapu, Parthan
    de la Banda, Maria Garcia
    Konagurthu, Arun S.
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2014, 11 (06) : 1218 - 1228
  • [43] Protein modeling with reduced representation: statistical potentials and protein folding mechanism
    Ekonomiuk, D
    Kielbasinski, M
    Kolinski, A
    ACTA BIOCHIMICA POLONICA, 2005, 52 (04) : 741 - 748
  • [44] From propensities to patterns to principles in protein folding
    Rose, George D.
    PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2025, 93 (01) : 105 - 111
  • [46] MEDICAL IMAGE SEARCH AND RETRIEVAL USING LOCAL BINARY PATTERNS AND KLT FEATURE POINTS
    Unay, Devrim
    Ekin, Ahmet
    Jasinschi, Radu
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 997 - 1000
  • [47] Medical Image Search and Retrieval using Local Binary Patterns and KLT Feature Points
    Unay, Devrim
    Ekin, Ahmet
    Jasinschi, Radu S.
    2009 IEEE 17TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, VOLS 1 AND 2, 2009, : 279 - 282
  • [48] 3D Similarity Search Using a Weighted Structural Histogram Representation
    Lu, Tong
    Gao, Rongjun
    Wang, Tuantuan
    Yang, Yubin
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING-PCM 2010, PT I, 2010, 6297 : 348 - 356
  • [49] EXAMINATION OF FOLDING PATTERNS FOR PREDICTING PROTEIN TOPOLOGIES
    BUSETTA, B
    BIOCHIMICA ET BIOPHYSICA ACTA, 1986, 870 (02) : 327 - 338
  • [50] Prediction of folding patterns for intrinsic disordered protein
    Yang J.
    Cheng W.-X.
    Wu G.
    Sheng S.
    Zhang P.
    Scientific Reports, 13 (1)