Eigen THREADER: analogous protein fold recognition by efficient contact map threading

被引:42
作者
Buchan, Daniel W. A. [1 ]
Jones, David T. [1 ]
机构
[1] UCL, Dept Comp Sci, Gower St, London WC1E 6BT, England
基金
英国生物技术与生命科学研究理事会;
关键词
STRUCTURE PREDICTION; SEQUENCE; ALIGNMENT; COEVOLUTION;
D O I
10.1093/bioinformatics/btx217
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Protein fold recognition when appropriate, evolutionarily-related, structural templates can be identified is often trivial and may even be viewed as a solved problem. However in cases where no homologous structural templates can be detected, fold recognition is a notoriously difficult problem (Moult et al., 2014). Here we present EigenTHREADER, a novel fold recognition method capable of identifying folds where no homologous structures can be identified. EigenTHREADER takes a query amino acid sequence, generates a map of intra-residue contacts, and then searches a library of contact maps of known structures. To allow the contact maps to be compared, we use eigenvector decomposition to resolve the principal eigenvectors these can then be aligned using standard dynamic programming algorithms. The approach is similar to the AlEigen approach of Di Lena et al. (2010), but with improvements made both to speed and accuracy. With this search strategy, EigenTHREADER does not depend directly on sequence homology between the target protein and entries in the fold library to generate models. This in turn enables EigenTHREADER to correctly identify analogous folds where little or no sequence homology information is. Results: EigenTHREADER outperforms well-established fold recognition methods such as pGenTHREADER and HHSearch in terms of True Positive Rate in the difficult task of analogous fold recognition. This should allow template-based modelling to be extended to many new protein families that were previously intractable to homology based fold recognition methods.
引用
收藏
页码:2684 / 2690
页数:7
相关论文
共 31 条
[1]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[2]  
Creighton T., 1992, PROTEINS STRUCTURES
[3]   Fast overlapping of protein contact maps by alignment of eigenvectors [J].
Di Lena, Pietro ;
Fariselli, Piero ;
Margara, Luciano ;
Vassura, Marco ;
Casadio, Rita .
BIOINFORMATICS, 2010, 26 (18) :2250-2258
[4]   BioShell-Threading: versatile Monte Carlo package for protein 3D threading [J].
Gniewek, Pawel ;
Kolinski, Andrzej ;
Kloczkowski, Andrzej ;
Gront, Dominik .
BMC BIOINFORMATICS, 2014, 15
[5]  
GOLDMAN D, 1999, FOCS 99 P 40 ANN S F, P512, DOI DOI 10.1109/SFFCS.1999.814624
[6]   MetaPSICOV: combining coevolution methods for accurate prediction of contacts and long range hydrogen bonding in proteins [J].
Jones, David T. ;
Singh, Tanya ;
Kosciolek, Tomasz ;
Tetchner, Stuart .
BIOINFORMATICS, 2015, 31 (07) :999-1006
[7]   PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments [J].
Jones, David T. ;
Buchan, Daniel W. A. ;
Cozzetto, Domenico ;
Pontil, Massimiliano .
BIOINFORMATICS, 2012, 28 (02) :184-190
[8]   A NEW APPROACH TO PROTEIN FOLD RECOGNITION [J].
JONES, DT ;
TAYLOR, WR ;
THORNTON, JM .
NATURE, 1992, 358 (6381) :86-89
[9]   FreeContact: fast and free software for protein contact prediction from residue co-evolution [J].
Kajan, Laszlo ;
Hopf, Thomas A. ;
Kalas, Matus ;
Marks, Debora S. ;
Rost, Burkhard .
BMC BIOINFORMATICS, 2014, 15
[10]   Assessment of CASP11 contact-assisted predictions [J].
Kinch, Lisa N. ;
Li, Wenlin ;
Monastyrskyy, Bohdan ;
Kryshtafovych, Andriy ;
Grishin, Nick V. .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2016, 84 :164-180