Protein fold recognition by prediction-based threading

被引:204
作者
Rost, B [1 ]
Schneider, R [1 ]
Sander, C [1 ]
机构
[1] EBI, CAMBRIDGE CB10 1RQ, ENGLAND
关键词
protein structure prediction; threading; remote homology detection; fold recognition; secondary structure;
D O I
10.1006/jmbi.1997.1101
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In fold recognition by threading one takes the amino acid sequence of a protein and evaluates how well it fits into one of the known three-dimensional (3D) protein structures. The quality of sequence-structure fit is typically evaluated using inter-residue potentials of mean force or other statistical parameters. Here, we present an alternative approach to evaluating sequence-structure fitness. Starting from the amino acid sequence we first predict secondary structure and solvent accessibility for each residue. We then thread the resulting one-dimensional (1D) profile of predicted structure assignments into each of the known 3D structures. The optimal threading for each sequence-structure pair is obtained using dynamic programming. The overall best sequence-structure pair constitutes the predicted 3D structure for the input sequence. The method is fine-tuned by adding information from direct sequence-sequence comparison and applying a series of empirical filters. Although the method relies on reduction of 3D information into 1D structure profiles, its accuracy is, surprisingly, not clearly inferior to methods based on evaluation of residue interactions in 3D. We therefore hypothesise that existing 1D-3D threading methods essentially do not capture more than the fitness of an amino acid sequence for a particular 1D succession of secondary structure segments and residue solvent accessibility. The prediction-based threading method on average finds any structurally homologous region at first rank in 29% of the cases (including sequence information). For the 22% first hits detected at highest scores, the expected accuracy rose to 75%. However, the task of detecting entire folds rather than homologous fragments was managed much better; 45 to 75% of the first hits correctly recognised the fold. (C) 1997 Academic Press Limited.
引用
收藏
页码:471 / 480
页数:10
相关论文
共 50 条
[41]   ORION: a web server for protein fold recognition and structure prediction using evolutionary hybrid profiles [J].
Ghouzam, Yassine ;
Postic, Guillaume ;
Guerin, Pierre-Edouard ;
de Brevern, Alexandre G. ;
Gelly, Jean-Christophe .
SCIENTIFIC REPORTS, 2016, 6
[42]   Sequence-based recognition of protein folds using the threading method and frameworks of globular proteins [J].
Rykunov, DS ;
Lobanov, MY ;
Finkelstein, AV .
MOLECULAR BIOLOGY, 1998, 32 (03) :428-438
[43]   Protein fold recognition by total alignment probability [J].
Bienkowska, JR ;
Yu, LH ;
Zarakhovich, S ;
Rogers, RG ;
Smith, TF .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2000, 40 (03) :451-462
[44]   Classification Modeling and Recognition of Protein Fold Type [J].
Liu Yue ;
Li Xiao-Qin ;
Xu Hai-Song ;
Qiao Hui .
ACTA PHYSICO-CHIMICA SINICA, 2009, 25 (12) :2558-2564
[45]   Improving protein fold recognition by random forest [J].
Taeho Jo ;
Jianlin Cheng .
BMC Bioinformatics, 15
[46]   Classification and recognition of Rossmann-fold protein [J].
Liu, Yue ;
Li, Xiaoqin ;
Xu, Haisong ;
Qiao, Hui .
FBIE: 2008 INTERNATIONAL SEMINAR ON FUTURE BIOMEDICAL INFORMATION ENGINEERING, PROCEEDINGS, 2008, :78-81
[47]   Protein Fold Recognition by Functional Domain Composition [J].
Yan Jin-Li ;
Chen Zhi-Wei ;
Xu Hai-Song ;
Li Xiao-Qin .
PROGRESS IN BIOCHEMISTRY AND BIOPHYSICS, 2011, 38 (02) :166-172
[48]   Mining sequential patterns for protein fold recognition [J].
Exarchos, Themis P. ;
Papaloukas, Costas ;
Lampros, Christos ;
Fotiadis, Dimitrios I. .
JOURNAL OF BIOMEDICAL INFORMATICS, 2008, 41 (01) :165-179
[49]   DescFold: A web server for protein fold recognition [J].
Ren-Xiang Yan ;
Jing-Na Si ;
Chuan Wang ;
Ziding Zhang .
BMC Bioinformatics, 10
[50]   DescFold: A web server for protein fold recognition [J].
Yan, Ren-Xiang ;
Si, Jing-Na ;
Wang, Chuan ;
Zhang, Ziding .
BMC BIOINFORMATICS, 2009, 10