Improving protein structure prediction with model-based search

被引:16
作者
Brunette, TJ [1 ]
Brock, O [1 ]
机构
[1] Univ Massachusetts, Bioinformat Res Lab, Dept Comp Sci, Amherst, MA 01003 USA
关键词
D O I
10.1093/bioinformatics/bti1029
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: De novo protein structure prediction can be formulated as search in a high-dimensional space. One of the most frequently used computational tools to solve such search problems is the Monte Carlo method. We present a novel search technique, called model-based search. This method samples the high-dimensional search space to build an approximate model of the underlying function. This model is incrementally refined in areas of interest, whereas areas that are not of interest are excluded from further exploration. Model-based search derives its efficiency from the fact that the information obtained during the exploration of the search space is used to guide further exploration. In contrast, Monte Carlo-based techniques lack memory and exploration is performed based on random walks, ignoring the information obtained in previous steps. Results: Model-based search is applied to protein structure prediction, where search is employed to find the global minimum of the protein's energy landscape. We show that model-based search uses computational resources more efficiently to find lower-energy conformations of proteins than one of the leading protein structure prediction methods, which relies on a tailored Monte Carlo method to perform a search. The performance improvements become more pronounced as the dimensionality of the search problem increases. We argue that model-based search will enable more accurate protein structure prediction than was previously possible. Furthermore, we believe that similar performance improvements can be expected in other problems that are currently solved using Monte Carlo-based search methods.
引用
收藏
页码:I66 / I74
页数:9
相关论文
共 25 条
[1]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[2]  
[Anonymous], 1997, Tabu Search
[3]   MULTICANONICAL ENSEMBLE - A NEW APPROACH TO SIMULATE 1ST-ORDER PHASE-TRANSITIONS [J].
BERG, BA ;
NEUHAUS, T .
PHYSICAL REVIEW LETTERS, 1992, 68 (01) :9-12
[4]   Rosetta predictions in CASP5: Successes, failures, and prospects for complete automation [J].
Bradley, P ;
Chivian, D ;
Meiler, J ;
Misura, KMS ;
Rohl, CA ;
Schief, WR ;
Wedemeyer, WJ ;
Schueler-Furman, O ;
Murphy, P ;
Schonbrun, J ;
Strauss, CEM ;
Baker, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :457-468
[5]  
BURNS B, 2005, P IEEE INT C ROB AUT
[6]   Active learning with statistical models [J].
Cohn, DA ;
Ghahramani, Z ;
Jordan, MI .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :129-145
[7]   REDUCING QUASI-ERGODIC BEHAVIOR IN MONTE-CARLO SIMULATIONS BY J-WALKING - APPLICATIONS TO ATOMIC CLUSTERS [J].
FRANTZ, DD ;
FREEMAN, DL ;
DOLL, JD .
JOURNAL OF CHEMICAL PHYSICS, 1990, 93 (04) :2769-2784
[8]   Parallel tempering algorithm for conformational studies of biological molecules [J].
Hansmann, UHE .
CHEMICAL PHYSICS LETTERS, 1997, 281 (1-3) :140-150
[9]  
HOLLAND J, 1975, ARTIF INTELL, V36, P177
[10]  
Hubbard TJP, 1999, PROTEINS, P15