BWM: A Novel, Provable, Ensemble-based Dynamic Programming Algorithm for Sparse Approximations of Computational Protein Design

被引:15
|
作者
Jou, Jonathan D. [1 ]
Jain, Swati [1 ,2 ,3 ,4 ]
Georgiev, Ivelin S. [1 ,6 ]
Donald, Bruce R. [1 ,2 ,5 ]
机构
[1] Duke Univ, Dept Comp Sci, Durham, NC 27708 USA
[2] Duke Univ, Med Ctr, Dept Biochem, Durham, NC 27708 USA
[3] Duke Univ, Dept Computat Biol, Durham, NC 27708 USA
[4] Duke Univ, Bioinformat Program, Durham, NC 27708 USA
[5] Duke Univ, Dept Chem, Durham, NC 27708 USA
[6] NIAID, Vaccine Res Ctr, NIH, 9000 Rockville Pike, Bethesda, MD 20892 USA
基金
美国国家卫生研究院;
关键词
protein design; sparse residue interaction graphs; provable algorithms; branch-decomposition; dynamic programming; ensemble-based algorithms; OSPREY; DEAD-END ELIMINATION; SIDE-CHAIN; GRAMICIDIN SYNTHETASE; SEARCH ALGORITHM; PREDICTION; REDESIGN; CONFORMATION; SPECIFICITY; RESISTANCE; POTENCY;
D O I
10.1089/cmb.2015.0194
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Sparse energy functions that ignore long range interactions between residue pairs are frequently used by protein design algorithms to reduce computational cost. Current dynamic programming algorithms that fully exploit the optimal substructure produced by these energy functions only compute the GMEC. This disproportionately favors the sequence of a single, static conformation and overlooks better binding sequences with multiple low-energy conformations. Provable, ensemble-based algorithms such as A* avoid this problem, but A* cannot guarantee better performance than exhaustive enumeration. We propose a novel, provable, dynamic programming algorithm called Branch-Width Minimization* (BWM*) to enumerate a gap-free ensemble of conformations in order of increasing energy. Given a branch-decomposition of branch-width w for an n-residue protein design with at most q discrete side-chain conformations per residue, BWM* returns the sparse GMEC in O(nw(2) q(3/2w)) time and enumerates each additional conformation in merely O(n log q) time. We define a new measure, Total Effective Search Space (TESS), which can be computed efficiently a priori before BWM* or A* is run. We ran BWM* on 67 protein design problems and found that TESS discriminated between BWM*-efficient and A*-efficient cases with 100% accuracy. As predicted by TESS and validated experimentally, BWM* outperforms A* in 73% of the cases and computes the full ensemble or a close approximation faster than A*, enumerating each additional conformation in milliseconds. Unlike A*, the performance of BWM* can be predicted in polynomial time before running the algorithm, which gives protein designers the power to choose the most efficient algorithm for their particular design problem.
引用
收藏
页码:413 / 424
页数:12
相关论文
共 50 条
  • [1] BWM*: A Novel, Provable, Ensemble-Based Dynamic Programming Algorithm for Sparse Approximations of Computational Protein Design
    Jou, Jonathan D.
    Jain, Swati
    Georgiev, Ivelin
    Donald, Bruce R.
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY (RECOMB 2015), 2015, 9029 : 154 - 166
  • [2] Design of Protein-Protein Interactions with a Novel Ensemble-Based Scoring Algorithm
    Roberts, Kyle E.
    Cushing, Patrick R.
    Boisguerin, Prisca
    Madden, Dean R.
    Donald, Bruce R.
    RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, 2011, 6577 : 361 - +
  • [3] Novel, provable algorithms for efficient ensemble-based computational protein design and their application to the redesign of the c-Raf-RBD:KRas protein-protein interface
    Lowegard, Anna U.
    Frenkel, Marcel S.
    Holt, Graham T.
    Jou, Jonathan D.
    Ojewole, Adegoke A.
    Donald, Bruce R.
    PLOS COMPUTATIONAL BIOLOGY, 2020, 16 (06)
  • [4] Minimization-Aware Recursive K*: A Novel, Provable Algorithm that Accelerates Ensemble-Based Protein Design and Provably Approximates the Energy Landscape
    Jou, Jonathan D.
    Holt, Graham T.
    Lowegard, Anna U.
    Donald, Bruce R.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2020, 27 (04) : 550 - 564
  • [5] BBK* (Branch and Bound Over K*): A Provable and Efficient Ensemble-Based Protein Design Algorithm to Optimize Stability and Binding Affinity Over Large Sequence Spaces
    Ojewole, Adegoke A.
    Jou, Jonathan D.
    Fowler, Vance G.
    Donald, Bruce R.
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2018, 25 (07) : 726 - 739
  • [6] Computational Prediction of Cervical Cancer Diagnosis Using Ensemble-Based Classification Algorithm
    Gupta, Surbhi
    Gupta, Manoj K.
    COMPUTER JOURNAL, 2022, 65 (06): : 1527 - 1539
  • [7] A Novel Clusterer Ensemble Algorithm Based on Dynamic Cooperation
    Kang, Kai
    Zhang, Hua-Xiang
    Fan, Ying
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 1, PROCEEDINGS, 2008, : 32 - 35
  • [8] Component Ensemble-based UML/MARTE Extensions for the Design of Dynamic Cyber-Physical Systems
    Fredj, Nissaf
    Kacem, Yessine Hadj
    Kanoun, Olfa
    Abid, Mohamed
    PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON SOFTWARE TECHNOLOGIES (ICSOFT), 2021, : 158 - 166
  • [9] An efficient ordering-based ensemble pruning algorithm via dynamic programming
    Dai, Qun
    Han, Xiaomeng
    APPLIED INTELLIGENCE, 2016, 44 (04) : 816 - 830
  • [10] An efficient ordering-based ensemble pruning algorithm via dynamic programming
    Qun Dai
    Xiaomeng Han
    Applied Intelligence, 2016, 44 : 816 - 830