BWM: A Novel, Provable, Ensemble-based Dynamic Programming Algorithm for Sparse Approximations of Computational Protein Design

被引:15
|
作者
Jou, Jonathan D. [1 ]
Jain, Swati [1 ,2 ,3 ,4 ]
Georgiev, Ivelin S. [1 ,6 ]
Donald, Bruce R. [1 ,2 ,5 ]
机构
[1] Duke Univ, Dept Comp Sci, Durham, NC 27708 USA
[2] Duke Univ, Med Ctr, Dept Biochem, Durham, NC 27708 USA
[3] Duke Univ, Dept Computat Biol, Durham, NC 27708 USA
[4] Duke Univ, Bioinformat Program, Durham, NC 27708 USA
[5] Duke Univ, Dept Chem, Durham, NC 27708 USA
[6] NIAID, Vaccine Res Ctr, NIH, 9000 Rockville Pike, Bethesda, MD 20892 USA
基金
美国国家卫生研究院;
关键词
protein design; sparse residue interaction graphs; provable algorithms; branch-decomposition; dynamic programming; ensemble-based algorithms; OSPREY; DEAD-END ELIMINATION; SIDE-CHAIN; GRAMICIDIN SYNTHETASE; SEARCH ALGORITHM; PREDICTION; REDESIGN; CONFORMATION; SPECIFICITY; RESISTANCE; POTENCY;
D O I
10.1089/cmb.2015.0194
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Sparse energy functions that ignore long range interactions between residue pairs are frequently used by protein design algorithms to reduce computational cost. Current dynamic programming algorithms that fully exploit the optimal substructure produced by these energy functions only compute the GMEC. This disproportionately favors the sequence of a single, static conformation and overlooks better binding sequences with multiple low-energy conformations. Provable, ensemble-based algorithms such as A* avoid this problem, but A* cannot guarantee better performance than exhaustive enumeration. We propose a novel, provable, dynamic programming algorithm called Branch-Width Minimization* (BWM*) to enumerate a gap-free ensemble of conformations in order of increasing energy. Given a branch-decomposition of branch-width w for an n-residue protein design with at most q discrete side-chain conformations per residue, BWM* returns the sparse GMEC in O(nw(2) q(3/2w)) time and enumerates each additional conformation in merely O(n log q) time. We define a new measure, Total Effective Search Space (TESS), which can be computed efficiently a priori before BWM* or A* is run. We ran BWM* on 67 protein design problems and found that TESS discriminated between BWM*-efficient and A*-efficient cases with 100% accuracy. As predicted by TESS and validated experimentally, BWM* outperforms A* in 73% of the cases and computes the full ensemble or a close approximation faster than A*, enumerating each additional conformation in milliseconds. Unlike A*, the performance of BWM* can be predicted in polynomial time before running the algorithm, which gives protein designers the power to choose the most efficient algorithm for their particular design problem.
引用
收藏
页码:413 / 424
页数:12
相关论文
共 50 条
  • [41] Hardware Accelerator Design for Dynamic-Programming-Based Protein Sequence Alignment with Affine Gap Tracebacks
    Lin, Mao-Jan
    Li, Yu-Cheng
    Lu, Yi-Chang
    2019 IEEE BIOMEDICAL CIRCUITS AND SYSTEMS CONFERENCE (BIOCAS 2019), 2019,
  • [42] Prediction of protein secondary structure based on residue pair types and conformational states using dynamic programming algorithm
    Sadeghi, M
    Parto, S
    Arab, S
    Ranjbar, B
    FEBS LETTERS, 2005, 579 (16) : 3397 - 3400
  • [43] A novel sparse behavioral model design method based on the global representative point selection and the randomized SVD algorithm
    Luo, Xin
    Li, Mingyu
    Jin, Yi
    Dai, Zhijiang
    Pang, Jingzhou
    Shi, Weimin
    AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS, 2024, 185
  • [44] Approximate Dynamic Programming Based Controller Design Using an Improved Learning Algorithm with Application to Tracking Control of Aircraft
    Luo, Xiong
    Zhou, Yuchao
    Sun, Zengqi
    PROCEEDINGS OF 2013 CHINESE INTELLIGENT AUTOMATION CONFERENCE: INTELLIGENT AUTOMATION & INTELLIGENT TECHNOLOGY AND SYSTEMS, 2013, 255 : 141 - 148
  • [45] A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
    Fili, Mohammad
    Hu, Guiping
    Han, Changze
    Kort, Alexa
    Trettin, John
    Haim, Hillel
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2023, 18 (01)
  • [46] A classification algorithm based on dynamic ensemble selection to predict mutational patterns of the envelope protein in HIV-infected patients
    Mohammad Fili
    Guiping Hu
    Changze Han
    Alexa Kort
    John Trettin
    Hillel Haim
    Algorithms for Molecular Biology, 18
  • [47] Design of Novel Dynamic March Algorithm Based on Memory Built-in Self-test
    Cai Z.
    Yu H.
    Yang H.
    Wang Z.
    Guo Y.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2023, 45 (09): : 3420 - 3429
  • [48] Multi-hop consensus time synchronization algorithm for sparse wireless sensor network: A distributed constraint-based dynamic programming approach
    Panigrahi, Niranjan
    Khilar, Pabitra Mohan
    AD HOC NETWORKS, 2017, 61 : 124 - 138
  • [49] Design of Dynamic Programming Model for Multi-Objective Cold Chain Logistics Deployment Path Based on Meme Algorithm
    Liu, Yinghuan
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF CIVIL ENGINEERING, 2022, 46 (03) : 2553 - 2560
  • [50] Design of Dynamic Programming Model for Multi-Objective Cold Chain Logistics Deployment Path Based on Meme Algorithm
    Yinghuan Liu
    Iranian Journal of Science and Technology, Transactions of Civil Engineering, 2022, 46 : 2553 - 2560