SwiftLib: rapid degenerate-codon-library optimization through dynamic programming

被引:42
作者
Jacobs, Timothy M. [1 ]
Yumerefendi, Hayretin [1 ]
Kuhlman, Brian [1 ,2 ]
Leaver-Fay, Andrew [1 ]
机构
[1] Univ N Carolina, Dept Biochem, Chapel Hill, NC 27599 USA
[2] Univ N Carolina, Lineberger Comprehens Canc Ctr, Chapel Hill, NC 27599 USA
基金
美国国家卫生研究院;
关键词
FLUORESCENT PROTEINS; COMPUTATIONAL DESIGN; DIRECTED EVOLUTION; GENE LIBRARIES; DIVERSITY; SPECIFICITY; MUTAGENESIS; ALGORITHMS; SEQUENCES; AFFINITY;
D O I
10.1093/nar/gku1323
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Degenerate codon (DC) libraries efficiently address the experimental library-size limitations of directed evolution by focusing diversity toward the positions and toward the amino acids (AAs) that are most likely to generate hits; however, manually constructing DC libraries is challenging, error prone and time consuming. This paper provides a dynamic programming solution to the task of finding the best DCs while keeping the size of the library beneath some given limit, improving on the existing integer-linear programming formulation. It then extends the algorithm to consider multiple DCs at each position, a heretofore unsolved problem, while adhering to a constraint on the number of primers needed to synthesize the library. In the two library-design problems examined here, the use of multiple DCs produces libraries that very nearly cover the set of desired AAs while still staying within the experimental size limits. Surprisingly, the algorithm is able to find near-perfect libraries where the ratio of amino-acid sequences to nucleic-acid sequences approaches 1; it effectively side-steps the degeneracy of the genetic code. Our algorithm is freely available through our web server and solves most design problems in about a second.
引用
收藏
页数:10
相关论文
共 46 条
[1]   Experimental library screening demonstrates the successful application of computational protein design to large structural ensembles [J].
Allen, Benjamin D. ;
Nisthal, Alex ;
Mayo, Stephen L. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (46) :19838-19843
[2]  
[Anonymous], 1972, Complexity of Computer Computations, DOI [10.1007/978-3-540-68279-0-8, DOI 10.1007/978-1-4684-2001-2]
[3]  
ARKIN AP, 1992, NAT BIOTECHNOL, V10, P297
[4]   Computation-Guided Backbone Grafting of a Discontinuous Motif onto a Protein Scaffold [J].
Azoitei, Mihai L. ;
Correia, Bruno E. ;
Ban, Yih-En Andrew ;
Carrico, Chris ;
Kalyuzhniy, Oleksandr ;
Chen, Lei ;
Schroeter, Alexandria ;
Huang, Po-Ssu ;
McLellan, Jason S. ;
Kwong, Peter D. ;
Baker, David ;
Strong, Roland K. ;
Schief, William R. .
SCIENCE, 2011, 334 (6054) :373-376
[5]  
Bellman RE., 1957, Dynamic Programming
[6]   Precision is essential for efficient catalysis in an evolved Kemp eliminase [J].
Blomberg, Rebecca ;
Kries, Hajo ;
Pinkas, Daniel M. ;
Mittl, Peer R. E. ;
Gruetter, Markus G. ;
Privett, Heidi K. ;
Mayo, Stephen L. ;
Hilvert, Donald .
NATURE, 2013, 503 (7476) :418-+
[7]   Structure-Based Redesign of the Binding Specificity of Anti-Apoptotic Bcl-xL [J].
Chen, T. Scott ;
Palacios, Hector ;
Keating, Amy E. .
JOURNAL OF MOLECULAR BIOLOGY, 2013, 425 (01) :171-185
[8]   Generation of longer emission wavelength red fluorescent proteins using computationally designed libraries [J].
Chica, Roberto A. ;
Moore, Matthew M. ;
Allen, Benjamin D. ;
Mayo, Stephen L. .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (47) :20257-20262
[9]   Optimizing nucleotide sequence ensembles for combinatorial protein libraries using a genetic algorithm [J].
Craig, Roger A. ;
Lu, Jin ;
Luo, Jinquan ;
Shi, Lei ;
Liao, Li .
NUCLEIC ACIDS RESEARCH, 2010, 38 (02) :e10.1-e10.9
[10]   Macromolecular modeling with Rosetta [J].
Das, Rhiju ;
Baker, David .
ANNUAL REVIEW OF BIOCHEMISTRY, 2008, 77 :363-382