Replicates in Biocatalysis Experiments: Machine Learning for Enzyme Cascade Optimization

被引:1
作者
Siedentop, Regine [1 ]
Siska, Maximilian [2 ]
Hermes, Johanna [1 ]
Luetz, Stephan [1 ]
von Lieres, Eric [2 ,3 ]
Rosenthal, Katrin [4 ]
机构
[1] TU Dortmund Univ, Dept Biochem & Chem Engn, Emil Figge Str 66, D-44227 Dortmund, Germany
[2] Forschungszentrum Julich, Inst Bioand Geosci, Wilhelm Johnen Str, D-52428 Julich, Germany
[3] Rhein Westfal TH Aachen, Computat Syst Biotechnol, Forckenbeckstr 51, D-52074 Aachen, Germany
[4] Constructor Univ, Sch Sci, Campus Ring 6, D-28759 Bremen, Germany
关键词
ATP regeneration; Bayesian optimization; Biocatalysis; Gaussian process regression; Machine learning; ATP REGENERATION; BAYESIAN OPTIMIZATION; POLYPHOSPHATE KINASE; GAUSSIAN-PROCESSES; DESIGN;
D O I
10.1002/cctc.202400777
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
The optimization of enzyme cascades is a complex and resource-demanding task due to the multitude of parameters and synergistic effects involved. Machine learning can support the identification of optimal reaction conditions, for example, in the case of Bayesian optimization (BO), by proposing new experiments based on Gaussian process regression (GPR) and expected improvement (EI). Here, in this research BO is used to optimize the concentrations of the reaction components of an enzyme cascade. The productivity-cost-ratio is chosen as the optimization objective in order to achieve the highest possible productivity, which was normalized to the costs of the materials used to prevent convergence to ever-increasing enzyme concentrations. To reduce the experimental effort, contrary to common practice in biological experiments, replicates were not used; instead, the algorithm's proposed experiments and inherent uncertainty quantification were relied upon. This approach balances parameter space exploration and exploitation, which is critical for the efficient and effective identification of optimal reaction conditions. At the optimized reaction conditions identified in this study, the productivity-cost ratio is doubled to 38.6 mmol L-1 h-1 <euro>-1 compared to a reference experiment. The parameter optimization required only 52 experiments while being robust to outlying experimental results. An in vitro enzyme cascade for mevalonate phosphorylation and ATP regeneration was optimized for productivity and productivity-cost-ratio using Bayesian optimization (BO) for the proposal of new experiments with an expected improvement. In four BO rounds, the optimal compound concentrations were found to reach an optimum, while avoiding replicated experiments and being robust to outlying experimental results. Institute and/or researcher Twitter usernames: @Luetz_Lab: @RosenthalKatrin; @ericvonlieres. image
引用
收藏
页数:13
相关论文
共 68 条
  • [1] PyMC: a modern, and comprehensive probabilistic programming framework in Python']Python
    Abril-Pla, Oriol
    Andreani, Virgile
    Carroll, Colin
    Dong, Larry
    Fonnesbeck, Christopher J.
    Kochurov, Maxim
    Kumar, Ravin
    Lao, Junpeng
    Luhmann, Christian C.
    Martin, Osvaldo A.
    Osthege, Michael
    Vieira, Ricardo
    Wiecki, Thomas
    Zinkov, Robert
    [J]. PEERJ COMPUTER SCIENCE, 2023, 9
  • [2] Alim GS, 2023, ADV BIOCHEM ENG BIOT, V186, P29, DOI 10.1007/10_2023_222
  • [3] Emerging Enzymes for ATP Regeneration in Biocatalytic Processes
    Andexer, Jennifer N.
    Richter, Michael
    [J]. CHEMBIOCHEM, 2015, 16 (03) : 380 - 386
  • [4] [Anonymous], COLORMAP REFERENCE M
  • [5] Directed Evolution: Bringing New Chemistry to Life
    Arnold, Frances H.
    [J]. ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2018, 57 (16) : 4143 - 4148
  • [6] Multistep enzyme cascades as a route towards green and sustainable pharmaceutical syntheses
    Benitez-Mateos, Ana, I
    Padrosa, David Roura
    Paradisi, Francesca
    [J]. NATURE CHEMISTRY, 2022, 14 (05) : 489 - 499
  • [7] Dynamic modeling and optimization of sustainable algal production with uncertainty using multivariate Gaussian processes
    Bradford, Eric
    Schweidtmann, Artur M.
    Zhang, Dongda
    Jing, Keju
    del Rio-Chanona, Ehecatl Antonio
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 2018, 118 : 143 - 158
  • [8] Efficient multiobjective optimization employing Gaussian processes, spectral sampling and a genetic algorithm
    Bradford, Eric
    Schweidtmann, Artur M.
    Lapkin, Alexei
    [J]. JOURNAL OF GLOBAL OPTIMIZATION, 2018, 71 (02) : 407 - 438
  • [9] Greed Is Good: Exploration and Exploitation Trade-offs in Bayesian Optimisation
    De Ath G.
    Everson R.M.
    Rahat A.A.M.
    Fieldsend J.E.
    [J]. ACM Transactions on Evolutionary Learning and Optimization, 2021, 1 (01):
  • [10] Active Learning and Bayesian Optimization: A Unified Perspective to Learn with a Goal
    Di Fiore, Francesco
    Nardelli, Michela
    Mainini, Laura
    [J]. ARCHIVES OF COMPUTATIONAL METHODS IN ENGINEERING, 2024, 31 (05) : 2985 - 3013