Machine Learning: A Suitable Method for Biocatalysis

被引:15
作者
Sampaio, Pedro Sousa [1 ]
Fernandes, Pedro [2 ,3 ,4 ,5 ]
机构
[1] Lusofona Univ, Computacao & Cognicao Centrada Pessoas, Campo Grande 376, P-1749024 Lisbon, Portugal
[2] Lusofona Univ ULHT, BioRG Biomed Res Grp, Campo Grande 376, P-1749024 Lisbon, Portugal
[3] Lusofona Univ ULHT, Fac Engn, Campo Grande 376, P-1749024 Lisbon, Portugal
[4] Univ Lisbon, Inst Super Tecn, iBB Inst Bioengn & Biosci, Av Rovisco Pais, P-1049001 Lisbon, Portugal
[5] Univ Lisbon, Inst Super Tecn, i4HB Inst Hlth & Bioecon, Associate Lab, Av Rovisco Pais, P-1049001 Lisbon, Portugal
关键词
machine learning; biocatalysis; engineered enzymes; enzyme formulations; NEURAL-NETWORK; ARTIFICIAL-INTELLIGENCE; ENZYME IMMOBILIZATION; BIOSYNTHETIC-ENZYMES; GAUSSIAN PROCESS; PREDICTION; DESIGN; PROTEINS; SEQUENCE; OPTIMIZATION;
D O I
10.3390/catal13060961
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
Biocatalysis is currently a workhorse used to produce a wide array of compounds, from bulk to fine chemicals, in a green and sustainable manner. The success of biocatalysis is largely thanks to an enlargement of the feasible chemical reaction toolbox. This materialized due to major advances in enzyme screening tools and methods, together with high-throughput laboratory techniques for biocatalyst optimization through enzyme engineering. Therefore, enzyme-related knowledge has significantly increased. To handle the large number of data now available, computational approaches have been gaining relevance in biocatalysis, among them machine learning methods (MLMs). MLMs use data and algorithms to learn and improve from experience automatically. This review intends to briefly highlight the contribution of biocatalysis within biochemical engineering and bioprocesses and to present the key aspects of MLMs currently used within the scope of biocatalysis and related fields, mostly with readers non-skilled in MLMs in mind. Accordingly, a brief overview and the basic concepts underlying MLMs are presented. This is complemented with the basic steps to build a machine learning model and followed by insights into the types of algorithms used to intelligently analyse data, identify patterns and develop realistic applications in biochemical engineering and bioprocesses. Notwithstanding, and given the scope of this review, some recent illustrative examples of MLMs in protein engineering, enzyme production, biocatalyst formulation and enzyme screening are provided, and future developments are suggested. Overall, it is envisaged that the present review will provide insights into MLMs and how these are major assets for more efficient biocatalysis.
引用
收藏
页数:26
相关论文
共 255 条
[1]   PES-Learn: An Open-Source Software Package for the Automated Generation of Machine Learning Models of Molecular Potential Energy Surfaces [J].
Abbott, Adam S. ;
Turney, Justin M. ;
Zhang, Boyi ;
Smith, Daniel G. A. ;
Altarawy, Doaa ;
Schaefer, Henry F., III .
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2019, 15 (08) :4386-4398
[2]  
Abraham G K., Computational Intelligence and Its Applications in Healthcare, V2020, P145, DOI [DOI 10.1016/B978-0-12-820604-1.00010-8, 10.1016/B978-0-12-820604-1.00010-8]
[3]   Protein secondary structure prediction (PSSP) using different machine algorithms [J].
Afify, Heba M. ;
Abdelhalim, Mohamed B. ;
Mabrouk, Mai S. ;
Sayed, Ahmed Y. .
EGYPTIAN JOURNAL OF MEDICAL HUMAN GENETICS, 2021, 22 (01)
[4]   Basic concepts of artificial neural network (ANN) modeling and its application in pharmaceutical research [J].
Agatonovic-Kustrin, S ;
Beresford, R .
JOURNAL OF PHARMACEUTICAL AND BIOMEDICAL ANALYSIS, 2000, 22 (05) :717-727
[5]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[6]   Unified rational protein engineering with sequence-based deep representation learning [J].
Alley, Ethan C. ;
Khimulya, Grigory ;
Biswas, Surojit ;
AlQuraishi, Mohammed ;
Church, George M. .
NATURE METHODS, 2019, 16 (12) :1315-+
[7]   Enzyme immobilization: what have we learned in the past five years? [J].
Almeida, Francisco L. C. ;
Prata, Ana S. ;
Forte, Marcus B. S. .
BIOFUELS BIOPRODUCTS & BIOREFINING-BIOFPR, 2022, 16 (02) :587-608
[8]   Kinetic models in industrial biotechnology - Improving cell factory performance [J].
Almquist, Joachim ;
Cvijovic, Marija ;
Hatzimanikatis, Vassily ;
Nielsen, Jens ;
Jirstrand, Mats .
METABOLIC ENGINEERING, 2014, 24 :38-60
[9]   Genetically engineered proteins with two active sites for enhanced biocatalysis and synergistic chemo- and biocatalysis [J].
Alonso, Sandra ;
Santiago, Gerard ;
Cea-Rama, Isabel ;
Fernandez-Lopez, Laura ;
Coscolin, Cristina ;
Modregger, Jan ;
Ressmann, Anna K. ;
Martinez-Martinez, Monica ;
Marrero, Helena ;
Bargiela, Rafael ;
Pita, Marcos ;
Gonzalez-Alfonso, Jose L. ;
Briand, Manon L. ;
Rojo, David ;
Barbas, Coral ;
Plou, Francisco J. ;
Golyshin, Peter N. ;
Shahgaldian, Patrick ;
Sanz-Aparicio, Julia ;
Guallar, Victor ;
Ferrer, Manuel .
NATURE CATALYSIS, 2020, 3 (03) :319-328
[10]   Omics-Driven Biotechnology for Industrial Applications [J].
Amer, Bashar ;
Baidoo, Edward E. K. .
FRONTIERS IN BIOENGINEERING AND BIOTECHNOLOGY, 2021, 9