Computer-aided multi-objective optimization in small molecule discovery

被引:64
作者
Fromer, Jenna C. [1 ]
Coley, Connor W. [1 ,2 ]
机构
[1] MIT, Dept Chem Engn, Cambridge, MA 02139 USA
[2] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
来源
PATTERNS | 2023年 / 4卷 / 02期
关键词
GENETIC ALGORITHM; DRUG DISCOVERY; DESIGN; GENERATION; IMPLEMENTATION; SOLUBILITY; LIBRARIES; QUALITY; QSAR;
D O I
10.1016/j.patter.2023.100678
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Molecular discovery is a multi-objective optimization problem that requires identifying a molecule or set of molecules that balance multiple, often competing, properties. Multi-objective molecular design is commonly addressed by combining properties of interest into a single objective function using scalarization, which im-poses assumptions about relative importance and uncovers little about the trade-offs between objectives. In contrast to scalarization, Pareto optimization does not require knowledge of relative importance and reveals the trade-offs between objectives. However, it introduces additional considerations in algorithm design. In this review, we describe pool-based and de novo generative approaches to multi-objective molecular discov-ery with a focus on Pareto optimization algorithms. We show how pool-based molecular discovery is a rela-tively direct extension of multi-objective Bayesian optimization and how the plethora of different generative models extend from single-objective to multi-objective optimization in similar ways using non-dominated sorting in the reward function (reinforcement learning) or to select molecules for retraining (distribution learning) or propagation (genetic algorithms). Finally, we discuss some remaining challenges and opportu-nities in the field, emphasizing the opportunity to adopt Bayesian optimization techniques into multi -objec-tive de novo design.
引用
收藏
页数:17
相关论文
共 156 条
[1]  
Abbasi M., 2021, 29 EUROPEAN S ARTIFI
[2]  
Abeer ANMN, 2024, Arxiv, DOI [arXiv:2203.00526, 10.48550/arXiv.2203.00526]
[3]   Discovery of Energy Storage Molecular Materials Using Quantum Chemistry-Guided Multiobjective Bayesian Optimization [J].
Agarwal, Garvit ;
Doan, Hieu A. ;
Robertson, Lily A. ;
Zhang, Lu ;
Assary, Rajeev S. .
CHEMISTRY OF MATERIALS, 2021, 33 (20) :8133-8144
[4]   Deep learning and knowledge-based methods for computer-aided molecular design-toward a unified approach: State-of-the-art and future directions [J].
Alshehri, Abdulelah S. ;
Gani, Rafiqul ;
You, Fengqi .
COMPUTERS & CHEMICAL ENGINEERING, 2020, 141
[5]  
[Anonymous], 6 INT C
[6]  
[Anonymous], COMPUT AIDED GEOM D, V17, P445
[7]   MolGPT: Molecular Generation Using a Transformer-Decoder Model [J].
Bagal, Viraj ;
Aggarwal, Rishal ;
Vinod, P. K. ;
Priyakumar, U. Deva .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2022, 62 (09) :2064-2076
[8]   Multi-Property Molecular Optimization using an Integrated Poly-Cycle Architecture [J].
Barshatski, Guy ;
Nordon, Galia ;
Radinsky, Kira .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, :3727-3736
[9]   Molecular similarity: a key technique in molecular informatics [J].
Bender, A ;
Glen, RC .
ORGANIC & BIOMOLECULAR CHEMISTRY, 2004, 2 (22) :3204-3218
[10]   Generative models for molecular discovery: Recent advances and challenges [J].
Bilodeau, Camille ;
Jin, Wengong ;
Jaakkola, Tommi ;
Barzilay, Regina ;
Jensen, Klavs F. .
WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL MOLECULAR SCIENCE, 2022, 12 (05)