Computer-Aided Synthesis Planning (CASP) and Machine Learning: Optimizing Chemical Reaction Conditions

被引:5
作者
Han, Yu [1 ]
Deng, Mingjing [1 ]
Liu, Ke [1 ]
Chen, Jia [1 ]
Wang, Yuting [1 ]
Xu, Yu-Ning [1 ]
Dian, Longyang [1 ,2 ]
机构
[1] Shandong Univ, Inst Microbial Technol, State Key Lab Microbial Technol, 72 Binhai Ave, Qingdao 266237, Peoples R China
[2] Shandong Univ, Suzhou Inst, 388 Ruoshui Rd,Suzhou Ind Pk, Suzhou 215123, Peoples R China
基金
中国国家自然科学基金;
关键词
CASP; Machine learning; Chemical reaction condition; High-throughput screening; YIELD PREDICTION; RANDOM FOREST; CHEMISTRY; CLASSIFICATION; SYSTEM; TOOL;
D O I
10.1002/chem.202401626
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Computer-aided synthesis planning (CASP) has garnered increasing attention in light of recent advancements in machine learning models. While the focus is on reverse synthesis or forward outcome prediction, optimizing reaction conditions remains a significant challenge. For datasets with multiple variables, the choice of descriptors and models is pivotal. This selection dictates the effective extraction of conditional features and the achievement of higher prediction accuracy. This review delineates the origins of data in conditional optimization, the criteria for descriptor selection, the response models, and the metrics for outcome evaluation, aiming to acquaint readers with the latest research trends and facilitate more informed research in this domain. This paper reviews the topic of the Computer Aided Synthesis Project (CASP) and gives a detailed overview of the methods of machine learning to predict chemical reaction conditions, including the problem of data and descriptor selection, the indicators for the evaluation of results and the current research progress of various algorithms. The future research trends and possible challenges in chemical condition prediction are prospected. image
引用
收藏
页数:16
相关论文
共 111 条
[1]   Machine learning and molecular descriptors enable rational solvent selection in asymmetric catalysis [J].
Amar, Yehia ;
Schweidtmann, ArturM. ;
Deutsch, Paul ;
Cao, Liwei ;
Lapkin, Alexei .
CHEMICAL SCIENCE, 2019, 10 (27) :6697-6706
[2]   Reagent prediction with a molecular transformer improves reaction data quality [J].
Andronov, Mikhail ;
Voinarovska, Varvara ;
Andronova, Natalia ;
Wand, Michael ;
Clevert, Djork-Arne ;
Schmidhuber, Jurgen .
CHEMICAL SCIENCE, 2023, 14 (12) :3235-3246
[3]  
[Anonymous], 2018, CAS SciFinder
[4]   Why is Tanimoto index an appropriate choice for fingerprint-based similarity calculations? [J].
Bajusz, David ;
Racz, Anita ;
Heberger, Kroly .
JOURNAL OF CHEMINFORMATICS, 2015, 7
[5]  
Baldi P., 2021, J. Chem. Inf. Model, V62, P2011
[6]  
BAYES T, 1958, BIOMETRIKA, V45, P296
[7]   Machine Learning May Sometimes Simply Capture LiteraturePopularity Trends: A Case Study of Heterocyclic Suzuki-MiyauraCoupling [J].
Beker, Wiktor ;
Roszak, Rafal ;
Wolos, Agnieszka ;
Angello, Nicholas H. ;
Rathore, Vandana ;
Burke, Martin D. ;
Grzybowski, Bartosz A. .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2022, 144 (11) :4819-4827
[8]   Bayesian optimization as a valuable tool for sustainable chemical reaction development [J].
Braconi, Elena .
NATURE REVIEWS METHODS PRIMERS, 2023, 3 (01)
[9]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[10]   MetaRF: attention-based random forest for reaction yield prediction with a few trails [J].
Chen, Kexin ;
Chen, Guangyong ;
Li, Junyou ;
Huang, Yuansheng ;
Wang, Ercheng ;
Hou, Tingjun ;
Heng, Pheng-Ann .
JOURNAL OF CHEMINFORMATICS, 2023, 15 (01)