Machine learning-based yield prediction for transition metal-catalyzed cross-coupling reactions

被引:1
作者
Rajalakshmi, C. [1 ,2 ]
Vijay, Vivek [1 ]
Vijayakumar, Abhirami [1 ]
Salim, Shajila [1 ]
Cherian, Sherin Susan [1 ]
Santhoshkumar, Parvathi [1 ]
Kottooran, John B. [1 ]
Abraham, Ann Miriam [1 ]
Krishnaveni, G. [1 ]
Anjanakutty, C. S. [1 ]
Varghese, Binuja [1 ]
Thomas, Vibin Ipe [1 ,2 ]
机构
[1] Mahatma Gandhi Univ, CMS Coll Kottayam Autonomous, Dept Chem, Kottayam 686001, Kerala, India
[2] CMS Coll Kottayam Autonomous, ECG Sudarshan Ctr Theoret Sci, Kottayam 686001, Kerala, India
关键词
Machine learning; Cross-coupling reactions; Predictive modeling; Transition metal catalysts; Regression; Classification; TERMINAL ALKYNES; ARYL IODIDES; PROTOCOL; HALIDES;
D O I
10.1007/s00214-024-03159-0
中图分类号
O64 [物理化学(理论化学)、化学物理学];
学科分类号
070304 ; 081704 ;
摘要
The advent of transition metal-catalyzed cross-coupling reactions has marked a significant milestone in the field of organic chemistry, primarily due to their pivotal role in facilitating the construction of carbon-carbon and carbon-heteroatom bonds. Traditionally, yield determination in cross-coupling reactions has predominantly relied on experimental methods. However, recent advancements in machine learning (ML) algorithms have revolutionized yield prediction through the use of predictive models. While the prior studies have primarily concentrated on homogeneous datasets of cross-coupling reactions, the accurate prediction of yields for heterogeneous datasets possess a formidable challenge. To address this issue, this study aims to develop machine learning models for yield prediction by curating extendable, open-access heterogeneous datasets of transition metal-catalyzed cross-coupling reactions. In our study, we employed both regression and classification models, leveraging various featurization methods. Among them, the DRFP featurized random forest model is found to have better predictive performance obtaining an R2 value of 0.79 over the neural network and KNN models. By identifying suitable machine learning models for yield prediction, this study contributes to the development of predictive models for sustainable transition metal catalysis.
引用
收藏
页数:9
相关论文
共 42 条
[1]   Predicting reaction performance in C-N cross-coupling using machine learning [J].
Ahneman, Derek T. ;
Estrada, Jesus G. ;
Lin, Shishi ;
Dreher, Spencer D. ;
Doyle, Abigail G. .
SCIENCE, 2018, 360 (6385) :186-190
[2]   A Novel Protocol for the Cu-Catalyzed Sonogashira Coupling Reaction between Aryl Halides and Terminal Alkynes using trans-1,2-Diaminocyclohexane Ligand [J].
Asha, Sujatha ;
Thomas, Anns Maria ;
Ujwaldev, Sankuviruthiyil M. ;
Anilkumar, Gopinathan .
CHEMISTRYSELECT, 2016, 1 (13) :3938-3941
[3]   Recent advances in transition metal-catalysed cross-coupling of (hetero)aryl halides and analogues under ligand-free conditions [J].
Ayogu, Jude I. ;
Onoabedje, Efeturi A. .
CATALYSIS SCIENCE & TECHNOLOGY, 2019, 9 (19) :5233-5255
[4]   Random forests [J].
Breiman, L .
MACHINE LEARNING, 2001, 45 (01) :5-32
[5]   Combinatorial explosion in homogeneous catalysis: Screening 60,000 cross-coupling reactions [J].
Burello, E ;
Farrusseng, D ;
Rothenberg, G .
ADVANCED SYNTHESIS & CATALYSIS, 2004, 346 (13-15) :1844-1853
[6]   Comment on "Predicting reaction performance in C-N cross-coupling using machine learning" [J].
Chuang, Kangway V. ;
Keiser, Michael J. .
SCIENCE, 2018, 362 (6416)
[7]   Data-Driven Advancement of Homogeneous Nickel Catalyst Activity for Aryl Ether Cleavage [J].
Cordova, Manuel ;
Wodrich, Matthew D. ;
Meyer, Benjamin ;
Sawatlon, Boodsarin ;
Corminboeuf, Clemence .
ACS CATALYSIS, 2020, 10 (13) :7021-7031
[8]  
Das M, 2022, J Chem Phys
[9]   Palladium-Catalyzed Cross-Coupling Reactions: A Powerful Tool for the Synthesis of Agrochemicals [J].
Devendar, Ponnam ;
Qu, Ren-Yu ;
Kang, Wei-Ming ;
He, Bo ;
Yang, Guang-Fu .
JOURNAL OF AGRICULTURAL AND FOOD CHEMISTRY, 2018, 66 (34) :8914-8934
[10]  
Han FS, 2013, CHEM SOC REV, V42, P5270, DOI 10.1039/c3cs35521g