The Evolution of Data-Driven Modeling in Organic Chemistry

被引:92
作者
Williams, Wendy L. [1 ,2 ]
Zeng, Lingyu [3 ]
Gensch, Tobias [4 ]
Sigman, Matthew S. [5 ]
Doyle, Abigail G. [1 ,2 ]
Anslyn, Eric, V [3 ]
机构
[1] Univ Calif Los Angeles, Dept Chem & Biochem, Los Angeles, CA 90095 USA
[2] Princeton Univ, Dept Chem, Princeton, NJ 08544 USA
[3] Univ Texas Austin, Dept Chem, Austin, TX 78712 USA
[4] TU Berlin, Dept Chem, D-10623 Berlin, Germany
[5] Univ Utah, Dept Chem, Salt Lake City, UT 84112 USA
关键词
SUPPORT VECTOR MACHINES; CHEMICAL PROBLEMS; PATTERN-RECOGNITION; ARTIFICIAL-INTELLIGENCE; NEURAL-NETWORKS; CHEMOMETRICS; CLASSIFICATION; PREDICTION; DESIGN; CHEMOINFORMATICS;
D O I
10.1021/acscentsci.1c00535
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Organic chemistry is replete with complex relationships: for example, how a reactant's structure relates to the resulting product formed; how reaction conditions relate to yield; how a catalyst's structure relates to enantioselectivity. Questions like these are at the foundation of understanding reactivity and developing novel and improved reactions. An approach to probing these questions that is both longstanding and contemporary is data-driven modeling. Here, we provide a synopsis of the history of data-driven modeling in organic chemistry and the terms used to describe these endeavors. We include a timeline of the steps that led to its current state. The case studies included highlight how, as a community, we have advanced physical organic chemistry tools with the aid of computers and data to augment the intuition of expert chemists and to facilitate the prediction of structure-activity and structure-property relationships.
引用
收藏
页码:1622 / 1637
页数:16
相关论文
共 141 条
[1]   Predicting reaction performance in C-N cross-coupling using machine learning [J].
Ahneman, Derek T. ;
Estrada, Jesus G. ;
Lin, Shishi ;
Dreher, Spencer D. ;
Doyle, Abigail G. .
SCIENCE, 2018, 360 (6385) :186-190
[2]   A comparative study of K-Nearest Neighbour, Support Vector Machine and Multi-Layer Perceptron for Thalassemia screening [J].
Amendolia, SR ;
Cossu, G ;
Ganadu, ML ;
Golosio, B ;
Masala, GL ;
Mura, GM .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2003, 69 (1-2) :13-20
[3]  
[Anonymous], 2007, PROG PHYS ORG CHEM
[4]  
Anslyn E.V., 2005, Modern Physical Organic Chemistry
[5]   AUTOMATIC CLASSIFICATION OF TWO-DIMENSIONAL GEL-ELECTROPHORESIS PICTURES BY HEURISTIC CLUSTERING ANALYSIS - A STEP TOWARD MACHINE LEARNING [J].
APPEL, R ;
HOCHSTRASSER, D ;
ROCH, C ;
FUNK, M ;
MULLER, AF ;
PELLEGRINI, C .
ELECTROPHORESIS, 1988, 9 (03) :136-142
[6]  
Bajorath J., 2004, Chemoinformatics: Concepts, Methods, and Tools for Drug Discovery
[7]   COMPUTERS AND ORGANIC SYNTHESIS [J].
BERSOHN, M ;
ESACK, A .
CHEMICAL REVIEWS, 1976, 76 (02) :269-282
[8]   Chemometric Methods for Spectroscopy-Based Pharmaceutical Analysis [J].
Biancolillo, Alessandra ;
Marini, Federico .
FRONTIERS IN CHEMISTRY, 2018, 6
[9]  
Bishop C.M., 2006, Pattern Recognition and Machine Learning
[10]  
Bongard M.M., 1970, PATTERN RECOGN