Machine Learning for Fluid Property Correlations: Classroom Examples with MATLAB

被引:40
作者
Joss, Lisa [1 ]
Mueller, Erich A. [1 ]
机构
[1] Imperial Coll London, Dept Chem Engn, London SW7 2AZ, England
关键词
First-Year Undergraduate/General; Second-Year Undergraduate; Chemical Engineering; Computer-Based Learning; Mathematics/Symbolic Mathematics; Molecular Properties/Structure; Physical Properties; NEURAL-NETWORK; PREDICTION;
D O I
10.1021/acs.jchemed.8b00692
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Recent advances in computer hardware and algorithms are spawning an explosive growth in the use of computer-based systems aimed at analyzing and ultimately correlating large amounts of experimental and synthetic data. As these machine learning tools become more widespread, it is becoming imperative that scientists and researchers become familiar with them, both in terms of understanding the tools and the current limitations of artificial intelligence, and more importantly being able to critically separate the hype from the real potential. This article presents a classroom exercise aimed at first-year science and engineering college students, where a task is set to produce a correlation to predict the normal boiling point of organic compounds from an unabridged data set of >6000 compounds. The exercise, which is fully documented in terms of the problem statement and the solution, guides the students to initially perform a linear correlation of the boiling point data with a plausible relevant variable (the molecular weight) and to further refine it using multivariate linear fitting employing a second descriptor (the acentric factor). Finally, the data are processed through an artificial neural network to eventually provide an engineering-quality correlation. The problem statements, data files for the development of the exercise, and solutions are provided within a MATLAB environment but are general in nature.
引用
收藏
页码:697 / 703
页数:7
相关论文
共 32 条
[21]   Genetic programming (GP) approach for prediction of supercritical CO2 thermal conductivity [J].
Rostami, Alireza ;
Arabloo, Milad ;
Ebadi, Hojatollah .
CHEMICAL ENGINEERING RESEARCH & DESIGN, 2017, 122 :164-175
[22]   Machine Learning [J].
Schneider, William F. ;
Guo, Hua .
JOURNAL OF PHYSICAL CHEMISTRY B, 2018, 122 (04) :1347-1347
[23]   ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost [J].
Smith, J. S. ;
Isayev, O. ;
Roitberg, A. E. .
CHEMICAL SCIENCE, 2017, 8 (04) :3192-3203
[24]   Machine learning for crystal identification and discovery [J].
Spellings, Matthew ;
Glotzer, Sharon C. .
AICHE JOURNAL, 2018, 64 (06) :2198-2206
[25]   OPENING UP THE BLACK-BOX OF ARTIFICIAL NEURAL NETWORKS [J].
SPINING, MT ;
DARSEY, JA ;
SUMPTER, BG ;
NOID, DW .
JOURNAL OF CHEMICAL EDUCATION, 1994, 71 (05) :406-411
[26]   ESTIMATION OF NORMAL BOILING POINTS FROM GROUP CONTRIBUTIONS [J].
STEIN, SE ;
BROWN, RL .
JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 1994, 34 (03) :581-587
[27]   Computational Modeling of β-Secretase 1 (BACE-1) Inhibitors Using Ligand Based Approaches [J].
Subramanian, Govindan ;
Ramsundar, Bharath ;
Pande, Vijay ;
Denny, Rajiah Aldrin .
JOURNAL OF CHEMICAL INFORMATION AND MODELING, 2016, 56 (10) :1936-1949
[28]   THEORY AND APPLICATIONS OF NEURAL COMPUTING IN CHEMICAL SCIENCE [J].
SUMPTER, BG ;
GETINO, C ;
NOID, DW .
ANNUAL REVIEW OF PHYSICAL CHEMISTRY, 1994, 45 :439-481
[29]   PROCESS FAULT-DETECTION AND DIAGNOSIS USING NEURAL NETWORKS .1. STEADY-STATE PROCESSES [J].
VENKATASUBRAMANIAN, V ;
VAIDYANATHAN, R ;
YAMAMOTO, Y .
COMPUTERS & CHEMICAL ENGINEERING, 1990, 14 (07) :699-712
[30]   Nonlinear machine learning in simulations of soft and biological materials [J].
Wang, J. ;
Ferguson, A. L. .
MOLECULAR SIMULATION, 2018, 44 (13-14) :1090-1107