Gradient Boosted Machine Learning Model to Predict H2, CH4, and CO2 Uptake in Metal-Organic Frameworks Using Experimental Data

被引:15
作者
Bailey, Tom [1 ]
Jackson, Adam [1 ]
Berbece, Razvan-Antonio [1 ]
Wu, Kejun [1 ,2 ]
Hondow, Nicole [1 ]
Martin, Elaine [1 ]
机构
[1] Univ Leeds, Sch Chem & Proc Engn, Leeds LS2 9JT, England
[2] Zhejiang Univ, Coll Chem & Biol Engn, Zhejiang Prov Key Lab Adv Chem Engn Manufacture Te, Hangzhou 310027, Peoples R China
基金
英国工程与自然科学研究理事会;
关键词
GAS-STORAGE;
D O I
10.1021/acs.jcim.3c00135
中图分类号
R914 [药物化学];
学科分类号
100701 ;
摘要
Predictive screening of metal-organic framework(MOF) materialsfor their gas uptake properties has been previously limited by usingdata from a range of simulated sources, meaning the final predictionsare dependent on the performance of these original models. In thiswork, experimental gas uptake data has been used to create a GradientBoosted Tree model for the prediction of H-2, CH4, and CO2 uptake over a range of temperatures and pressuresin MOF materials. The descriptors used in this database were obtainedfrom the literature, with no computational modeling needed. This modelwas repeated 10 times, showing an average R (2) of 0.86 and a mean absolute error (MAE) of & PLUSMN;2.88 wt % acrossthe runs. This model will provide gas uptake predictions for a rangeof gases, temperatures, and pressures as a one-stop solution, withthe data provided being based on previous experimental observationsin the literature, rather than simulations, which may differ fromtheir real-world results. The objective of this work is to createa machine learning model for the inference of gas uptake in MOFs.The basis of model development is experimental as opposed to simulateddata to realize its applications by practitioners. The real-worldnature of this research materializes in a focus on the applicationof algorithms as opposed to the detailed assessment of the algorithms.
引用
收藏
页码:4545 / 4551
页数:7
相关论文
共 21 条
[1]   Chemically intuited, large-scale screening of MOFs by machine learning techniques [J].
Borboudakis, Giorgos ;
Stergiannakos, Taxiarchis ;
Frysali, Maria ;
Klontzas, Emmanuel ;
Tsamardinos, Ioannis ;
Froudakis, George E. .
NPJ COMPUTATIONAL MATERIALS, 2017, 3
[2]   Tuning porosity in macroscopic monolithic metal-organic frameworks for exceptional natural gas storage [J].
Connolly, B. M. ;
Aragones-Anglada, M. ;
Gandara-Loe, J. ;
Danaf, N. A. ;
Lamb, D. C. ;
Mehta, J. P. ;
Vulpe, D. ;
Wuttke, S. ;
Silvestre-Albero, J. ;
Moghadam, P. Z. ;
Wheatley, A. E. H. ;
Fairen-Jimenez, D. .
NATURE COMMUNICATIONS, 2019, 10 (1)
[3]   Industrial applications of metal-organic frameworks [J].
Czaja, Alexander U. ;
Trukhan, Natalia ;
Mueller, Ulrich .
CHEMICAL SOCIETY REVIEWS, 2009, 38 (05) :1284-1293
[4]  
Dmitrienko A., 2007, PHARM STAT USING SAS
[5]   A working guide to boosted regression trees [J].
Elith, J. ;
Leathwick, J. R. ;
Hastie, T. .
JOURNAL OF ANIMAL ECOLOGY, 2008, 77 (04) :802-813
[6]   A Universal Machine Learning Algorithm for Large-Scale Screening of Materials [J].
Fanourgakis, George S. ;
Gkagkas, Konstantinos ;
Tylianakis, Emmanuel ;
Froudakis, George E. .
JOURNAL OF THE AMERICAN CHEMICAL SOCIETY, 2020, 142 (08) :3814-3822
[7]   Advancesn in CO2 capture technology -: The US Department of Energy's Carbon Sequestration Program [J].
Figueroa, Jose D. ;
Fout, Timothy ;
Plasynski, Sean ;
McIlvried, Howard ;
Srivastava, Rameshwar D. .
INTERNATIONAL JOURNAL OF GREENHOUSE GAS CONTROL, 2008, 2 (01) :9-20
[8]   Greedy function approximation: A gradient boosting machine [J].
Friedman, JH .
ANNALS OF STATISTICS, 2001, 29 (05) :1189-1232
[9]   Variable influence on projection (VIP) for OPLS models and its applicability in multivariate time series analysis [J].
Galindo-Prieto, Beatriz ;
Eriksson, Lennart ;
Trygg, Johan .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2015, 146 :297-304
[10]  
Leps J., 2020, BIOSTAT R, P219, DOI [10.1017/9781108616041.015, DOI 10.1017/9781108616041.015]