Support vector machines for predictive modeling in heterogeneous catalysis: A comprehensive introduction and overfitting investigation based on two real applications

被引:72
作者
Baumes, L. A. [1 ]
Serra, J. M. [1 ]
Serna, P. [1 ]
Corma, A. [1 ]
机构
[1] Univ Politecn Valencia, CSIC, Inst Tecnol Quim, Valencia 46022, Spain
来源
JOURNAL OF COMBINATORIAL CHEMISTRY | 2006年 / 8卷 / 04期
关键词
ARTIFICIAL NEURAL-NETWORKS; OXIDATIVE DEHYDROGENATION; METHANOL SYNTHESIS; OPTIMIZATION; DESIGN; LIBRARIES; DISCOVERY; ETHYLENE; PROPANE; SEARCH;
D O I
10.1021/cc050093m
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
This works provides an introduction to support vector machines (SVMs) for predictive modeling in heterogeneous catalysis, describing step by step the methodology with a highlighting of the points which make such technique an attractive approach. We first investigate linear SVMs, working in detail through a simple example based on experimental data derived from a study aiming at optimizing olefin epoxidation catalysts applying high-throughput experimentation. This case study has been chosen to underline SVM features in a visual manner because of the few catalytic variables investigated. It is shown how SVMs transform original data into another representation space of higher dimensionality. The concepts of Vapnik-Chervonenkis dimension and structural risk minimization are introduced. The SVM methodology is evaluated with a second catalytic application, that is, light paraffin isomerization. Finally, we discuss why SVMs is a strategic method, as compared to other machine learning techniques, such as neural networks or induction trees, and why emphasis is put on the problem of overfitting.
引用
收藏
页码:583 / 596
页数:14
相关论文
共 73 条
[31]   Note on free lunches and cross-validation [J].
Goutte, C .
NEURAL COMPUTATION, 1997, 9 (06) :1245-1249
[32]   Fundamental insights into the oxidative dehydrogenation of ethane to ethylene over catalytic materials discovered by an evolutionary approach [J].
Grubert, G ;
Kondratenko, E ;
Kolf, S ;
Baerns, M ;
van Geem, P ;
Parton, R .
CATALYSIS TODAY, 2003, 81 (03) :337-345
[33]   Applications of combinatorial methods in catalysis [J].
Hagemeyer, A ;
Jandeleit, B ;
Liu, YM ;
Poojary, DM ;
Turner, HW ;
Volpe, AF ;
Weinberg, WH .
APPLIED CATALYSIS A-GENERAL, 2001, 221 (1-2) :23-43
[34]  
Hjorth JSU., 1994, COMPUTER INTENSIVE S
[35]   Feedforward neural networks in catalysis - A tool for the approximation of the dependency of yield on catalyst composition, and for knowledge extraction [J].
Holena, M ;
Baerns, M .
CATALYSIS TODAY, 2003, 81 (03) :485-494
[36]   Improving process operations using support vector machines and decision trees [J].
Jemwa, GT ;
Aldrich, C .
AICHE JOURNAL, 2005, 51 (02) :526-543
[37]  
Joachims T, 2002, Learning to classify text using support vector machines, P35
[38]   The development of descriptors for solids:: Teaching "catalytic intuition" to a computer [J].
Klanner, C ;
Farrusseng, D ;
Baumes, L ;
Lengliz, M ;
Mirodatos, C ;
Schüth, F .
ANGEWANDTE CHEMIE-INTERNATIONAL EDITION, 2004, 43 (40) :5347-5349
[39]   How to design diverse libraries of solid catalysts? [J].
Klanner, C ;
Farrusseng, D ;
Baumes, L ;
Mirodatos, C ;
Schüth, F .
QSAR & COMBINATORIAL SCIENCE, 2003, 22 (07) :729-736
[40]   Prediction of electrophoretic mobility of substituted aromatic acids in different aqueous-alcoholic solvents by capillary zone electrophoresis based on support vector machine [J].
Liu, HM ;
Zhang, RS ;
Yao, XJ ;
Liu, MC ;
Hu, ZD ;
Fan, BT .
ANALYTICA CHIMICA ACTA, 2004, 525 (01) :31-41