Support vector machines for predictive modeling in heterogeneous catalysis: A comprehensive introduction and overfitting investigation based on two real applications

被引:72
作者
Baumes, L. A. [1 ]
Serra, J. M. [1 ]
Serna, P. [1 ]
Corma, A. [1 ]
机构
[1] Univ Politecn Valencia, CSIC, Inst Tecnol Quim, Valencia 46022, Spain
来源
JOURNAL OF COMBINATORIAL CHEMISTRY | 2006年 / 8卷 / 04期
关键词
ARTIFICIAL NEURAL-NETWORKS; OXIDATIVE DEHYDROGENATION; METHANOL SYNTHESIS; OPTIMIZATION; DESIGN; LIBRARIES; DISCOVERY; ETHYLENE; PROPANE; SEARCH;
D O I
10.1021/cc050093m
中图分类号
O69 [应用化学];
学科分类号
081704 ;
摘要
This works provides an introduction to support vector machines (SVMs) for predictive modeling in heterogeneous catalysis, describing step by step the methodology with a highlighting of the points which make such technique an attractive approach. We first investigate linear SVMs, working in detail through a simple example based on experimental data derived from a study aiming at optimizing olefin epoxidation catalysts applying high-throughput experimentation. This case study has been chosen to underline SVM features in a visual manner because of the few catalytic variables investigated. It is shown how SVMs transform original data into another representation space of higher dimensionality. The concepts of Vapnik-Chervonenkis dimension and structural risk minimization are introduced. The SVM methodology is evaluated with a second catalytic application, that is, light paraffin isomerization. Finally, we discuss why SVMs is a strategic method, as compared to other machine learning techniques, such as neural networks or induction trees, and why emphasis is put on the problem of overfitting.
引用
收藏
页码:583 / 596
页数:14
相关论文
共 73 条
[1]  
[Anonymous], P INT C COMP SCI INF
[2]  
[Anonymous], 1999, REPOSIT TU DORTMUND, DOI DOI 10.17877/DE290R-5098
[3]  
[Anonymous], 1991, Computer systems that learn classification and prediction methods from statistics, neural nets, machine learning and expert systems
[4]  
[Anonymous], 1987, Practical methods of optimization: Unconstrained Optimization
[5]  
Bauman Zygmunt., 2001, J CONSUM CULT, V1, P9, DOI [DOI 10.1177/146954050100100102, 10.1177/146954050100100102]
[6]   Using Artificial Neural Networks to boost high-throughput discovery in heterogeneous catalysis [J].
Baumes, L ;
Farrusseng, D ;
Lengliz, M ;
Mirodatos, C .
QSAR & COMBINATORIAL SCIENCE, 2004, 23 (09) :767-778
[7]   MAP: An iterative experimental design methodology for the optimization of catalytic search space structure modeling [J].
Baumes, LA .
JOURNAL OF COMBINATORIAL CHEMISTRY, 2006, 8 (03) :304-314
[8]  
BAUMES LA, 2005, INT C EUR CAT 7
[9]  
BAUMES LA, 2003, LECT NOTES AI LNCS L
[10]  
Bishop C. M., 1995, NEURAL NETWORKS PATT