High fidelity mathematical model building with experimental data: A Bayesian approach

被引:48
作者
Blau, Gary [1 ]
Lasinski, Michael [1 ]
Orcun, Seza [1 ]
Hsu, Shuo-Huan [2 ]
Caruthers, Jim [2 ]
Delgass, Nicholas [2 ]
Venkatasubramanian, Venkat [2 ]
机构
[1] Purdue Univ, e Enterprise Ctr, W Lafayette, IN 47907 USA
[2] Purdue Univ, Sch Chem Engn, Ctr Catalyst Design, W Lafayette, IN 47907 USA
关键词
Bayes' theorem; Markov Chain Monte Carlo (MCMC) sampling; model discrimination and validation; design of experiments (DOE); nonlinear statistics;
D O I
10.1016/j.compchemeng.2007.04.008
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Mathematical models of physicochemical systems are usually built in an iterative fashion during file Course of an experimental investigation. In this paper, a novel Bayesian approach to model building is presented. This approach is now feasible because of breakthroughs in Monte Carlo sampling procedures and high performance computing, that make it possible to deal directly with the nonlinear mathematical models themselves instead of their linear approximations. By including an error model for experimental data, it is further possible to use nonlinear statistical concepts to test a given model for adequacy against experimental data and prior knowledge, and to place realistic confidence limits oil the resulting model parameters. In this paper a model building work flow that takes advantage of these recent advances to enable high fidelity mathematical modeling is proposed. A set of models and their parameters are needed to initiate the process. Probability distributions for the models and their parameters based oil available quantitative and subjective information must also be supplied. Finally, an error model describing the heteroscedasticity in the data along with probability distributions for the error model parameters must be generated from exploratory data. Then experiments are designed and data collected. Using Bayes' theorem, Monte Carlo (MC) or Markov Chain Monte Carlo (MCMC) methods are used to generate a sequence of samples of parameter values for each postulated model. These sets of samples are then used to discriminate among the models using the criteria introduced in this paper. Once discrimination is achieved, a global lack of fit test is introduced to determine model adequacy. After a single adequate model is selected, highest probability density (HPD) intervals are determined for the individual parameters and HPD density regions are constructed for all model parameter pairs. Experiments are then designed to reduce the uncertainty in the joint posterior probability HPD regions. Finally, a sampling procedure is described to property represent uncertainties in predictions made from the model. The proposed approach is demonstrated by all illustrative problem where three simple models are discriminated and the parameters in the most suitable ones are estimated rigorously. (C) 2007 Published by Elsevier Ltd.
引用
收藏
页码:971 / 989
页数:19
相关论文
共 26 条
  • [1] AITKIN M, 1991, J ROY STAT SOC B MET, V53, P111
  • [2] [Anonymous], 2005, 1 COURSE MONTE CARLO
  • [3] [Anonymous], 1988, Nonlinear regression analysis and its applications, DOI DOI 10.1002/9780470316757
  • [4] ATKINSON AC, 1974, J ROY STAT SOC B MET, V36, P321
  • [5] ATKINSON AC, 1975, BIOMETRIKA, V62, P289, DOI 10.2307/2335364
  • [6] Bard Y., 1974, Nonlinear Parameter Estimation
  • [7] BOX GEP, 1959, BIOMETRIKA, V46, P77, DOI 10.1093/biomet/46.1-2.77
  • [8] DISCRIMINATION AMONG MECHANISTIC MODELS
    BOX, GEP
    HILL, WJ
    [J]. TECHNOMETRICS, 1967, 9 (01) : 57 - +
  • [9] Brooks SP, 1998, J ROY STAT SOC D-STA, V47, P69, DOI 10.1111/1467-9884.00117
  • [10] Markov chain Monte Carlo convergence diagnostics: A comparative review
    Cowles, MK
    Carlin, BP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (434) : 883 - 904