Gaussian process enhanced semi-automatic approximate Bayesian computation: parameter inference in a stochastic differential equation system for chemotaxis

被引:5
作者
Borowska, Agnieszka [1 ]
Giurghita, Diana [1 ]
Husmeier, Dirk [1 ]
机构
[1] Univ Glasgow, Sch Math & Stat, Glasgow G12 8QQ, Lanark, Scotland
基金
英国工程与自然科学研究理事会;
关键词
Biophysics; Statistical inference; Approximate Bayesian computations; Gaussian processes; Chemotaxis; Stochastic differential equations; CHAIN MONTE-CARLO; POPULATION-GROWTH; SELECTION; MODELS; CELLS;
D O I
10.1016/j.jcp.2020.109999
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Chemotaxis is a type of cell movement in response to a chemical stimulus which plays a key role in multiple biophysical processes, such as embryogenesis and wound healing, and which is crucial for understanding metastasis in cancer research. In the literature, chemotaxis has been modelled using biophysical models based on systems of nonlinear stochastic partial differential equations (NSPDEs), which are known to be challenging for statistical inference due to the intractability of the associated likelihood and the high computational costs of their numerical integration. Therefore, data analysis in this context has been limited to comparing predictions from NSPDE models to laboratory data using simple descriptive statistics. We present a statistically rigorous framework for parameter estimation in complex biophysical systems described by NSPDEs such as the one of chemotaxis. We adopt a likelihood-free approach based on approximate Bayesian computations with sequential Monte Carlo (ABC-SMC) which allows for circumventing the intractability of the likelihood. To find informative summary statistics, crucial for the performance of ABC, we propose to use a Gaussian process (GP) regression model. The interpolation provided by the GP regression turns out useful on its own merits: it relatively accurately estimates the parameters of the NSPDE model and allows for uncertainty quantification, at a very low computational cost. Our proposed methodology allows for a considerable part of computations to be completed before having observed any data, providing a practical toolbox to experimental scientists whose modes of operation frequently involve experiments and inference taking place at distinct points in time. In an application to externally provided synthetic data we demonstrate that the correction provided by ABC-SMC is essential for accurate estimation of some of the NSPDE model parameters and for more flexible uncertainty quantification. (C) 2020 Elsevier Inc. All rights reserved.
引用
收藏
页数:34
相关论文
共 43 条
[1]   Particle Markov chain Monte Carlo methods [J].
Andrieu, Christophe ;
Doucet, Arnaud ;
Holenstein, Roman .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2010, 72 :269-342
[2]   THE PSEUDO-MARGINAL APPROACH FOR EFFICIENT MONTE CARLO COMPUTATIONS [J].
Andrieu, Christophe ;
Roberts, Gareth O. .
ANNALS OF STATISTICS, 2009, 37 (02) :697-725
[3]  
[Anonymous], 2020, -categories, Handbook of homotopy theory, 549617
[4]  
Beaumont MA, 2003, GENETICS, V164, P1139
[5]  
Beaumont MA, 2002, GENETICS, V162, P2025
[6]   Adaptive approximate Bayesian computation [J].
Beaumont, Mark A. ;
Cornuet, Jean-Marie ;
Marin, Jean-Michel ;
Robert, Christian P. .
BIOMETRIKA, 2009, 96 (04) :983-990
[7]  
Bonhomme V, 2014, J STAT SOFTW, V56, P1
[8]   Gaussian process emulation of dynamic computer codes [J].
Conti, S. ;
Gosling, J. P. ;
Oakley, J. E. ;
O'Hagan, A. .
BIOMETRIKA, 2009, 96 (03) :663-676
[9]   Bayesian emulation of complex multi-output and dynamic computer models [J].
Conti, Stefano ;
O'Hagan, Anthony .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2010, 140 (03) :640-651
[10]  
Cox D.R., 2000, THEORY DESIGN EXPT