Bayesian uncertainty analysis for complex systems biology models: emulation, global parameter searches and evaluation of gene functions

被引:45
作者
Vernon, Ian [1 ]
Liu, Junli [2 ]
Goldstein, Michael [1 ]
Rowe, James [2 ,3 ]
Topping, Jen [2 ]
Lindsey, Keith [2 ]
机构
[1] Univ Durham, Dept Math Sci, South Rd, Durham DH1 3LE, England
[2] Univ Durham, Dept Biosci, South Rd, Durham DH1 3LE, England
[3] Univ Sheffield, Dept Mol Biol & Biotechnol, Western Bank, Firth Court, Sheffield S10 2TN, S Yorkshire, England
基金
英国生物技术与生命科学研究理事会; 英国工程与自然科学研究理事会;
关键词
Parameter search; Kinetic models; Emulation; Bayesian uncertainty analysis; Arabidopsis; Root development; Hormonal signalling; GALAXY FORMATION; COMPUTER-MODEL; ROOT-GROWTH; CALIBRATION; AUXIN; PREDICTION; INFERENCE; PEPTIDE; DESIGN;
D O I
10.1186/s12918-017-0484-3
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Many mathematical models have now been employed across every area of systems biology. These models increasingly involve large numbers of unknown parameters, have complex structure which can result in substantial evaluation time relative to the needs of the analysis, and need to be compared to observed data of various forms. The correct analysis of such models usually requires a global parameter search, over a high dimensional parameter space, that incorporates and respects the most important sources of uncertainty. This can be an extremely difficult task, but it is essential for any meaningful inference or prediction to be made about any biological system. It hence represents a fundamental challenge for the whole of systems biology. Methods: Bayesian statistical methodology for the uncertainty analysis of complex models is introduced, which is designed to address the high dimensional global parameter search problem. Bayesian emulators that mimic the systems biology model but which are extremely fast to evaluate are embeded within an iterative history match: an efficient method to search high dimensional spaces within a more formal statistical setting, while incorporating major sources of uncertainty. Results: The approach is demonstrated via application to a model of hormonal crosstalk in Arabidopsis root development, which has 32 rate parameters, for which we identify the sets of rate parameter values that lead to acceptable matches between model output and observed trend data. The multiple insights into the model's structure that this analysis provides are discussed. The methodology is applied to a second related model, and the biological consequences of the resulting comparison, including the evaluation of gene functions, are described. Conclusions: Bayesian uncertainty analysis for complex models using both emulators and history matching is shown to be a powerful technique that can greatly aid the study of a large class of systems biology models. It both provides insight into model behaviour and identifies the sets of rate parameters of interest.
引用
收藏
页数:29
相关论文
共 70 条
[1]   Tools for kinetic modeling of biochemical networks [J].
Alves, Rui ;
Antunes, Fernando ;
Salvador, Armindo .
NATURE BIOTECHNOLOGY, 2006, 24 (06) :667-672
[2]  
Andrianakis I, 2017, J ROY STAT SOC C, V66, P717, DOI 10.1111/rssc.12198
[3]   Bayesian History Matching of Complex Infectious Disease Models Using Emulation: A Tutorial and a Case Study on HIV in Uganda [J].
Andrianakis, Ioannis ;
Vernon, Ian R. ;
McCreesh, Nicky ;
McKinley, Trevelyan J. ;
Oakley, Jeremy E. ;
Nsubuga, Rebecca N. ;
Goldstein, Michael ;
White, Richard G. .
PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (01)
[4]  
[Anonymous], 1999, Encyclopedia of Statistical Sciences
[5]  
[Anonymous], 2003, QUAL ENG
[6]  
[Anonymous], 2013, COMPUT MATH METHODS
[7]  
[Anonymous], BAYES LINEAR EMULATI
[8]  
[Anonymous], 2012, GPy: A gaussian process framework in python
[9]  
[Anonymous], 2015, A language and environment for statistical computing
[10]  
[Anonymous], BAYESIAN ANAL