Nested sampling for parameter inference in systems biology: application to an exemplar circadian model

Cited by: 33
Authors
Aitken, Stuart [1]
Akman, Ozgur E. [2]
Affiliations
[1] Univ Edinburgh, IGMM, MRC Human Genet Unit, Edinburgh EH4 2XU, Midlothian, Scotland
[2] Univ Exeter, Coll Engn Math & Phys Sci, Ctr Syst Dynam & Control, Exeter EX4 4QF, Devon, England
Funding
Biotechnology and Biological Sciences Research Council (UK); Wellcome Trust (UK)
Keywords
Model selection; Parameter inference; Nested sampling; Circadian rhythms; Bayesian inference; Transcriptional regulation; Clocks; Neurospora; Efficient; Selection
DOI
10.1186/1752-0509-7-72
Chinese Library Classification (CLC) number
Q [Biological sciences]
Subject classification codes
07; 0710; 09
Abstract
Background: Model selection and parameter inference are complex problems that have yet to be fully addressed in systems biology. In contrast with parameter optimisation, parameter inference computes both the parameter means and their standard deviations (or full posterior distributions), thus yielding important information on the extent to which the data and the model topology constrain the inferred parameter values.

Results: We report on the application of nested sampling, a statistical approach to computing the Bayesian evidence Z, to the inference of parameters and the estimation of log Z in an established model of circadian rhythms. A ten-fold difference in the coefficient of variation between degradation and transcription parameters is demonstrated. We further show that the uncertainty remaining in the parameter values is reduced by the analysis of increasing numbers of circadian cycles of data, up to 4 cycles, but is unaffected by sampling the data more frequently. Novel algorithms for calculating the likelihood of a model, and a characterisation of the performance of the nested sampling algorithm, are also reported. The methods we develop considerably improve the computational efficiency of the likelihood calculation, and of the exploratory step within nested sampling.

Conclusions: We have demonstrated in an exemplar circadian model that the estimates of posterior parameter densities (as summarised by parameter means and standard deviations) are influenced predominantly by the length of the time series, becoming more narrowly constrained as the number of circadian cycles considered increases. We have also shown the utility of the coefficient of variation for discriminating between highly constrained and less well constrained parameters.
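
For readers unfamiliar with the two quantities the abstract relies on, the following is a minimal sketch of nested sampling on a toy one-parameter problem, followed by the posterior mean, standard deviation and coefficient of variation computed from the weighted samples. It is not the authors' algorithm and does not use the circadian model: the uniform prior on [0, 10], the Gaussian likelihood (MU = 2, SIGMA = 0.5), the function names and the settings n_live and n_iter are all assumptions chosen for illustration. The exploratory step here exploits the symmetry of the toy likelihood to draw directly from the constrained prior, which is only possible in such simple cases.

```python
import math
import random

# Hypothetical toy problem (NOT the circadian model): one parameter theta,
# uniform prior on [LO, HI], Gaussian likelihood centred on MU with width SIGMA.
LO, HI = 0.0, 10.0
MU, SIGMA = 2.0, 0.5


def log_likelihood(theta):
    z = (theta - MU) / SIGMA
    return -0.5 * z * z - math.log(SIGMA * math.sqrt(2.0 * math.pi))


def log_add(a, b):
    """Return log(exp(a) + exp(b)) without overflow."""
    if a == -math.inf:
        return b
    if b == -math.inf:
        return a
    hi, lo = max(a, b), min(a, b)
    return hi + math.log1p(math.exp(lo - hi))


def nested_sampling(n_live=500, n_iter=5000, seed=1):
    """Estimate log Z and return weighted samples as (theta, log_weight)."""
    rng = random.Random(seed)
    live = []
    for _ in range(n_live):
        theta = rng.uniform(LO, HI)
        live.append((log_likelihood(theta), theta))

    log_z = -math.inf
    samples = []
    x_prev = 1.0  # prior mass enclosed by the current likelihood contour
    for i in range(1, n_iter + 1):
        # The live point with the lowest likelihood defines the new contour.
        worst = min(range(n_live), key=live.__getitem__)
        log_l_star, theta_star = live[worst]
        x = math.exp(-i / n_live)  # expected shrinkage of the prior mass
        log_w = log_l_star + math.log(x_prev - x)
        log_z = log_add(log_z, log_w)
        samples.append((theta_star, log_w))
        # Exploratory step (toy shortcut): L > L* is the interval
        # |theta - MU| < r, so sample uniformly inside it, clipped to the prior.
        r = abs(theta_star - MU)
        new_theta = rng.uniform(max(LO, MU - r), min(HI, MU + r))
        live[worst] = (log_likelihood(new_theta), new_theta)
        x_prev = x

    # Share the remaining prior mass equally among the final live points.
    log_w_live = math.log(x_prev / n_live)
    for log_l, theta in live:
        log_w = log_l + log_w_live
        log_z = log_add(log_z, log_w)
        samples.append((theta, log_w))
    return log_z, samples


def posterior_summary(log_z, samples):
    """Posterior mean, standard deviation and coefficient of variation."""
    weights = [math.exp(log_w - log_z) for _, log_w in samples]
    total = sum(weights)
    mean = sum(w * t for w, (t, _) in zip(weights, samples)) / total
    var = sum(w * (t - mean) ** 2 for w, (t, _) in zip(weights, samples)) / total
    sd = math.sqrt(var)
    return mean, sd, sd / abs(mean)


if __name__ == "__main__":
    log_z, samples = nested_sampling()
    mean, sd, cv = posterior_summary(log_z, samples)
    print(f"log Z = {log_z:.3f}, mean = {mean:.3f}, sd = {sd:.3f}, CV = {cv:.3f}")
```

In this toy setting the analytic evidence is close to 1/10, so log Z should come out near log(0.1), and the coefficient of variation near SIGMA / MU = 0.25. A small coefficient of variation indicates a parameter that the data constrain tightly relative to its magnitude, which is the sense in which the abstract uses it to compare parameters.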
Pages: 12