Consistency of estimators of population scaled parameters using composite likelihood

Cited by: 0
Author
Carsten Wiuf
Affiliation
[1] Bioinformatics Research Center, University of Aarhus
Source
Journal of Mathematical Biology | 2006 / Vol. 53
Keywords
Coalescent theory; Composite likelihood; Consistency; Estimator; Genomic data
DOI
Not available
Abstract
Composite likelihood methods have become very popular for the analysis of large-scale genomic data sets because of the computational intractability of the basic coalescent process and its generalizations: it is virtually impossible to calculate the likelihood of an observed data set spanning a large chromosomal region without using approximate or heuristic methods. Composite likelihood methods are approximate methods; in the present article, the composite likelihood is written as a product of likelihoods, one for each of a number of smaller regions that together make up the whole region from which data is collected. A very general framework for neutral coalescent models is presented and discussed. The framework comprises many of the most popular coalescent models currently used for the analysis of genetic data. Assuming data is collected from a series of consecutive regions of equal size, it is shown that the observed data forms a stationary, ergodic process. General conditions are given under which the maximum composite likelihood estimator of the parameters describing the model (e.g. mutation rates, demographic parameters and the recombination rate) is consistent as the number of regions tends to infinity.
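The idea in the abstract can be illustrated with a toy sketch. It is not the article's coalescent framework: the per-region data here is assumed to be a Gaussian AR(1) sequence, chosen only because it is stationary and ergodic with a known marginal distribution. All function names and parameters (`simulate_regions`, `composite_loglik`, `max_composite_estimate`, `rho`, `seed`) are hypothetical. The composite log-likelihood sums the per-region marginal log-likelihoods as if the regions were independent, and the resulting estimate still settles near the true parameter as the number of regions grows.

```python
import math
import random


def simulate_regions(n_regions, mu, rho=0.6, seed=1):
    """Draw a stationary, ergodic Gaussian AR(1) sequence with marginal N(mu, 1).

    Toy stand-in for per-region summaries; NOT a coalescent simulation.
    """
    rng = random.Random(seed)
    innovation_sd = math.sqrt(1.0 - rho * rho)
    x = rng.gauss(mu, 1.0)  # start in the stationary distribution
    data = []
    for _ in range(n_regions):
        data.append(x)
        # Neighbouring regions are correlated through rho.
        x = mu + rho * (x - mu) + rng.gauss(0.0, innovation_sd)
    return data


def composite_loglik(mu, data):
    """Sum of per-region marginal N(mu, 1) log-likelihoods, ignoring dependence."""
    return sum(-0.5 * math.log(2.0 * math.pi) - 0.5 * (x - mu) ** 2 for x in data)


def max_composite_estimate(data):
    """Maximiser of composite_loglik; for this toy model it is the sample mean."""
    return sum(data) / len(data)


if __name__ == "__main__":
    true_mu = 2.0  # assumed "true" parameter for the illustration
    for n in (10, 100, 1000, 10000):
        estimate = max_composite_estimate(simulate_regions(n, true_mu))
        print(n, round(estimate, 3))
```

In this toy case the maximiser has a closed form, so no numerical optimisation is needed; for coalescent models the per-region likelihoods would typically have to be approximated and maximised numerically.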
Pages: 821-841
Page count: 20