Decoding Coalescent Hidden Markov Models in Linear Time

被引:0
|
作者
Harris, Kelley [1 ]
Sheehan, Sara [2 ]
Kamm, John A. [3 ]
Song, Yun S. [2 ,3 ,4 ]
机构
[1] Univ Calif Berkeley, Dept Math, Berkeley, CA 94720 USA
[2] Univ Calif, Div Comp Sci, Berkeley, CA USA
[3] Univ Calif, Dept Stat, Berkeley, CA USA
[4] Univ Calif, Dept Integrat Biol, Berkeley, CA USA
来源
RESEARCH IN COMPUTATIONAL MOLECULAR BIOLOGY, RECOMB2014 | 2014年 / 8394卷
关键词
Demographic inference; effective population size; coalescent with recombination; expectation-maximization; augmented hidden Markov model; human migration out of Africa; CONDITIONAL SAMPLING DISTRIBUTION; DROSOPHILA-MELANOGASTER; BAYESIAN-INFERENCE; HUMAN DEMOGRAPHY; GENOME SEQUENCE; RECOMBINATION; POPULATION; EVOLUTION; DISCOVERY; IDENTITY;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
In many areas of computational biology, hidden Markov models (HMMs) have been used to model local genomic features. In particular, coalescent HMMs have been used to infer ancient population sizes, migration rates, divergence times, and other parameters such as mutation and recombination rates. As more loci, sequences, and hidden states are added to the model, however, the runtime of coalescent HMMs can quickly become prohibitive. Here we present a new algorithm for reducing the runtime of coalescent HMMs from quadratic in the number of hidden time states to linear, without making any additional approximations. Our algorithm can be incorporated into various coalescent HMMs, including the popular method PSMC for inferring variable effective population sizes. Here we implement this algorithm to speed up our demographic inference method diCal, which is equivalent to PSMC when applied to a sample of two haplotypes. We demonstrate that the linear-time method can reconstruct a population size change history more accurately than the quadratic-time method, given similar computation resources. We also apply the method to data from the 1000 Genomes project, inferring a high-resolution history of size changes in the European population.
引用
收藏
页码:100 / 114
页数:15
相关论文
共 50 条
  • [1] Applications of hidden Markov models in bar code decoding
    Kresic-Juric, S.
    Madej, D.
    Santosa, Fadil
    PATTERN RECOGNITION LETTERS, 2006, 27 (14) : 1665 - 1672
  • [2] Optimal Decoding of Hidden Markov Models with Consistency Constraints
    Dubray, Alexandre
    Derval, Guillaume
    Nijssen, Siegfried
    Schaus, Pierre
    DISCOVERY SCIENCE (DS 2022), 2022, 13601 : 407 - 417
  • [3] Hidden Markov models for circular and linear-circular time series
    Holzmann, Hajo
    Munk, Axel
    Suster, Max
    Zucchini, Walter
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2006, 13 (03) : 325 - 347
  • [4] Hidden Markov models for circular and linear-circular time series
    Hajo Holzmann
    Axel Munk
    Max Suster
    Walter Zucchini
    Environmental and Ecological Statistics, 2006, 13 : 325 - 347
  • [5] Iterative decoding of two-dimensional hidden Markov models
    Perronnin, F
    Dugelay, JL
    Rose, K
    2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE & MULTIDIMENSIONAL SIGNAL PROCESSING SIGNAL, PROCESSING EDUCATION, 2003, : 329 - 332
  • [6] Hidden Markov models for the burst error statistics of Viterbi decoding
    Chao, CC
    Yao, YL
    IEEE TRANSACTIONS ON COMMUNICATIONS, 1996, 44 (12) : 1620 - 1622
  • [7] CONDITIONAL ITERATIVE DECODING OF TWO DIMENSIONAL HIDDEN MARKOV MODELS
    Sargin, M. E.
    Altinok, A.
    Rose, K.
    Manjunath, B. S.
    2008 15TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1-5, 2008, : 2552 - 2555
  • [8] On Efficient Viterbi Decoding for Hidden semi-Markov Models
    Datta, Ritendra
    Hu, Jianying
    Ray, Bonnie
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 2593 - 2596
  • [9] Hidden Markov chains in generalized linear models
    Turner, TR
    Cameron, MA
    Thomson, PJ
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 1998, 26 (01): : 107 - 125
  • [10] Exact Decoding of a Sequentially Markov Coalescent Model in Genetics
    Ki, Caleb
    Terhorst, Jonathan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2024, 119 (547) : 2242 - 2255