Correcting for Sequencing Error in Maximum Likelihood Phylogeny Inference

Cited by: 8
Authors
Kuhner, Mary K. [1 ]
McGill, James [1 ]
Affiliations
[1] Univ Washington, Dept Genome Sci, Seattle, WA 98195 USA
Source
G3-GENES GENOMES GENETICS | 2014 / Vol. 4 / No. 12
Funding
US National Science Foundation
Keywords
sequencing error; phylogeny inference; maximum likelihood; TREES;
DOI
10.1534/g3.114.014365
Chinese Library Classification
Q3 [Genetics];
Discipline classification codes
071007 ; 090102 ;
Abstract
Accurate phylogenies are critical to taxonomy as well as studies of speciation processes and other evolutionary patterns. Accurate branch lengths in phylogenies are critical for dating and rate measurements. Such accuracy may be jeopardized by unacknowledged sequencing error. We use simulated data to test a correction for DNA sequencing error in maximum likelihood phylogeny inference. Over a wide range of data polymorphism and true error rate, we found that correcting for sequencing error improves recovery of the branch lengths, even if the assumed error rate is up to twice the true error rate. Low error rates have little effect on recovery of the topology. When error is high, correction improves topological inference; however, when error is extremely high, using an assumed error rate greater than the true error rate leads to poor recovery of both topology and branch lengths. The error correction approach tested here was proposed in 2004 but has not been widely used, perhaps because researchers do not want to commit to an estimate of the error rate. This study shows that correction with an approximate error rate is generally preferable to ignoring the issue.
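This record does not give the details of the correction itself. One common way to build a sequencing-error correction into a likelihood calculation is to replace the usual 0/1 tip vectors with probabilities of the observed base given each possible true base. The sketch below illustrates that idea for the simplest case, two sequences under a Jukes-Cantor model. The uniform error model (an erroneous read is equally likely to show any of the three other bases), the Jukes-Cantor model, and the function names `tip_partials`, `jc69_transition`, and `site_likelihood` are all assumptions made for illustration, not taken from the paper.

```python
import math

BASES = "ACGT"


def tip_partials(observed, error_rate):
    """P(observed base | true base) for each true base, in BASES order.

    Assumed error model: with probability error_rate the read is wrong,
    and a wrong read is equally likely to be any of the 3 other bases.
    """
    if observed not in BASES:
        raise ValueError(f"unknown base: {observed!r}")
    return [1.0 - error_rate if b == observed else error_rate / 3.0
            for b in BASES]


def jc69_transition(same, t):
    """Jukes-Cantor probability of ending in the same or a different base
    after a branch of length t (expected substitutions per site)."""
    e = math.exp(-4.0 * t / 3.0)
    return 0.25 + 0.75 * e if same else 0.25 - 0.25 * e


def site_likelihood(x, y, t, error_rate):
    """Likelihood of observing bases x and y at one site in two sequences
    separated by total branch length t, summing over the true bases."""
    px = tip_partials(x, error_rate)
    py = tip_partials(y, error_rate)
    total = 0.0
    for i in range(4):          # true base in sequence 1
        for j in range(4):      # true base in sequence 2
            total += 0.25 * px[i] * jc69_transition(i == j, t) * py[j]
    return total


if __name__ == "__main__":
    for eps in (0.0, 0.01):
        print(eps, site_likelihood("A", "G", 0.05, eps))
```

With `error_rate = 0` the function reduces to the ordinary Jukes-Cantor site likelihood. With a positive assumed error rate, a mismatch between two sequences can be explained partly by error rather than by substitution, which is the mechanism by which such a correction would be expected to affect branch-length estimates.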
Pages: 2544-2551
Page count: 8