Statistical Assignment of DNA Sequences Using Bayesian Phylogenetics

Times cited: 149
Authors
Munch, Kasper [1]
Boomsma, Wouter [2]
Huelsenbeck, John P. [1]
Willerslev, Eske [3, 4]
Nielsen, Rasmus [1, 3, 5]
Affiliations
[1] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[2] Univ Copenhagen, Bioinformat Ctr, DK-2200 Copenhagen N, Denmark
[3] Univ Copenhagen, Dept Biol, DK-2100 Copenhagen O, Denmark
[4] Univ Copenhagen, Ctr Ancient Genet, DK-2100 Copenhagen O, Denmark
[5] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
Keywords
Assignment; barcoding; Bayesian; phylogenetics
DOI
10.1080/10635150802422316
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
We provide a new automated statistical method for DNA barcoding based on a Bayesian phylogenetic analysis. The method is based on automated database sequence retrieval, alignment, and phylogenetic analysis using a custom-built program for Bayesian phylogenetic analysis. We show on real data that the method outperforms BLAST searches as a measure of confidence and can help eliminate 80% of all false assignments based on the best BLAST hit. However, the most important advance of the method is that it provides statistically meaningful measures of confidence. We apply the method to a re-analysis of previously published ancient DNA data and show that, with high statistical confidence, most of the published sequences are in fact of Neanderthal origin. However, there are several cases of chimeric sequences that are composed of a combination of both Neanderthal and modern human DNA.
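As an illustration of what a "statistically meaningful measure of confidence" could look like in this setting, the sketch below estimates a posterior assignment probability as the fraction of sampled trees in which the query sequence groups with a given taxon. This is a minimal sketch under stated assumptions, not the authors' program: the function names, the input format (one taxon label per sampled tree), the 0.95 threshold, and the toy sample counts are all hypothetical.

```python
from collections import Counter


def assignment_posteriors(sampled_taxa):
    """Estimate posterior assignment probabilities from tree samples.

    sampled_taxa: one label per sampled tree, naming the taxon the query
    sequence grouped with in that tree (hypothetical input format).
    """
    if not sampled_taxa:
        raise ValueError("need at least one sampled tree")
    counts = Counter(sampled_taxa)
    total = len(sampled_taxa)
    # Relative frequency across samples approximates the posterior probability.
    return {taxon: n / total for taxon, n in counts.items()}


def confident_assignment(posteriors, threshold=0.95):
    """Return the best-supported taxon, or None if support is below threshold.

    The 0.95 default is an illustrative assumption, not a value from the paper.
    """
    taxon, prob = max(posteriors.items(), key=lambda kv: kv[1])
    return taxon if prob >= threshold else None


# Made-up toy data: 97 of 100 sampled trees place the query with Neanderthals.
samples = ["Homo neanderthalensis"] * 97 + ["Homo sapiens"] * 3
posteriors = assignment_posteriors(samples)
best = confident_assignment(posteriors)
```

Unlike a best-hit similarity score, a quantity of this kind can be compared against a fixed threshold, so a weakly supported query is left unassigned rather than forced onto its nearest match.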
Pages: 750-757
Page count: 8