FastML: a web server for probabilistic reconstruction of ancestral sequences

被引:243
作者
Ashkenazy, Haim [1 ]
Penn, Osnat [1 ]
Doron-Faigenboim, Adi [3 ]
Cohen, Ofir [1 ]
Cannarozzi, Gina [2 ]
Zomer, Oren [1 ]
Pupko, Tal [1 ]
机构
[1] Tel Aviv Univ, George S Wise Fac Life Sci, Dept Cell Res & Immunol, IL-69978 Tel Aviv, Israel
[2] Univ Bern, Inst Plant Sci, CH-3013 Bern, Switzerland
[3] ARO, Volcani Ctr, Inst Plant Sci, IL-50250 Bet Dagan, Israel
基金
以色列科学基金会;
关键词
AMINO-ACID-SEQUENCES; MITOCHONDRIAL-DNA; PHYLETIC PATTERNS; PROTEIN SEQUENCES; INFERENCE; MODEL; SUBSTITUTION; SITES; DIVERSITY; ALGORITHM;
D O I
10.1093/nar/gks498
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Ancestral sequence reconstruction is essential to a variety of evolutionary studies. Here, we present the FastML web server, a user-friendly tool for the reconstruction of ancestral sequences. FastML implements various novel features that differentiate it from existing tools: (i) FastML uses an indel-coding method, in which each gap, possibly spanning multiples sites, is coded as binary data. FastML then reconstructs ancestral indel states assuming a continuous time Markov process. FastML provides the most likely ancestral sequences, integrating both indels and characters; (ii) FastML accounts for uncertainty in ancestral states: it provides not only the posterior probabilities for each character and indel at each sequence position, but also a sample of ancestral sequences from this posterior distribution, and a list of the k-most likely ancestral sequences; (iii) FastML implements a large array of evolutionary models, which makes it generic and applicable for nucleotide, protein and codon sequences; and (iv) a graphical representation of the results is provided, including, for example, a graphical logo of the inferred ancestral sequences. The utility of FastML is demonstrated by reconstructing ancestral sequences of the Env protein from various HIV-1 subtypes. FastML is freely available for all academic users and is available online at http://fastml.tau.ac.il/.
引用
收藏
页码:W580 / W584
页数:5
相关论文
共 37 条
  • [1] Plastid genome phylogeny and a model of amino acid substitution for proteins encoded by chloroplast DNA
    Adachi, J
    Waddell, PJ
    Martin, W
    Hasegawa, M
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 2000, 50 (04) : 348 - 358
  • [2] Adachi J, 1996, J MOL EVOL, V42, P459
  • [3] Reconstructing large regions of an ancestral mammalian genome in silico
    Blanchette, M
    Green, ED
    Miller, W
    Haussler, D
    [J]. GENOME RESEARCH, 2004, 14 (12) : 2412 - 2423
  • [4] Recreating a functional ancestral archosaur visual pigment
    Chang, BSW
    Jönsson, K
    Kazmi, MA
    Donoghue, MJ
    Sakmar, TP
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (09) : 1483 - 1489
  • [5] A likelihood framework to analyse phyletic patterns
    Cohen, Ofir
    Rubinstein, Nimrod D.
    Stern, Adi
    Gophna, Uri
    Pupko, Tal
    [J]. PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2008, 363 (1512) : 3903 - 3911
  • [6] Inference of Gain and Loss Events from Phyletic Patterns Using Stochastic Mapping and Maximum Parsimony-A Simulation Study
    Cohen, Ofir
    Pupko, Tal
    [J]. GENOME BIOLOGY AND EVOLUTION, 2011, 3 : 1265 - 1275
  • [7] Utilizing natural diversity to evolve protein function: applications towards thermostability
    Cole, Megan F.
    Gaucher, Eric A.
    [J]. CURRENT OPINION IN CHEMICAL BIOLOGY, 2011, 15 (03) : 399 - 406
  • [8] WebLogo: A sequence logo generator
    Crooks, GE
    Hon, G
    Chandonia, JM
    Brenner, SE
    [J]. GENOME RESEARCH, 2004, 14 (06) : 1188 - 1190
  • [9] Dayhoff M O., 1978, Atlas of Protein Seq Struct, ppp 345
  • [10] A combined empirical and mechanistic codon model
    Doron-Faigenboim, Adi
    Pupko, Tal
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2007, 24 (02) : 388 - 397