Bio plus plus : Efficient Extensible Libraries and Tools for Computational Molecular Evolution

被引:123
作者
Gueguen, Laurent [1 ]
Gaillard, Sylvain [2 ,3 ,4 ]
Boussau, Bastien [1 ,5 ]
Gouy, Manolo [1 ]
Groussin, Mathieu [1 ]
Rochette, Nicolas C. [1 ]
Bigot, Thomas [1 ]
Fournier, David [6 ]
Pouyet, Fanny [1 ]
Cahais, Vincent [7 ]
Bernard, Aurelien [7 ]
Scornavacca, Celine [7 ]
Nabholz, Benoit [7 ]
Haudry, Annabelle [1 ]
Dachary, Loic
Galtier, Nicolas [7 ]
Belkhir, Khalid [7 ]
Dutheil, Julien Y. [7 ,8 ]
机构
[1] Univ Lyon, CNRS, INRIA, Lab Biometrie & Biol Evolut, Villeurbanne, France
[2] INRA, Inst Rech Hort & Semences, F-49000 Angers, France
[3] Agrocampus Ouest, Inst Rech Hort & Semences, Angers, France
[4] LUNAM Univ, Univ Angers, Inst Rech Hort & Semences, Angers, France
[5] Univ Calif Berkeley, Dept Integrat Biol, Berkeley, CA 94720 USA
[6] Max Delbrueck Ctr Mol Med, Computat Biol & Data Min Grp, Berlin, Germany
[7] Univ Montpellier 2, Inst Sci Evolut, Montpellier, France
[8] Max Planck Inst Terr Microbiol, Dept Organism Interact, D-35043 Marburg, Germany
基金
欧洲研究理事会;
关键词
bioinformatics; models of sequence evolution; phylogeny; C plus plus libraries; AMINO-ACID SITES; NUCLEOTIDE SUBSTITUTION; CODON-SUBSTITUTION; MODELS; SELECTION; DATABASES; PRESSURE; ACCURATE; RATES;
D O I
10.1093/molbev/mst097
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Efficient algorithms and programs for the analysis of the ever-growing amount of biological sequence data are strongly needed in the genomics era. The pace at which new data and methodologies are generated calls for the use of preexisting, optimized-yet extensible-code, typically distributed as libraries or packages. This motivated the Bio++ project, aiming at developing a set of C++ libraries for sequence analysis, phylogenetics, population genetics, and molecular evolution. The main attractiveness of Bio++ is the extensibility and reusability of its components through its object-oriented design, without compromising the computer-efficiency of the underlying methods. We present here the second major release of the libraries, which provides an extended set of classes and methods. These extensions notably provide built-in access to sequence databases and new data structures for handling and manipulating sequences from the omics era, such as multiple genome alignments and sequencing reads libraries. More complex models of sequence evolution, such as mixture models and generic n-tuples alphabets, are also included.
引用
收藏
页码:1745 / 1750
页数:6
相关论文
共 41 条
[1]   Accurate Estimation of Substitution Rates with Neighbor-Dependent Models in a Phylogenetic Context [J].
Berard, Jean ;
Gueguen, Laurent .
SYSTEMATIC BIOLOGY, 2012, 61 (03) :510-521
[2]  
Blanchette J., 2008, C++ GUI Programming with Qt 4, V2nd
[3]   Genome-scale coestimation of species and gene trees [J].
Boussau, Bastien ;
Szoellosi, Gergely J. ;
Duret, Laurent ;
Gouy, Manolo ;
Tannier, Eric ;
Daubin, Vincent .
GENOME RESEARCH, 2013, 23 (02) :323-330
[4]   Proteome-Wide Analysis of Functional Divergence in Bacteria: Exploring a Host of Ecological Adaptations [J].
Caffrey, Brian E. ;
Williams, Tom A. ;
Jiang, Xiaowei ;
Toft, Christina ;
Hokamp, Karsten ;
Fares, Mario A. .
PLOS ONE, 2012, 7 (04)
[5]  
Durbin R., 1998, Biological sequence analysis: probabilistic models of proteins and nucleic acids
[6]   Bio++:: a set of C++ libraries for sequence analysis, phylogenetics, molecular evolution and population genetics [J].
Dutheil, Julien ;
Gaillard, Sylvain ;
Bazin, Eric ;
Glemin, Sylvain ;
Ranwez, Vincent ;
Galtier, Nicolas ;
Belkhir, Khalid .
BMC BIOINFORMATICS, 2006, 7 (1)
[7]   Non-homogeneous models of sequence evolution in the Bio++ suite of libraries and programs [J].
Dutheil, Julien ;
Boussau, Bastien .
BMC EVOLUTIONARY BIOLOGY, 2008, 8 (1)
[8]   Efficient Selection of Branch-Specific Models of Sequence Evolution [J].
Dutheil, Julien Y. ;
Galtier, Nicolas ;
Romiguier, Jonathan ;
Douzery, Emmanuel J. P. ;
Ranwez, Vincent ;
Boussau, Bastien .
MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (07) :1861-1874
[9]   EVOLUTION OF STRUCTURE OF FERREDOXIN BASED ON LIVING RELICS OF PRIMITIVE AMINO ACID SEQUENCES [J].
ECK, RV ;
DAYHOFF, MO .
SCIENCE, 1966, 152 (3720) :363-&
[10]  
Felsenstein J., 2003, Inferring phylogenies