FUNYBASE: a FUNgal phYlogenomic dataBASE

Cited by: 55
Authors
Marthey, Sylvain [2]
Aguileta, Gabriela [1, 2]
Rodolphe, Francois [2]
Gendrault, Annie [2]
Giraud, Tatiana [1]
Fournier, Elisabeth [3]
Lopez-Villavicencio, Manuela [4]
Gautier, Angelique [5]
Lebrun, Marc-Henri [6]
Chiapello, Helene [2]
Affiliations
[1] Univ Paris 11, CNRS, UMR ESE, F-91405 Orsay, France
[2] INRA, UR MIG, F-78350 Domaine De Vilvert, France
[3] AgroSup, INRA, UMR BGPI, CIRAD, F-34398 Montpellier 5, France
[4] MNHN, Dept Syst & Evolut, F-75005 Paris, France
[5] Ctr INRA Versailles, INRA, UMR BIOGER, F-78026 Versailles, France
[6] Univ Lyon 1, CNRS, UMR MAP, INSA, BAYER CS, F-69009 Lyon, France
DOI
10.1186/1471-2105-9-456
Chinese Library Classification (CLC) number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Background: The increasing availability of fungal genome sequences provides large numbers of proteins for evolutionary and phylogenetic analyses. However, the heterogeneity of the data, including the variable quality of genome annotation and the difficulty of retrieving true orthologs, makes such investigations challenging. The aim of this study was to provide a reliable and integrated resource of orthologous gene families for comparative and phylogenetic analyses in fungi.
Description: FUNYBASE is a database dedicated to the analysis of fungal single-copy genes extracted from available fungal genome sequences, their classification into reliable clusters of orthologs, and the assessment of their informative value for phylogenetic reconstruction based on amino acid sequences. The current release of FUNYBASE contains two types of protein data: (i) a complete set of protein sequences extracted from 30 public fungal genomes and classified into clusters of orthologs using a robust automated procedure, and (ii) a subset of 246 reliable ortholog clusters present as single-copy genes in 21 fungal genomes. For each of these 246 ortholog clusters, phylogenetic trees were reconstructed based on their amino acid sequences. To assess the informative value of each ortholog cluster, its tree was compared to a reference species tree constructed using a concatenation of roughly half of the 246 sequences that are best approximated by the WAG evolutionary model. The orthologs were classified according to a topological score, which measures their ability to recover the same topology as the reference species tree. The full results of these analyses are available online through a user-friendly interface that allows searches by species name, ortholog cluster, or keyword, or with the BLAST algorithm. Examples of fruitful use of FUNYBASE for investigating fungal phylogenetics are also presented.
Conclusion: FUNYBASE constitutes a novel and useful resource for two types of analyses: (i) comparative studies can be greatly facilitated by reliable clusters of orthologs across sets of user-defined fungal genomes, and (ii) phylogenetic reconstruction can be improved by identifying genes with the highest informative value at the desired taxonomic level.
Pages: 10