Comparative analysis of chloroplast genomes: Functional annotation, genome-based phylogeny, and deduced evolutionary patterns

被引:95
作者
De las Rivas, J
Lozano, JJ
Ortiz, AR [1 ]
机构
[1] NYU, Mt Sinai Sch Med, Dept Physiol & Biophys, New York, NY 10029 USA
[2] CSIC, Inst Recursos Nat & Agrobiol, Salamanca 37071, Spain
关键词
D O I
10.1101/gr.209402
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
All protein sequences from 19 complete chloroplast genomes (cpDNA) have been studied using a new computational method able to analyze functional correlations among series of protein sequences contained in complete proteomes. First, all open reading frames (ORFs) from the cpDNAs, comprising a total of 2266 protein sequences, were compared against the 3168 proteins from Synechocystis PCC6803 complete genome to find functionally related orthologous proteins. Additionally, all cpDNA genomes were pairwise compared to find orthologous groups not present in cyanobacteria. Annotations in the cluster of othologous proteins database and CyanoBase were used as reference for the functional assignments. Following this protocol, new functional assignments were made for ORFs of unknown function and for ycfs (hypothetical chloroplast frames), which still lack a functional assignment. Using this information, a matrix of functional relationships was derived from profiles of the presence and/or absence of orthologous proteins; the matrix included 1837 proteins in 277 orthologous Clusters. A factor analysis study of this matrix, followed by cluster analysis, allowed us to obtain accurate phylogenetic reconstructions and the detection of genes probably involved in speciation as phylogenetic correlates. Finally, by grouping common evolutionary patterns, we show that it is possible to determine functionally linked protein networks. This has allowed us to suggest putative associations for some unknown ORFs.
引用
收藏
页码:567 / 583
页数:17
相关论文
共 51 条
[1]   Do aligned sequences share the same fold? [J].
Abagyan, RA ;
Batalov, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 273 (01) :355-368
[2]   Plastid genome phylogeny and a model of amino acid substitution for proteins encoded by chloroplast DNA [J].
Adachi, J ;
Waddell, PJ ;
Martin, W ;
Hasegawa, M .
JOURNAL OF MOLECULAR EVOLUTION, 2000, 50 (04) :348-358
[3]  
Allen JF, 2000, PHILOS T R SOC B, V355, P1351, DOI 10.1098/rstb.2000.0697
[4]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[5]   Bioinformatics: From genome data to biological knowledge [J].
Andrade, MA ;
Sander, C .
CURRENT OPINION IN BIOTECHNOLOGY, 1997, 8 (06) :675-683
[6]   PHOTOINHIBITION OF PHOTOSYSTEM-2 - INACTIVATION, PROTEIN DAMAGE AND TURNOVER [J].
ARO, EM ;
VIRGIN, I ;
ANDERSSON, B .
BIOCHIMICA ET BIOPHYSICA ACTA, 1993, 1143 (02) :113-134
[7]   Abnormal regulation of photosynthetic electron transport in a chloroplast ycf9 inactivation mutant [J].
Baena-González, E ;
Gray, JC ;
Tyystjärvi, E ;
Aro, EM ;
Mäenpää, P .
JOURNAL OF BIOLOGICAL CHEMISTRY, 2001, 276 (23) :20795-20802
[8]   Molecular biology of chloroplast biogenesis: gene expression, protein import and intraorganellar sorting [J].
Bauer, J ;
Hiltbrunner, A ;
Kessler, F .
CELLULAR AND MOLECULAR LIFE SCIENCES, 2001, 58 (03) :420-433
[9]   Predicting function: From genes to genomes and back [J].
Bork, P ;
Dandekar, T ;
Diaz-Lazcoz, Y ;
Eisenhaber, F ;
Huynen, M ;
Yuan, YP .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 283 (04) :707-725
[10]   Identification of a functional respiratory complex in chloroplasts through analysis of tobacco mutants containing disrupted plastid ndh genes [J].
Burrows, PA ;
Sazanov, LA ;
Svab, Z ;
Maliga, P ;
Nixon, PJ .
EMBO JOURNAL, 1998, 17 (04) :868-876