Characterizing the Native Codon Usages of a Genome: An Axis Projection Approach

Cited by: 19
Authors
Davis, James J. [1, 2]
Olsen, Gary J. [1, 2]
Affiliations
[1] Univ Illinois, Dept Microbiol, Urbana, IL 61801 USA
[2] Univ Illinois, Inst Genom Biol, Urbana, IL 61801 USA
Funding
National Aeronautics and Space Administration (NASA); National Institutes of Health (NIH)
Keywords
horizontal gene transfer; foreign genes; codon adaptation index; factorial correspondence analysis; GENE-EXPRESSION LEVEL; COLI TRANSFER-RNAS; AMINO-ACID USAGE; ESCHERICHIA-COLI; PSEUDOMONAS-AERUGINOSA; RESPECTIVE CODONS; BACTERIAL GENOMES; ADAPTATION INDEX; CATALOG USAGE; PROTEIN GENES;
DOI
10.1093/molbev/msq185
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Codon usage can provide insights into the nature of the genes in a genome. Genes that are "native" to a genome (have not been recently acquired by horizontal transfer) range in codon usage from a low-bias "typical" usage to a more biased "high-expression" usage characteristic of genes encoding abundant proteins. Genes that differ from these native codon usages are candidates for foreign genes that have been recently acquired by horizontal gene transfer. In this study, we present a method for characterizing the codon usages of native genes, both typical and highly expressed, within a genome. Each gene is evaluated relative to a half line (or axis) in a 59-dimensional space of codon usage. The axis begins at the modal codon usage, the usage that matches the largest number of genes in the genome, and it passes through a point representing the codon usage of a set of genes with expression-related bias. A gene whose codon usage matches (does not significantly differ from) a point on this axis is a candidate native gene, and the location of its projection onto the axis provides a general estimate of its expression level. A gene that differs significantly from all points on the axis is a candidate foreign gene. This automated approach offers significant improvements over existing methods. We illustrate this by analyzing the genomes of Pseudomonas aeruginosa PAO1 and Bacillus anthracis A0248, which can be difficult to analyze with commonly used methods due to their biased base compositions. Finally, we use this approach to measure the proportion of candidate foreign genes in 923 bacterial and archaeal genomes. The organisms with the most homogeneous genomes (containing the fewest candidate foreign genes) are mostly endosymbionts and parasites, though with exceptions that include Pelagibacter ubique and Beutenbergia cavernae. The organisms with the most heterogeneous genomes (containing the most candidate foreign genes) include members of the genera Bacteroides, Corynebacterium, Desulfotalea, Neisseria, Xylella, and Thermobaculum.
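The geometric step described in the abstract, projecting a gene's codon usage onto a half line that starts at the modal usage and passes through the high-expression usage, can be sketched as follows. This is only an illustration under stated assumptions: the abstract does not say how distance or significance is computed, so the sketch uses plain Euclidean distance and does not implement the paper's statistical test for whether a gene "matches" the axis. The function name `project_onto_axis` and its parameters are hypothetical.

```python
import math


def project_onto_axis(gene, modal, high_expr):
    """Project a codon-usage vector onto the half line from modal through high_expr.

    All three arguments are equal-length sequences of codon frequencies
    (59 values in the setting the abstract describes; any length works here).

    Returns (t, distance):
      t        -- position of the projection along the axis, where 0 is the
                  modal usage and 1 is the high-expression usage; clamped at 0
                  because the axis is a half line starting at the modal usage.
      distance -- Euclidean distance from the gene to its projection
                  (ASSUMPTION: the abstract does not specify the distance measure).
    """
    direction = [h - m for h, m in zip(high_expr, modal)]
    offset = [g - m for g, m in zip(gene, modal)]

    denom = sum(d * d for d in direction)
    if denom == 0:
        raise ValueError("modal and high-expression usages must differ")

    # Scalar projection of the gene onto the axis direction, clamped to t >= 0.
    t = max(0.0, sum(o * d for o, d in zip(offset, direction)) / denom)

    closest = [m + t * d for m, d in zip(modal, direction)]
    distance = math.sqrt(sum((g - c) ** 2 for g, c in zip(gene, closest)))
    return t, distance
```

In this sketch, a small `distance` would correspond to a candidate native gene, with `t` serving as a rough indicator of expression-related bias, while a large `distance` would flag a candidate foreign gene. Any threshold on `distance` would have to come from a significance test, which the abstract mentions but does not detail.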
Pages: 211-221
Page count: 11