Comparative analysis of amino acid usage and protein length distribution between alternatively and non-alternatively spliced genes across six eukaryotic genomes

被引:19
作者
Zhuang, YL
Ma, F [1 ]
Li-Ling, J
Xu, XF
Li, YD
机构
[1] Tsinghua Univ, Inst Bioinformat, Beijing 100084, Peoples R China
[2] China Med Univ, Dept Med Genet, Shenyang, Peoples R China
[3] Nanjing Normal Univ, Life Sci Coll, Nanjing, Peoples R China
关键词
amino acid usage; alternatively spliced genes; protein length distribution; eukaryotes; evolution;
D O I
10.1093/molbev/msg203
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Alternative splicing has been discovered in nearly all metazoan organisms as a mechanism to increase the diversity of gene products. However, the origin and evolution of alternatively spliced genes are still poorly understood. To understand the mechanisms for the evolution of alternatively spliced genes, it may be important to study the differences between alternatively and non-alternatively spliced genes. The aim of this research was to compare amino acid usage and protein length distribution between alternatively and non-alternatively spliced genes across six nearly complete eukaryotic genomes, including those of human (Homo sapiens), mouse (Mus musculus), rat (Rattus norvegicus), fruit fly (Drosophila melanogaster), Coenorhabditis elegans, and bovine (Bos taurus). Our results have suggested the following: (1) across the six species, alternatively and non-alternatively spliced genes have very similar tendency for amino acids usage for not only the overall scale but also those highly expressed genes, with all of the highly expressed genes having preferred amino acids including A, E, G, K, L, P, S, V, R, T, and D. (2) For not only the overall genes but also those highly expressed ones, the average length of the protein products of alternatively spliced genes is significantly greater than that of non-alternatively spliced ones. In contrast, distributions of protein lengths for the two groups of genes are very similar among all six species. Based on these results, we propose that alternatively spliced genes may have originated from non-alternatively spliced ones through events such as DNA mutations or gene fusion.
引用
收藏
页码:1978 / 1985
页数:8
相关论文
共 44 条