DNA indels in coding regions reveal selective constraints on protein evolution in the human lineage

被引:38
作者
de la Chaux, Nicole [1 ,2 ]
Messer, Philipp W. [1 ]
Arndt, Peter F. [1 ]
机构
[1] Max Planck Inst Mol Genet, Dept Computat Mol Biol, D-14195 Berlin, Germany
[2] Univ Zurich, Dept Biochem, CH-8057 Zurich, Switzerland
关键词
D O I
10.1186/1471-2148-7-191
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background: Insertions and deletions of DNA segments (indels) are together with substitutions the major mutational processes that generate genetic variation. Here we focus on recent DNA insertions and deletions in protein coding regions of the human genome to investigate selective constraints on indels in protein evolution. Results: Frequencies of inserted and deleted amino acids differ from background amino acid frequencies in the human proteome. Small amino acids are overrepresented, while hydrophobic, aliphatic and aromatic amino acids are strongly suppressed. Indels are found to be preferentially located in protein regions that do not form important structural domains. Amino acid insertion and deletion rates in genes associated with elementary biochemical reactions (e. g. catalytic activity, ligase activity, electron transport, or catabolic process) are lower compared to those in other genes and are therefore subject to stronger purifying selection. Conclusion: Our analysis indicates that indels in human protein coding regions are subject to distinct levels of selective pressure with regard to their structural impact on the amino acid sequence, as well as to general properties of the genes they are located in. These findings confirm that many commonly accepted characteristics of selective constraints for substitutions are also valid for amino acid insertions and deletions.
引用
收藏
页数:13
相关论文
共 30 条
  • [1] Comparative analysis of amino acid repeats in rodents and humans
    Albà, MM
    Guigó, R
    [J]. GENOME RESEARCH, 2004, 14 (04) : 549 - 554
  • [2] Gene Ontology: tool for the unification of biology
    Ashburner, M
    Ball, CA
    Blake, JA
    Botstein, D
    Butler, H
    Cherry, JM
    Davis, AP
    Dolinski, K
    Dwight, SS
    Eppig, JT
    Harris, MA
    Hill, DP
    Issel-Tarver, L
    Kasarskis, A
    Lewis, S
    Matese, JC
    Richardson, JE
    Ringwald, M
    Rubin, GM
    Sherlock, G
    [J]. NATURE GENETICS, 2000, 25 (01) : 25 - 29
  • [3] Barnes M.R., 2003, BIOINFORMATICS GENET
  • [4] The Protein Data Bank
    Berman, HM
    Westbrook, J
    Feng, Z
    Gilliland, G
    Bhat, TN
    Weissig, H
    Shindyalov, IN
    Bourne, PE
    [J]. NUCLEIC ACIDS RESEARCH, 2000, 28 (01) : 235 - 242
  • [5] Majority of divergence between closely related DNA samples is due to indels
    Britten, RJ
    Rowen, L
    Williams, J
    Cameron, RA
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2003, 100 (08) : 4661 - 4665
  • [6] LAGAN and Multi-LAGAN: Efficient tools for large-scale multiple alignment of genomic DNA
    Brudno, M
    Do, CB
    Cooper, GM
    Kim, MF
    Davydov, E
    Green, ED
    Sidow, A
    Batzoglou, S
    [J]. GENOME RESEARCH, 2003, 13 (04) : 721 - 731
  • [7] Natural selection on protein-coding genes in the human genome
    Bustamante, CD
    Fledel-Alon, A
    Williamson, S
    Nielsen, R
    Hubisz, MT
    Glanowski, S
    Tanenbaum, DM
    White, TJ
    Sninsky, JJ
    Hernandez, RD
    Civello, D
    Adams, MD
    Cargill, M
    Clark, AG
    [J]. NATURE, 2005, 437 (7062) : 1153 - 1157
  • [8] Genomic divergences between humans and other hominoids and the effective population size of the common ancestor of humans and chimpanzees
    Chen, FC
    Li, WH
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2001, 68 (02) : 444 - 456
  • [9] Human-specific insertions and deletions inferred from mammalian genome sequences
    Chen, Feng-Chi
    Chen, Chueng-Jong
    Li, Wen-Hsiung
    Chuang, Trees-Juen
    [J]. GENOME RESEARCH, 2007, 17 (01) : 16 - 22
  • [10] Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios
    Clark, AG
    Glanowski, S
    Nielsen, R
    Thomas, PD
    Kejariwal, A
    Todd, MA
    Tanenbaum, DM
    Civello, D
    Lu, F
    Murphy, B
    Ferriera, S
    Wang, G
    Zheng, XG
    White, TJ
    Sninsky, JJ
    Adams, MD
    Cargill, M
    [J]. SCIENCE, 2003, 302 (5652) : 1960 - 1963