Genomic repertoires of DNA-binding transcription factors across the tree of life

被引:106
作者
Charoensawan, Varodom [1 ]
Wilson, Derek [1 ]
Teichmann, Sarah A. [1 ]
机构
[1] MRC Lab Mol Biol, Cambridge CB2 0QH, England
基金
英国医学研究理事会;
关键词
PROTEIN DOMAIN DISCOVERY; GENE-REGULATION; DATABASE; EVOLUTION; WIDE; EXPRESSION; SEQUENCES; BACTERIA; FAMILIES; REVEALS;
D O I
10.1093/nar/gkq617
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Sequence-specific transcription factors (TFs) are important to genetic regulation in all organisms because they recognize and directly bind to regulatory regions on DNA. Here, we survey and summarize the TF resources available. We outline the organisms for which TF annotation is provided, and discuss the criteria and methods used to annotate TFs by different databases. By using genomic TF repertoires from similar to 700 genomes across the tree of life, covering Bacteria, Archaea and Eukaryota, we review TF abundance with respect to the number of genes, as well as their structural complexity in diverse lineages. While typical eukaryotic TFs are longer than the average eukaryotic proteins, the inverse is true for prokaryotes. Only in eukaryotes does the same family of DNA-binding domain (DBD) occur multiple times within one polypeptide chain. This potentially increases the length and diversity of DNA-recognition sequence by reusing DBDs from the same family. We examined the increase in TF abundance with the number of genes in genomes, using the largest set of prokaryotic and eukaryotic genomes to date. As pointed out before, prokaryotic TFs increase faster than linearly. We further observe a similar relationship in eukaryotic genomes with a slower increase in TFs.
引用
收藏
页码:7364 / 7377
页数:14
相关论文
共 103 条
  • [41] Comparative genomics of transcription factors and chromatin proteins in parasitic protists and other eukaryotes[J]. Iyer, Lakshminarayan M.;Anantharaman, Vivek;Wolf, Maxim Y.;Aravind, L. INTERNATIONAL JOURNAL FOR PARASITOLOGY, 2008(01)
  • [42] GENETIC REGULATORY MECHANISMS IN SYNTHESIS OF PROTEINS[J]. JACOB, F;MONOD, J. JOURNAL OF MOLECULAR BIOLOGY, 1961(03)
  • [43] Infrastructure for the life sciences: design and implementation of the UniProt website[J]. Jain, Eric;Bairoch, Amos;Duvaud, Severine;Phan, Isabelle;Redaschi, Nicole;Suzek, Baris E.;Martin, Maria J.;McGarvey, Peter;Gasteiger, Elisabeth. BMC BIOINFORMATICS, 2009
  • [44] Drug discovery with engineered zinc-finger proteins[J]. Jamieson, AC;Miller, JC;Pabo, CO. NATURE REVIEWS DRUG DISCOVERY, 2003(05)
  • [45] A genome-wide and nonredundant mouse transcription factor database[J]. Kanamori, M;Konno, H;Osato, N;Kawai, J;Hayashizaki, Y;Suzuki, H. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2004(03)
  • [46] RegTransBase - a database of regulatory sequences and interactions in a wide range of prokaryotic genomes[J]. Kazakov, Alexei E.;Cipriano, Michael J.;Novichkov, Pavel S.;Minovitsky, Simon;Vinogradov, Dmitry V.;Arkin, Adam;Mironov, Andrey A.;Gelfand, Mikhail S.;Dubchak, Inna. NUCLEIC ACIDS RESEARCH, 2007
  • [47] MATCH™:: a tool for searching transcription factor binding sites in DNA sequences[J]. Kel, AE;Gössling, E;Reuter, I;Cheremushkin, E;Kel-Margoulis, OV;Wingender, E. NUCLEIC ACIDS RESEARCH, 2003(13)
  • [48] Genomics of bacteria and archaea: the emerging dynamic view of the prokaryotic world[J]. Koonin, Eugene V.;Wolf, Yuri I. NUCLEIC ACIDS RESEARCH, 2008(21)
  • [49] DBD: a transcription factor prediction database[J]. Kummerfeld, Sarah K.;Teichmann, Sarah A. NUCLEIC ACIDS RESEARCH, 2006
  • [50] Transcription regulation and animal diversity[J]. Levine, M;Tjian, R. NATURE, 2003(6945)