Evolutionary comparisons suggest many novel cAMP response protein binding sites in Escherichia coli

被引:36
作者
Brown, CT
Callan, CG [1 ]
机构
[1] CALTECH, Div Biol, Pasadena, CA 91125 USA
[2] Princeton Univ, Joseph Henry Labs, Princeton, NJ 08540 USA
关键词
D O I
10.1073/pnas.0308628100
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The cAMP response protein (CRP) is a transcription factor known to regulate many genes in Escherichia coli. Computational studies of transcription factor binding to DNA are usually based on a simple matrix model of sequence-dependent binding energy. For CRP, this model predicts many binding sites that are not known to be functional. If they are indeed spurious, the underlying binding model is called into question. We use a species comparison method to assess the functionality of a population of such predicted CRP sites in E. coli. We compare them with orthologous sites in Salmonella typhimurium identified independently by CLUSTALW alignment, and find a dependence of mutation probability on position in the site. This dependence increases with predicted site binding energy. The positions where mutation is most strongly suppressed are those where mutation would have the biggest effect on predicted binding energy. This finding suggests that many of the novel sites are functional, that the matrix model correctly estimates their binding strength, and that calculated CRP binding strength is the quantity that is conserved between species. The analysis also identifies many new E. coli binding sites and genes likely to be functional for CRP.
引用
收藏
页码:2404 / 2409
页数:6
相关论文
共 20 条