共 50 条
Transcription factor specificity limits the number of DNA-binding motifs
被引:3
|作者:
Aptekmann, Ariel A.
[1
,2
]
Bulavka, Denys
[1
,3
]
Nadra, Alejandro D.
[4
]
Sanchez, Ignacio E.
[1
]
机构:
[1] Univ Buenos Aires, Fac Ciencias Exactas & Natur, Lab Fisiol Proteinas, Consejo Nacl lnvest Cient & Tecn,Inst Quim Biol F, Buenos Aires, DF, Argentina
[2] Rutgers State Univ, Marine & Coastal Sci Dept, New Brunswick, NJ USA
[3] Univ Buenos Aires, Fac Ciencias Exactas & Nat, Dept Matemat, Buenos Aires, DF, Argentina
[4] Univ Buenos Aires, Fac Ciencias Exactas & Natur, Dept Fisiol Biol Mol & Celular, IB3, Buenos Aires, DF, Argentina
来源:
PLOS ONE
|
2022年
/
17卷
/
01期
关键词:
GENOME-WIDE ANNOTATIONS;
SEQUENCE LOGOS;
BASE;
SWISSREGULON;
DATABASE;
SITES;
D O I:
10.1371/journal.pone.0263307
中图分类号:
O [数理科学和化学];
P [天文学、地球科学];
Q [生物科学];
N [自然科学总论];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
We study the limits imposed by transcription factor specificity on the maximum number of binding motifs that can coexist in a gene regulatory network, using the SwissRegulon Fantom5 collection of 684 human transcription factor binding sites as a model. We describe transcription factor specificity using regular expressions and find that most human transcription factor binding site motifs are separated in sequence space by one to three motif-discriminating positions. We apply theorems based on the pigeonhole principle to calculate the maximum number of transcription factors that can coexist given this degree of specificity, which is in the order of ten thousand and would fully utilize the space of DNA subsequences. Taking into account an expanded DNA alphabet with modified bases can further raise this limit by several orders of magnitude, at a lower level of sequence space usage. Our results may guide the design of transcription factors at both the molecular and system scale.
引用
收藏
页数:13
相关论文