Hold out the genome: a roadmap to solving the cis-regulatory code

Cited by: 25
Authors
de Boer, Carl G. [1]
Taipale, Jussi [2, 3, 4]
Affiliations
[1] Univ British Columbia, Sch Biomed Engn, Vancouver, BC, Canada
[2] Univ Helsinki, Fac Med, Appl Tumor Genom Res Program, Helsinki, Finland
[3] Karolinska Inst, Dept Med Biochem & Biophys, Stockholm, Sweden
[4] Univ Cambridge, Dept Biochem, Cambridge, England
Keywords
ENHANCER ACTIVITY MAPS; TRANSCRIPTION FACTORS; SHADOW ENHANCERS; GENE; SEQUENCE; BINDING; EVOLUTION; EXPRESSION; ELEMENTS; MODEL;
DOI
10.1038/s41586-023-06661-w
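Chinese Library Classification (CLC) number
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification code
07; 0710; 09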
Abstract
Gene expression is regulated by transcription factors that work together to read cis-regulatory DNA sequences. The 'cis-regulatory code' - how cells interpret DNA sequences to determine when, where and how much genes should be expressed - has proven to be exceedingly complex. Recently, advances in the scale and resolution of functional genomics assays and machine learning have enabled substantial progress towards deciphering this code. However, the cis-regulatory code will probably never be solved if models are trained only on genomic sequences; regions of homology can easily lead to overestimation of predictive performance, and our genome is too short and has insufficient sequence diversity to learn all relevant parameters. Fortunately, randomly synthesized DNA sequences enable testing a far larger sequence space than exists in our genomes, and designed DNA sequences enable targeted queries to maximally improve the models. As the same biochemical principles are used to interpret DNA regardless of its source, models trained on these synthetic data can predict genomic activity, often better than genome-trained models. Here we provide an outlook on the field, and propose a roadmap towards solving the cis-regulatory code by a combination of machine learning and massively parallel assays using synthetic DNA.
Pages: 41-50
Page count: 10
Related papers
50 in total
  • [21] The dynamic, combinatorial cis-regulatory lexicon of epidermal differentiation
    Kim, Daniel S.
    Risca, Viviana I.
    Reynolds, David L.
    Chappell, James
    Rubin, Adam J.
    Jung, Namyoung
    Donohue, Laura K. H.
    Lopez-Pajares, Vanessa
    Kathiria, Arwa
    Shi, Minyi
    Zhao, Zhixin
    Deep, Harsh
    Sharmin, Mahfuza
    Rao, Deepti
    Lin, Shin
    Chang, Howard Y.
    Snyder, Michael P.
    Greenleaf, William J.
    Kundaje, Anshul
    Khavari, Paul A.
    NATURE GENETICS, 2021, 53 (11) : 1564 - +
  • [22] Cis-regulatory code for determining the action of Foxd as both an activator and a repressor in ascidian embryos
    Tokuhiro, Shinichi
    Satou, Yutaka
    DEVELOPMENTAL BIOLOGY, 2021, 476 : 11 - 17
  • [23] Seven myths of how transcription factors read the cis-regulatory code
    Zeitlinger, Julia
    CURRENT OPINION IN SYSTEMS BIOLOGY, 2020, 23 : 22 - 31
  • [24] cis-Regulatory Complexity within a Large Non-Coding Region in the Drosophila Genome
    Kundu, Mukta
    Kuzin, Alexander
    Lin, Tzu-Yang
    Lee, Chi-Hon
    Brody, Thomas
    Odenwald, Ward F.
    PLOS ONE, 2013, 8 (04):
  • [25] Predicting tissue specific cis-regulatory modules in the human genome using pairs of co-occurring motifs
    Girgis, Hani Z.
    Ovcharenko, Ivan
    BMC BIOINFORMATICS, 2012, 13
  • [26] A cis-regulatory module activating transcription in the suspensor contains five cis-regulatory elements
    Henry, Kelli F.
    Kawashima, Tomokazu
    Goldberg, Robert B.
    PLANT MOLECULAR BIOLOGY, 2015, 88 (03) : 207 - 217
  • [27] From sequence to consequence: Deciphering the complex cis-regulatory landscape
    Dsilva, Greg Jude
    Galande, Sanjeev
    JOURNAL OF BIOSCIENCES, 2024, 49 (02)
  • [28] In situ functional dissection of RNA cis-regulatory elements by multiplex CRISPR-Cas9 genome engineering
    Wu, Qianxin
    Ferry, Quentin R. V.
    Baeumler, Toni A.
    Michaels, Yale S.
    Vitsios, Dimitrios M.
    Habib, Omer
    Arnold, Roland
    Jiang, Xiaowei
    Maio, Stefano
    Steinkraus, Bruno R.
    Tapia, Marta
    Piazza, Paolo
    Xu, Ni
    Hollander, Georg A.
    Milne, Thomas A.
    Kim, Jin-Soo
    Enright, Anton J.
    Bassett, Andrew R.
    Fulga, Tudor A.
    NATURE COMMUNICATIONS, 2017, 8
  • [29] Spatially varying cis-regulatory divergence in Drosophila embryos elucidates cis-regulatory logic
    Combs, Peter A.
    Fraser, Hunter B.
    PLOS GENETICS, 2018, 14 (11):
  • [30] A universal framework for detecting cis-regulatory diversity in DNA regions
    Biswas, Anushua
    Narlikar, Leelavati
    GENOME RESEARCH, 2021, 31 (09) : 1646 - 1662