Hold out the genome: a roadmap to solving the cis-regulatory code

被引:25
作者
de Boer, Carl G. [1 ]
Taipale, Jussi [2 ,3 ,4 ]
机构
[1] Univ British Columbia, Sch Biomed Engn, Vancouver, BC, Canada
[2] Univ Helsinki, Fac Med, Appl Tumor Genom Res Program, Helsinki, Finland
[3] Karolinska Inst, Dept Med Biochem & Biophys, Stockholm, Sweden
[4] Univ Cambridge, Dept Biochem, Cambridge, England
关键词
ENHANCER ACTIVITY MAPS; TRANSCRIPTION FACTORS; SHADOW ENHANCERS; GENE; SEQUENCE; BINDING; EVOLUTION; EXPRESSION; ELEMENTS; MODEL;
D O I
10.1038/s41586-023-06661-w
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Gene expression is regulated by transcription factors that work together to read cis-regulatory DNA sequences. The 'cis-regulatory code' - how cells interpret DNA sequences to determine when, where and how much genes should be expressed - has proven to be exceedingly complex. Recently, advances in the scale and resolution of functional genomics assays and machine learning have enabled substantial progress towards deciphering this code. However, the cis-regulatory code will probably never be solved if models are trained only on genomic sequences; regions of homology can easily lead to overestimation of predictive performance, and our genome is too short and has insufficient sequence diversity to learn all relevant parameters. Fortunately, randomly synthesized DNA sequences enable testing a far larger sequence space than exists in our genomes, and designed DNA sequences enable targeted queries to maximally improve the models. As the same biochemical principles are used to interpret DNA regardless of its source, models trained on these synthetic data can predict genomic activity, often better than genome-trained models. Here we provide an outlook on the field, and propose a roadmap towards solving the cis-regulatory code by a combination of machine learning and massively parallel assays using synthetic DNA.
引用
收藏
页码:41 / 50
页数:10
相关论文
共 50 条
  • [41] Deciphering cis-regulatory control in inflammatory cells
    Ghisletti, Serena
    Natoli, Gioacchino
    PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2013, 368 (1620)
  • [42] Genome-wide prediction of cis-regulatory regions using supervised deep learning methods
    Li, Yifeng
    Shi, Wenqiang
    Wasserman, Wyeth W.
    BMC BIOINFORMATICS, 2018, 19
  • [43] In silico evolution of the hunchback gene indicates redundancy in cis-regulatory organization and spatial gene expression
    Zagrijchuk, Elizaveta A.
    Sabirov, Marat A.
    Holloway, David M.
    Spirov, Alexander V.
    JOURNAL OF BIOINFORMATICS AND COMPUTATIONAL BIOLOGY, 2014, 12 (02)
  • [44] Exonic remnants of whole-genome duplication reveal cis-regulatory function of coding exons
    Dong, Xianjun
    Navratilova, Pavla
    Fredman, David
    Drivenes, Oyvind
    Becker, Thomas S.
    Lenhard, Boris
    NUCLEIC ACIDS RESEARCH, 2010, 38 (04) : 1071 - 1085
  • [45] In Planta Analysis of a cis-Regulatory Cytokinin Response Motif in Arabidopsis and Identification of a Novel Enhancer Sequence
    Ramireddy, Eswarayya
    Brenner, Wolfram G.
    Pfeifer, Andreas
    Heyl, Alexander
    Schmuelling, Thomas
    PLANT AND CELL PHYSIOLOGY, 2013, 54 (07) : 1079 - 1092
  • [46] A dual cis-regulatory code links IRF8 to constitutive and inducible gene expression in macrophages
    Mancino, Alessandra
    Termanini, Alberto
    Barozzi, Iros
    Ghisletti, Serena
    Ostuni, Renato
    Prosperini, Elena
    Ozato, Keiko
    Natoli, Gioacchino
    GENES & DEVELOPMENT, 2015, 29 (04) : 394 - 408
  • [47] Cis-regulatory control of mammalian Trps1 gene expression
    Abrar, Muhammad
    Ali, Shahid
    Hussain, Irfan
    Khatoon, Hizran
    Batool, Fatima
    Ghazanfar, Shakira
    Corcoran, Dylan
    Kawakami, Yasuhiko
    Abbasi, Amir Ali
    JOURNAL OF EXPERIMENTAL ZOOLOGY PART B-MOLECULAR AND DEVELOPMENTAL EVOLUTION, 2024, 342 (02) : 85 - 100
  • [48] Fine-mapping cis-regulatory variants in diverse human populations
    Tehranchi, Ashley
    Hie, Brian
    Dacre, Michael
    Kaplow, Irene
    Pettie, Kade
    Combs, Peter
    Fraser, Hunter B.
    ELIFE, 2019, 8
  • [49] Correlating Gene Expression Variation with cis-Regulatory Polymorphism in Saccharomyces cerevisiae
    Chen, Kevin
    van Nimwegen, Erik
    Rajewsky, Nikolaus
    Siegal, Mark L.
    GENOME BIOLOGY AND EVOLUTION, 2010, 2 : 697 - 707
  • [50] Patterns of variation in cis-regulatory regions: examining evidence of purifying selection
    Naidoo, Thijessen
    Sjodin, Per
    Schlebusch, Carina
    Jakobsson, Mattias
    BMC GENOMICS, 2018, 19