Fully interpretable deep learning model of transcriptional control

被引:19
作者
Liu, Yi [1 ]
Barr, Kenneth [2 ]
Reinitz, John [1 ,3 ,4 ,5 ]
机构
[1] Univ Chicago, Inst Genom & Syst Biol, Dept Stat, Chicago, IL 60637 USA
[2] Univ Chicago, Inst Genom & Syst Biol, Dept Human Genet, Chicago, IL 60637 USA
[3] Univ Chicago, Inst Genom & Syst Biol, Dept Ecol & Evolut, Chicago, IL 60637 USA
[4] Univ Chicago, Inst Genom & Syst Biol, Dept Mol Genet, Chicago, IL 60637 USA
[5] Univ Chicago, Inst Genom & Syst Biol, Dept Cell Biol, Chicago, IL 60637 USA
基金
美国国家卫生研究院;
关键词
COOPERATIVE DNA-BINDING; DROSOPHILA; EXPRESSION; ENHANCERS; STRIPE; SEGMENTATION; REPRESSION; MECHANISM; NETWORKS; SEQUENCE;
D O I
10.1093/bioinformatics/btaa506
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The universal expressibility assumption of Deep Neural Networks (DNNs) is the key motivation behind recent worksin the systems biology community to employDNNs to solve important problems in functional genomics and moleculargenetics. Typically, such investigations have taken a `black box' approach in which the internal structure of themodel used is set purely by machine learning considerations with little consideration of representing the internalstructure of the biological system by the mathematical structure of the DNN. DNNs have not yet been applied to thedetailed modeling of transcriptional control in which mRNA production is controlled by the binding of specific transcriptionfactors to DNA, in part because such models are in part formulated in terms of specific chemical equationsthat appear different in form from those used in neural networks. Results: In this paper, we give an example of a DNN whichcan model the detailed control of transcription in a precise and predictive manner. Its internal structure is fully interpretableand is faithful to underlying chemistry of transcription factor binding to DNA. We derive our DNN from asystems biology model that was not previously recognized as having a DNN structure. Although we apply our DNNto data from the early embryo of the fruit fly Drosophila, this system serves as a test bed for analysis of much larger datasets obtained by systems biology studies on a genomic scale.
引用
收藏
页码:499 / 507
页数:9
相关论文
共 64 条
[1]  
Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
[2]   Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning [J].
Alipanahi, Babak ;
Delong, Andrew ;
Weirauch, Matthew T. ;
Frey, Brendan J. .
NATURE BIOTECHNOLOGY, 2015, 33 (08) :831-+
[3]  
[Anonymous], 2009, Probabilistic Graphical Models: Principles and Techniques
[4]  
[Anonymous], INT C LEARNING REPRE
[5]   Genome-Wide Quantitative Enhancer Activity Maps Identified by STARR-seq [J].
Arnold, Cosmas D. ;
Gerlach, Daniel ;
Stelzer, Christoph ;
Boryn, Lukasz M. ;
Rath, Martina ;
Stark, Alexander .
SCIENCE, 2013, 339 (6123) :1074-1077
[6]   A sequence level model of an intact locus predicts the location and function of nonadditive enhancers [J].
Barr, Kenneth A. ;
Reinitz, John .
PLOS ONE, 2017, 12 (07)
[7]   Synthetic enhancer design by in silico compensatory evolution reveals flexibility and constraint in cis-regulation [J].
Barr, Kenneth A. ;
Martinez, Carlos ;
Moran, Jennifer R. ;
Kim, Ah-Ram ;
Ramos, Alexandre F. ;
Reinitz, John .
BMC SYSTEMS BIOLOGY, 2017, 11
[8]   The analysis of novel distal Cebpa enhancers and silencers using a transcriptional model reveals the complex regulatory logic of hematopoietic lineage specification [J].
Bertolino, Eric ;
Reinitz, John ;
Manu .
DEVELOPMENTAL BIOLOGY, 2016, 413 (01) :128-144
[9]  
Boger Z, 1997, IEEE SYS MAN CYBERN, P3030
[10]   Cooperative DNA-binding by Bicoid provides a mechanism for threshold-dependent gene activation in the Drosophila embryo [J].
Burz, DS ;
Rivera-Pomar, R ;
Jäckle, H ;
Hanes, SD .
EMBO JOURNAL, 1998, 17 (20) :5998-6009