Fully interpretable deep learning model of transcriptional control

被引:19
作者
Liu, Yi [1 ]
Barr, Kenneth [2 ]
Reinitz, John [1 ,3 ,4 ,5 ]
机构
[1] Univ Chicago, Inst Genom & Syst Biol, Dept Stat, Chicago, IL 60637 USA
[2] Univ Chicago, Inst Genom & Syst Biol, Dept Human Genet, Chicago, IL 60637 USA
[3] Univ Chicago, Inst Genom & Syst Biol, Dept Ecol & Evolut, Chicago, IL 60637 USA
[4] Univ Chicago, Inst Genom & Syst Biol, Dept Mol Genet, Chicago, IL 60637 USA
[5] Univ Chicago, Inst Genom & Syst Biol, Dept Cell Biol, Chicago, IL 60637 USA
基金
美国国家卫生研究院;
关键词
COOPERATIVE DNA-BINDING; DROSOPHILA; EXPRESSION; ENHANCERS; STRIPE; SEGMENTATION; REPRESSION; MECHANISM; NETWORKS; SEQUENCE;
D O I
10.1093/bioinformatics/btaa506
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: The universal expressibility assumption of Deep Neural Networks (DNNs) is the key motivation behind recent worksin the systems biology community to employDNNs to solve important problems in functional genomics and moleculargenetics. Typically, such investigations have taken a `black box' approach in which the internal structure of themodel used is set purely by machine learning considerations with little consideration of representing the internalstructure of the biological system by the mathematical structure of the DNN. DNNs have not yet been applied to thedetailed modeling of transcriptional control in which mRNA production is controlled by the binding of specific transcriptionfactors to DNA, in part because such models are in part formulated in terms of specific chemical equationsthat appear different in form from those used in neural networks. Results: In this paper, we give an example of a DNN whichcan model the detailed control of transcription in a precise and predictive manner. Its internal structure is fully interpretableand is faithful to underlying chemistry of transcription factor binding to DNA. We derive our DNN from asystems biology model that was not previously recognized as having a DNN structure. Although we apply our DNNto data from the early embryo of the fruit fly Drosophila, this system serves as a test bed for analysis of much larger datasets obtained by systems biology studies on a genomic scale.
引用
收藏
页码:499 / 507
页数:9
相关论文
共 64 条
[41]   Deciphering regulatory DNA sequences and noncoding genetic variants using neural network models of massively parallel reporter assays [J].
Movva, Rajiv ;
Greenside, Peyton ;
Marinov, Georgi K. ;
Nair, Surag ;
Shrikumar, Avanti ;
Kundaje, Anshul .
PLOS ONE, 2019, 14 (06)
[42]  
Nair S., 2019, BIORXIV
[43]   A systematic characterization of factors that regulate Drosophila segmentation via a bacterial one-hybrid system [J].
Noyes, Marcus B. ;
Meng, Xiangdong ;
Wakabayashi, Atsuya ;
Sinha, Saurabh ;
Brodsky, Michael H. ;
Wolfe, Scot A. .
NUCLEIC ACIDS RESEARCH, 2008, 36 (08) :2547-2560
[44]  
Ogawa N, 2012, METHODS MOL BIOL, V786, P51, DOI 10.1007/978-1-61779-292-2_3
[45]   High-resolution analysis of DNA regulatory elements by synthetic saturation mutagenesis [J].
Patwardhan, Rupali P. ;
Lee, Choli ;
Litvin, Oren ;
Young, David L. ;
Pe'er, Dana ;
Shendure, Jay .
NATURE BIOTECHNOLOGY, 2009, 27 (12) :1173-1175
[46]   Recurrent Neural Networks for Sequential Phenotype Prediction in Genomics [J].
Pouladi, Farhad ;
Salehinejad, Hojjat ;
Gilani, Amir Mohammad .
PROCEEDINGS 2015 INTERNATIONAL CONFERENCE ON DEVELOPMENTS IN ESYSTEMS ENGINEERING DESE 2015, 2015, :225-230
[47]   MECHANISM OF EVE STRIPE FORMATION [J].
REINITZ, J ;
SHARP, DH .
MECHANISMS OF DEVELOPMENT, 1995, 49 (1-2) :133-158
[48]  
Reinitz John, 2003, ComPlexUs, V1, P54, DOI 10.1159/000070462
[49]   The regulatory control of Cebpa enhancers and silencers in the myeloid and red-blood cell lineages [J].
Repele, Andrea ;
Krueger, Shawn ;
Bhattacharyya, Tapas ;
Tuineau, Michelle Y. ;
Manu .
PLOS ONE, 2019, 14 (06)
[50]   High-throughput SELEX-SAGE method for quantitative modeling of transcription-factor binding sites [J].
Roulet, E ;
Busso, S ;
Camargo, AA ;
Simpson, AJG ;
Mermod, N ;
Bucher, P .
NATURE BIOTECHNOLOGY, 2002, 20 (08) :831-835