Visualizing complex feature interactions and feature sharing in genomic deep neural networks

被引:17
作者
Liu, Ge [1 ]
Zeng, Haoyang [1 ]
Gifford, David K. [1 ]
机构
[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA
关键词
Visualization; Deep neural networks; Combinatorial interactions; DNA; TRANSCRIPTION; PROTEINS; SUZ12;
D O I
10.1186/s12859-019-2957-4
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: Visualization tools for deep learning models typically focus on discovering key input features without considering how such low level features are combined in intermediate layers to make decisions. Moreover, many of these methods examine a network's response to specific input examples that may be insufficient to reveal the complexity of model decision making. Results: We present DeepResolve, an analysis framework for deep convolutional models of genome function that visualizes how input features contribute individually and combinatorially to network decisions. Unlike other methods, DeepResolve does not depend upon the analysis of a predefined set of inputs. Rather, it uses gradient ascent to stochastically explore intermediate feature maps to 1) discover important features, 2) visualize their contribution and interaction patterns, and 3) analyze feature sharing across tasks that suggests shared biological mechanism. We demonstrate the visualization of decision making using our proposed method on deep neural networks trained on both experimental and synthetic data. DeepResolve is competitive with existing visualization tools in discovering key sequence features, and identifies certain negative features and non-additive feature interactions that are not easily observed with existing tools. It also recovers similarities between poorly correlated classes which are not observed by traditional methods. DeepResolve reveals that DeepSEA's learned decision structure is shared across genome annotations including histone marks, DNase hypersensitivity, and transcription factor binding. We identify groups of TFs that suggest known shared biological mechanism, and recover correlation between DNA hypersensitivities and TF/Chromatin marks. Conclusions: DeepResolve is capable of visualizing complex feature contribution patterns and feature interactions that contribute to decision making in genomic deep convolutional networks. It also recovers feature sharing and class similarities which suggest interesting biological mechanisms. DeepResolve is compatible with existing visualization tools and provides complementary insights.
引用
收藏
页数:14
相关论文
共 38 条
  • [11] Discovering epistatic feature interactions from neural network models of regulatory DNA sequences
    Greenside, Peyton
    Shimko, Tyler
    Fordyce, Polly
    Kundaje, Anshul
    [J]. BIOINFORMATICS, 2018, 34 (17) : 629 - 637
  • [12] KRAB-Zinc Finger Proteins and KAP1 Can Mediate Long-Range Transcriptional Repression through Heterochromatin Spreading
    Groner, Anna C.
    Meylan, Sylvain
    Ciuffi, Angela
    Zangger, Nadine
    Ambrosini, Giovanna
    Denervaud, Nicolas
    Bucher, Philipp
    Trono, Didier
    [J]. PLOS GENETICS, 2010, 6 (03):
  • [13] Quantifying similarity between motifs
    Gupta, Shobhit
    Stamatoyannopoulos, John A.
    Bailey, Timothy L.
    Noble, William Stafford
    [J]. GENOME BIOLOGY, 2007, 8 (02)
  • [14] Sequential regulatory activity prediction across chromosomes with convolutional neural networks
    Kelley, David R.
    Reshef, Yakir A.
    Bileschi, Maxwell
    Belanger, David
    McLean, Cory Y.
    Snoek, Jasper
    [J]. GENOME RESEARCH, 2018, 28 (05) : 739 - 750
  • [15] Basset: learning the regulatory code of the accessible genome with deep convolutional neural networks
    Kelley, David R.
    Snoek, Jasper
    Rinn, John L.
    [J]. GENOME RESEARCH, 2016, 26 (07) : 990 - 999
  • [16] Krizhevsky A., 2017, COMMUN ACM, V60, P84, DOI DOI 10.1145/3065386
  • [17] Lanchantin J, 2017, BIOCOMPUT-PAC SYM, P254, DOI 10.1142/9789813207813_0025
  • [18] Lundberg SM, 2017, ADV NEUR IN, V30
  • [19] KRAB-Zinc Finger Proteins: A Repressor Family Displaying Multiple Biological Functions
    Lupo, Angelo
    Cesaro, Elena
    Montano, Giorgia
    Zurlo, Diana
    Izzo, Paola
    Costanzo, Paola
    [J]. CURRENT GENOMICS, 2013, 14 (04) : 268 - 278
  • [20] Suz12 is essential for mouse development and for EZH2 histone methyltransferase activity
    Pasini, D
    Bracken, AP
    Jensen, MR
    Denchi, EL
    Helin, K
    [J]. EMBO JOURNAL, 2004, 23 (20) : 4061 - 4071