Visualizing complex feature interactions and feature sharing in genomic deep neural networks

Cited by: 17

Authors:

Liu, Ge [1]

Zeng, Haoyang [1]

Gifford, David K. [1]

Affiliations:

[1] MIT, Comp Sci & Artificial Intelligence Lab, 77 Massachusetts Ave, Cambridge, MA 02139 USA

Source:

BMC BIOINFORMATICS | 2019 / Volume 20 / Issue 1

Keywords:

Visualization; Deep neural networks; Combinatorial interactions; DNA; TRANSCRIPTION; PROTEINS; SUZ12

DOI:

10.1186/s12859-019-2957-4

Chinese Library Classification (CLC) number:

Q5 [Biochemistry]

Discipline classification codes:

071010; 081704

Abstract:

Background: Visualization tools for deep learning models typically focus on discovering key input features without considering how such low level features are combined in intermediate layers to make decisions. Moreover, many of these methods examine a network's response to specific input examples that may be insufficient to reveal the complexity of model decision making.

Results: We present DeepResolve, an analysis framework for deep convolutional models of genome function that visualizes how input features contribute individually and combinatorially to network decisions. Unlike other methods, DeepResolve does not depend upon the analysis of a predefined set of inputs. Rather, it uses gradient ascent to stochastically explore intermediate feature maps to 1) discover important features, 2) visualize their contribution and interaction patterns, and 3) analyze feature sharing across tasks that suggests shared biological mechanism. We demonstrate the visualization of decision making using our proposed method on deep neural networks trained on both experimental and synthetic data. DeepResolve is competitive with existing visualization tools in discovering key sequence features, and identifies certain negative features and non-additive feature interactions that are not easily observed with existing tools. It also recovers similarities between poorly correlated classes which are not observed by traditional methods. DeepResolve reveals that DeepSEA's learned decision structure is shared across genome annotations including histone marks, DNase hypersensitivity, and transcription factor binding. We identify groups of TFs that suggest known shared biological mechanism, and recover correlation between DNA hypersensitivities and TF/Chromatin marks.

Conclusions: DeepResolve is capable of visualizing complex feature contribution patterns and feature interactions that contribute to decision making in genomic deep convolutional networks. It also recovers feature sharing and class similarities which suggest interesting biological mechanisms. DeepResolve is compatible with existing visualization tools and provides complementary insights.

Pages: 14