IMPROVING THE INTERPRETABILITY OF DEEP NEURAL NETWORKS WITH STIMULATED LEARNING

被引：0

作者：

Tan, Shawn ^{[1
]}

Sim, Khe Chai ^{[1
]}

Gales, Mark ^{[2
]}

机构：

[1] Natl Univ Singapore, Singapore 117548, Singapore

[2] Univ Cambridge, Cambridge CB2 1TN, England

来源：

2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2015年

关键词：

Deep Neural Networks;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Deep Neural Networks (DNNs) have demonstrated improvements in acoustic modelling for automatic speech recognition. However, they are often used as a black box, and not much is understood about what each of the hidden layers does. We seek to understand how the activations in the hidden layers change with different input, and how we can leverage such knowledge to modify the behaviour of the model. To this end, we propose stimulated deep learning where stimuli are introduced during the DNN training process to influence the behaviour of the hidden units. Specifically, constraints are applied so that the hidden units of each layer will exhibit phone-dependent regional activities when arranged in a 2-dimensional grid. We demonstrate that such constraints are able to yield visible activation regions without compromising the classification of the network and suppressing the activations for a region affects the classification accuracy of the corresponding phone more than the others.

引用

页码：617 / 623

页数：7

共 15 条

[1] [Anonymous], P IEEE INT C AC SPEE
[2] [Anonymous], 2014, CORR
[3] Bergstra James, 2010, PYTH SCI C SCIPY, P1
[4] Garson GD., 1991, AI Expert, V6, P46, DOI DOI 10.5555/129449.129452
[5] BACKPROPAGATION NEURAL NETWORKS FOR MODELING COMPLEX-SYSTEMS
GOH, ATC
[J]. ARTIFICIAL INTELLIGENCE IN ENGINEERING, 1995, 9 (03): : 143 - 151
[6] Deep Neural Networks for Acoustic Modeling in Speech Recognition
Hinton, Geoffrey
Deng, Li
Yu, Dong
Dahl, George E.
Mohamed, Abdel-rahman
Jaitly, Navdeep
Senior, Andrew
Vanhoucke, Vincent
Patrick Nguyen
Sainath, Tara N.
Kingsbury, Brian
[J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 82 - 97
[7] Kavukcuoglu K, 2009, PROC CVPR IEEE, P1605, DOI 10.1109/CVPRW.2009.5206545
[8] Mahendran Aravindh, 2014, ARXIV14120035
[9] Povey D., 2011, IEEE 2011 WORKSH AUT
[10] Seide F., 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), P24, DOI 10.1109/ASRU.2011.6163899

← 1 2 →