IMPROVING THE INTERPRETABILITY OF DEEP NEURAL NETWORKS WITH STIMULATED LEARNING

被引:0
作者
Tan, Shawn [1 ]
Sim, Khe Chai [1 ]
Gales, Mark [2 ]
机构
[1] Natl Univ Singapore, Singapore 117548, Singapore
[2] Univ Cambridge, Cambridge CB2 1TN, England
来源
2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU) | 2015年
关键词
Deep Neural Networks;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Deep Neural Networks (DNNs) have demonstrated improvements in acoustic modelling for automatic speech recognition. However, they are often used as a black box, and not much is understood about what each of the hidden layers does. We seek to understand how the activations in the hidden layers change with different input, and how we can leverage such knowledge to modify the behaviour of the model. To this end, we propose stimulated deep learning where stimuli are introduced during the DNN training process to influence the behaviour of the hidden units. Specifically, constraints are applied so that the hidden units of each layer will exhibit phone-dependent regional activities when arranged in a 2-dimensional grid. We demonstrate that such constraints are able to yield visible activation regions without compromising the classification of the network and suppressing the activations for a region affects the classification accuracy of the corresponding phone more than the others.
引用
收藏
页码:617 / 623
页数:7
相关论文
共 15 条
  • [1] [Anonymous], P IEEE INT C AC SPEE
  • [2] [Anonymous], 2014, CORR
  • [3] Bergstra James, 2010, PYTH SCI C SCIPY, P1
  • [4] Garson GD., 1991, AI Expert, V6, P46, DOI DOI 10.5555/129449.129452
  • [5] BACKPROPAGATION NEURAL NETWORKS FOR MODELING COMPLEX-SYSTEMS
    GOH, ATC
    [J]. ARTIFICIAL INTELLIGENCE IN ENGINEERING, 1995, 9 (03): : 143 - 151
  • [6] Deep Neural Networks for Acoustic Modeling in Speech Recognition
    Hinton, Geoffrey
    Deng, Li
    Yu, Dong
    Dahl, George E.
    Mohamed, Abdel-rahman
    Jaitly, Navdeep
    Senior, Andrew
    Vanhoucke, Vincent
    Patrick Nguyen
    Sainath, Tara N.
    Kingsbury, Brian
    [J]. IEEE SIGNAL PROCESSING MAGAZINE, 2012, 29 (06) : 82 - 97
  • [7] Kavukcuoglu K, 2009, PROC CVPR IEEE, P1605, DOI 10.1109/CVPRW.2009.5206545
  • [8] Mahendran Aravindh, 2014, ARXIV14120035
  • [9] Povey D., 2011, IEEE 2011 WORKSH AUT
  • [10] Seide F., 2011, 2011 IEEE Workshop on Automatic Speech Recognition & Understanding (ASRU), P24, DOI 10.1109/ASRU.2011.6163899