Sequence Discriminative Distributed Training of Long Short-Term Memory Recurrent Neural Networks

Cited by: 0
Authors
Sak, Hasim [1]
Vinyals, Oriol [1]
Heigold, Georg [1]
Senior, Andrew [1]
McDermott, Erik [1]
Monga, Rajat [1]
Mao, Mark [1]
Affiliations
[1] Google, Mountain View, CA 94043 USA
Source
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | 2014
Keywords
recurrent neural network; long short-term memory; sequence discriminative training; acoustic modeling;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We recently showed that Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) outperform state-of-the-art deep neural networks (DNNs) for large-scale acoustic modeling when the models are trained with the cross-entropy (CE) criterion. It has also been shown that sequence discriminative training of DNNs initially trained with the CE criterion gives significant improvements. In this paper, we investigate sequence discriminative training of LSTM RNNs in a large-scale acoustic modeling task. We train the models in a distributed manner using asynchronous stochastic gradient descent. We compare two sequence discriminative criteria, maximum mutual information (MMI) and state-level minimum Bayes risk (sMBR), and we investigate a number of variations of the basic training strategy to better understand the issues raised by both the sequential model and the objective function. We obtain significant gains over the CE-trained LSTM RNN model using sequence discriminative training techniques.
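The two criteria named in the abstract can be illustrated on a toy example. The sketch below is not the authors' implementation: it assumes the competing hypotheses are given as a short explicit list (real systems sum over lattices), and the function names, the `kappa` acoustic scale, and the input layout are all hypothetical choices made for illustration. MMI is computed as the log posterior of the reference hypothesis, and sMBR as the posterior-weighted expected state accuracy.

```python
import math


def hypothesis_posteriors(acoustic_logliks, lm_logprobs, kappa=1.0):
    """Posterior over an explicit hypothesis list (illustrative stand-in for a lattice).

    Each hypothesis score is kappa * acoustic log-likelihood + LM log-probability.
    """
    scores = [kappa * a + l for a, l in zip(acoustic_logliks, lm_logprobs)]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mmi_objective(acoustic_logliks, lm_logprobs, ref_index, kappa=1.0):
    """MMI for one utterance: log posterior of the reference hypothesis."""
    post = hypothesis_posteriors(acoustic_logliks, lm_logprobs, kappa)
    return math.log(post[ref_index])


def smbr_objective(acoustic_logliks, lm_logprobs, state_accuracies, kappa=1.0):
    """sMBR for one utterance: expected state-level accuracy under the posterior.

    state_accuracies[i] is the number of frames whose state label in
    hypothesis i matches the reference alignment (assumed precomputed).
    """
    post = hypothesis_posteriors(acoustic_logliks, lm_logprobs, kappa)
    return sum(p * acc for p, acc in zip(post, state_accuracies))
```

Both objectives are maximized during training; in this toy setting, raising the score of the reference hypothesis increases MMI, while sMBR rewards any hypothesis in proportion to how many frame-level states it gets right.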
Pages: 1209 - 1213
Page count: 5
Related papers
  • [1] On Speaker Adaptation of Long Short-Term Memory Recurrent Neural Networks
    Miao, Yajie
    Metze, Florian
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 1101 - 1105
  • [2] SEQUENCE-DISCRIMINATIVE TRAINING OF RECURRENT NEURAL NETWORKS
    Voigtlaender, Paul
    Doetsch, Patrick
    Wiesler, Simon
    Schlueter, Ralf
    Ney, Hermann
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 2100 - 2104
  • [3] Collective Anomaly Detection Based on Long Short-Term Memory Recurrent Neural Networks
    Bontemps, Loic
    Van Loi Cao
    McDermott, James
    Nhien-An Le-Khac
    FUTURE DATA AND SECURITY ENGINEERING, FDSE 2016, 2016, 10018 : 141 - 152
  • [4] Handwriting Recognition with Large Multidimensional Long Short-Term Memory Recurrent Neural Networks
    Voigtlaender, Paul
    Doetsch, Patrick
    Ney, Hermann
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 228 - 233
  • [5] Non-Uniform MCE Training of Deep Long Short-Term Memory Recurrent Neural Networks for Keyword Spotting
    Meng, Zhong
    Juang, Biing-Hwang
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3547 - 3551
  • [6] Detecting Overlapping Speech with Long Short-Term Memory Recurrent Neural Networks
    Geiger, Juergen T.
    Eyben, Florian
    Schuller, Bjoern
    Rigoll, Gerhard
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1667 - 1671
  • [7] Long short-term memory-based deep recurrent neural networks for target tracking
    Gao, Chang
    Yan, Junkun
    Zhou, Shenghua
    Varshney, Pramod K.
    Liu, Hongwei
    INFORMATION SCIENCES, 2019, 502 : 279 - 296
  • [8] Forecasting hotel reservations with long short-term memory-based recurrent neural networks
    Wang, Jian
    Duggasani, Amar
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2020, 9 (01) : 77 - 94
  • [9] Long short-term memory-based recurrent neural networks for nonlinear target tracking
    Gao, Chang
    Yan, Junkun
    Zhou, Shenghua
    Chen, Bo
    Liu, Hongwei
    SIGNAL PROCESSING, 2019, 164 : 67 - 73