Sequence Discriminative Distributed Training of Long Short-Term Memory Recurrent Neural Networks

被引:0
作者
Sak, Hasim [1 ]
Vinyals, Oriol [1 ]
Heigold, Georg [1 ]
Senior, Andrew [1 ]
McDermott, Erik [1 ]
Monga, Rajat [1 ]
Mao, Mark [1 ]
机构
[1] Google, Mountain View, CA 94043 USA
来源
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | 2014年
关键词
recurrent neural network; long short-term memory; sequence discriminative training; acoustic modeling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We recently showed that Long Short-Term Memory (LSTM) recurrent neural networks (RNNs) outperform state-of-the-art deep neural networks (DNNs) for large scale acoustic modeling where the models were trained with the cross-entropy (CE) criterion. It has also been shown that sequence discriminative training of DNNs initially trained with the CE criterion gives significant improvements. In this paper, we investigate sequence discriminative training of LSTM RNNs in a large scale acoustic modeling task. We train the models in a distributed manner using asynchronous stochastic gradient descent optimization technique. We compare two sequence discriminative criteria maximum mutual information and state-level minimum Bayes risk, and we investigate a number of variations of the basic training strategy to better understand issues raised by both the sequential model, and the objective function. We obtain significant gains over the CE trained LSTM RNN model using sequence discriminative training techniques.
引用
收藏
页码:1209 / 1213
页数:5
相关论文
共 50 条
[41]   Long Short-Term Memory Recurrent Neural Network for Tidal Level Forecasting [J].
Yang, Cheng-Hong ;
Wu, Chih-Hsien ;
Hsieh, Chih-Min .
IEEE ACCESS, 2020, 8 (08) :159389-159401
[42]   Long Short-Term Memory Recurrent Neural Network for Automatic Speech Recognition [J].
Oruh, Jane ;
Viriri, Serestina ;
Adegun, Adekanmi .
IEEE ACCESS, 2022, 10 :30069-30079
[43]   Wind Power Prediction based on Recurrent Neural Network with Long Short-Term Memory Units [J].
Dong, Danting ;
Sheng, Zhihao ;
Yang, Tiancheng .
2018 IEEE INTERNATIONAL CONFERENCE ON RENEWABLE ENERGY AND POWER ENGINEERING (REPE 2018), 2018, :34-38
[44]   APPLICATION OF RECURRENT NEURAL NETWORK LONG SHORT-TERM MEMORY MODEL ON EARLY KICK DETECTION [J].
Wang, Junzhe ;
Ozbayoglu, Evren M. .
PROCEEDINGS OF ASME 2022 41ST INTERNATIONAL CONFERENCE ON OCEAN, OFFSHORE & ARCTIC ENGINEERING, OMAE2022, VOL 10, 2022,
[45]   EVALUATING RECURRENT NEURAL NETWORKS AND LONG SHORT-TERM MEMORY FOR AIR POLLUTION FORECASTING: MITIGATING THE IMPACT OF VOLATILE ENVIRONMENTAL FACTORS [J].
Fauzi, Fatkhurokhman ;
Wasono, Rochdi ;
Kharisudin, Iqbal .
COMMUNICATIONS IN MATHEMATICAL BIOLOGY AND NEUROSCIENCE, 2023,
[46]   Solar Energy Production Forecast Using Standard Recurrent Neural Networks, Long Short-Term Memory, and Gated Recurrent Unit [J].
Buturache, Adrian-Nicolae ;
Stancu, Stelian .
INZINERINE EKONOMIKA-ENGINEERING ECONOMICS, 2021, 32 (04) :313-324
[47]   Using Long Short-Term Memory (LSTM) recurrent neural networks to classify unprocessed EEG for seizure prediction [J].
Chambers, Jordan D. ;
Cook, Mark J. ;
Burkitt, Anthony N. ;
Grayden, David B. .
FRONTIERS IN NEUROSCIENCE, 2024, 18
[48]   Long short-term memory and gated recurrent neural networks to predict the ionospheric vertical total electron content [J].
Iluore, Kenneth ;
Lu, Jianyong .
ADVANCES IN SPACE RESEARCH, 2022, 70 (03) :652-665
[49]   Robust Speech Recognition using Long Short-Term Memory Recurrent Neural Networks for Hybrid Acoustic Modelling [J].
Geiger, Juergen T. ;
Zhang, Zixing ;
Weninger, Felix ;
Schuller, Bjoern ;
Rigoll, Gerhard .
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, :631-635
[50]   On the Initialization of Long Short-Term Memory Networks [J].
Ghazi, Mostafa Mehdipour ;
Nielsen, Mads ;
Pai, Akshay ;
Modat, Marc ;
Cardoso, M. Jorge ;
Ourselin, Sebastien ;
Sorensen, Lauge .
NEURAL INFORMATION PROCESSING (ICONIP 2019), PT I, 2019, 11953 :275-286