Long short-term memory recurrent neural network architectures for Urdu acoustic modeling

被引:1
作者
Tehseen Zia
Usman Zahid
机构
[1] COMSATS University Islamabad,
来源
International Journal of Speech Technology | 2019年 / 22卷
关键词
Recurrent neural networks; Long short-term memory; Acoustic modeling; Speech recognition; Urdu;
D O I
暂无
中图分类号
学科分类号
摘要
Recurrent neural networks (RNNs) have achieved remarkable improvements in acoustic modeling recently. However, the potential of RNNs have not been utilized for modeling Urdu acoustics. The connectionist temporal classification and attention based RNNs are suffered due to the unavailability of lexicon and computational cost of training, respectively. Therefore, we explored contemporary long short-term memory and gated recurrent neural networks Urdu acoustic modeling. The efficacies of plain, deep, bidirectional and deep-directional network architectures are evaluated empirically. Results indicate that deep-directional has an advantage over the other architectures. A word error rate of 20% was achieved on a hundred words dataset of twenty speakers. It shows 15% improvement over the baseline single-layer LSTMs. It has been observed that two-layer architectures can improve performance over single-layer, however the performance is degraded with further layers. LSTM architectures were compared with gated recurrent unit (GRU) based architectures and it was found that LSTM has an advantage over GRU.
引用
收藏
页码:21 / 30
页数:9
相关论文
共 27 条
[1]  
Hinton G(2012)Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups IEEE Signal Processing Magazine 29 82-97
[2]  
Deng L(1997)Long short-term memory Neural Computation 9 1735-1780
[3]  
Yu D(1991)Hidden Markov models for speech recognition Technometrics 33 251-272
[4]  
Dahl GE(2010)Recurrent neural network based language model Interspeech 2 3-2681
[5]  
Mohamed AR(1989)A tutorial on hidden Markov models and selected applications in speech recognition Proceedings of the IEEE 77 257286-501
[6]  
Jaitly N(1997)Bidirectional recurrent neural networks IEEE Transactions on Signal Processing 45 2673-409
[7]  
Senior A(1990)An efficient gradient-based algorithm for on-line training of recurrent network trajectories Neural computation 2 490-undefined
[8]  
Vanhoucke V(2017)Recent progresses in deep learning based acoustic models IEEE/CAA Journal of Automatica Sinica 4 396-undefined
[9]  
Nguyen P(undefined)undefined undefined undefined undefined-undefined
[10]  
Sainath TN(undefined)undefined undefined undefined undefined-undefined