Long Short-Term Memory Recurrent Neural Network Architectures for Large Scale Acoustic Modeling

被引:0
|
作者
Sak, Hasim [1 ]
Senior, Andrew [1 ]
Beaufays, Francoise [1 ]
机构
[1] Google, New York, NY USA
来源
15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4 | 2014年
关键词
Long Short-Term Memory; LSTM; recurrent neural network; RNN; speech recognition; acoustic modeling;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Long Short-Term Memory (LSTM) is a specific recurrent neural network (RNN) architecture that was designed to model temporal sequences and their long-range dependencies more accurately than conventional RNNs. In this paper, we explore LSTM RNN architectures for large scale acoustic modeling in speech recognition. We recently showed that LSTM RNNs are more effective than DNNs and conventional RNNs for acoustic modeling, considering moderately-sized models trained on a single machine. Here, we introduce the first distributed training of LSTM RNNs using asynchronous stochastic gradient descent optimization on a large cluster of machines. We show that a two-layer deep LSTM RNN where each LSTM layer has a linear recurrent projection layer can exceed state-of-the-art speech recognition performance. This architecture makes more effective use of model parameters than the others considered, converges quickly, and outperforms a deep feed forward neural network having an order of magnitude more parameters.
引用
收藏
页码:338 / 342
页数:5
相关论文
共 50 条
  • [1] Long short-term memory recurrent neural network architectures for Urdu acoustic modeling
    Zia, Tehseen
    Zahid, Usman
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (01) : 21 - 30
  • [2] Long short-term memory recurrent neural network architectures for Urdu acoustic modeling
    Tehseen Zia
    Usman Zahid
    International Journal of Speech Technology, 2019, 22 : 21 - 30
  • [3] Long Short-Term Memory Recurrent Neural Network Architectures for Melody Generation
    Mishra, Abhinav
    Tripathi, Kshitij
    Gupta, Lakshay
    Singh, Krishna Pratap
    SOFT COMPUTING FOR PROBLEM SOLVING, 2019, 817 : 41 - 55
  • [4] Long short-term memory recurrent neural network for pharmacokinetic-pharmacodynamic modeling
    Liu, Xiangyu
    Liu, Chao
    Huang, Ruihao
    Zhu, Hao
    Liu, Qi
    Mitra, Sunanda
    Wang, Yaning
    INTERNATIONAL JOURNAL OF CLINICAL PHARMACOLOGY AND THERAPEUTICS, 2021, 59 (02) : 138 - 146
  • [5] Long Short-Term Memory Recurrent Neural Network Based Acoustic Model Using Connectionist Temporal Classification on a Large-Scale Training Corpus
    Lee, Donghyun
    Lim, Minkyu
    Park, Hosung
    Kong, Yoseb
    Park, Jeong-Sik
    Jang, Gil-Jin
    Kim, Ji-Hwan
    CHINA COMMUNICATIONS, 2017, 14 (09) : 23 - 31
  • [6] Handwriting Recognition with Large Multidimensional Long Short-Term Memory Recurrent Neural Networks
    Voigtlaender, Paul
    Doetsch, Patrick
    Ney, Hermann
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 228 - 233
  • [8] Long Short-Term Memory Recurrent Neural Network-Based Acoustic Model Using Connectionist Temporal Classification on a Large-Scale Training Corpus
    Donghyun Lee
    Minkyu Lim
    Hosung Park
    Yoseb Kang
    Jeong-Sik Park
    Gil-Jin Jang
    Ji-Hwan Kim
    中国通信, 2017, 14 (09) : 23 - 31
  • [9] Applying Long Short-Term Memory Recurrent Neural Network for Intrusion Detection
    Althubiti, Sara
    Nick, William
    Mason, Janelle
    Yuan, Xiaohong
    Esterline, Albert
    IEEE SOUTHEASTCON 2018, 2018,
  • [10] Stock Price Prediction With Long Short-Term Memory Recurrent Neural Network
    Jeenanunta, Chawalit
    Chaysiri, Rujira
    Thong, Laksmey
    2018 INTERNATIONAL CONFERENCE ON EMBEDDED SYSTEMS AND INTELLIGENT TECHNOLOGY & INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY FOR EMBEDDED SYSTEMS (ICESIT-ICICTES), 2018,