Forward-Backward Attention Decoder

Cited by: 7
Authors
Mimura, Masato [1]
Sakai, Shinsuke [1]
Kawahara, Tatsuya [1]
Affiliations
[1] Kyoto Univ, Sch Informat, Sakyo Ku, Kyoto 6068501, Japan
Keywords
Sequence-to-sequence speech recognition; attention; acoustic-to-word models; forward-backward decoding; multitask learning; neural networks
DOI
10.21437/Interspeech.2018-1160
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates how forward and backward attentions can be integrated to improve the performance of attention-based sequence-to-sequence (seq2seq) speech recognition systems. In the proposed approach, speech is decoded from left to right as well as from right to left using forward and backward attention vectors, and the best sentence hypothesis is searched for according to combined probabilities provided by the decoders of the two directions. Our method takes advantage of two distinct and complementary ways of extracting information from the asymmetric time structure of speech. It also mitigates a drawback of attention-based models: their output labels tend to become less reliable as utterances grow longer, because errors accumulate. We also show the effectiveness of multitask learning in which the forward decoder is jointly trained with a backward decoder that shares a single encoder. The proposed forward-backward decoding improved word error rates (WERs) of word-level attention models by up to 12.7% relative in speech recognition experiments using large-scale spontaneous speech corpora. The resulting models achieve much higher performance than a state-of-the-art hybrid DNN-HMM system while retaining the advantage of very low latency.
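The abstract says only that the best hypothesis is chosen according to combined probabilities from the two decoders; it does not say how they are combined. The following is a minimal sketch that assumes one plausible reading: a weighted sum of the forward and backward log-probabilities over a small candidate list. The names `combined_score`, `rescore`, `weight`, and the toy score tables are hypothetical and do not come from the paper.

```python
from typing import Callable, Dict, Sequence, Tuple

Hypothesis = Tuple[str, ...]
# A scorer maps a word sequence to a log-probability (hypothetical interface).
Scorer = Callable[[Hypothesis], float]


def combined_score(hyp: Hypothesis, forward_logp: Scorer,
                   backward_logp: Scorer, weight: float = 0.5) -> float:
    """Interpolate forward and backward log-probabilities (assumed rule).

    The backward decoder reads the utterance right to left, so it is
    scored on the reversed word sequence.
    """
    fwd = forward_logp(hyp)
    bwd = backward_logp(tuple(reversed(hyp)))
    return (1.0 - weight) * fwd + weight * bwd


def rescore(candidates: Sequence[Hypothesis], forward_logp: Scorer,
            backward_logp: Scorer, weight: float = 0.5) -> Hypothesis:
    """Return the candidate with the highest combined score."""
    return max(candidates,
               key=lambda h: combined_score(h, forward_logp,
                                            backward_logp, weight))


# Toy log-probabilities standing in for real decoder outputs (made up).
FORWARD: Dict[Hypothesis, float] = {
    ("the", "cat", "sat"): -1.0,
    ("the", "cat", "sad"): -0.9,
}
BACKWARD: Dict[Hypothesis, float] = {
    ("sat", "cat", "the"): -0.5,
    ("sad", "cat", "the"): -2.0,
}

candidates = list(FORWARD)
best_forward_only = rescore(candidates, FORWARD.__getitem__,
                            BACKWARD.__getitem__, weight=0.0)
best_combined = rescore(candidates, FORWARD.__getitem__,
                        BACKWARD.__getitem__, weight=0.5)
print("forward only:", best_forward_only)
print("combined:    ", best_combined)
```

In this toy setup the forward scores alone slightly prefer the hypothesis ending in "sad", while the backward scores, which reach the final word first, disagree strongly enough to change the combined choice. That illustrates the kind of complementarity the abstract describes, under the assumptions stated above.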
Pages: 2232-2236
Number of pages: 5