Continual Learning for Multi-Dialect Acoustic Models

被引:11
作者
Houston, Brady [1 ]
Kirchhoff, Katrin [1 ]
机构
[1] Amazon, Seattle, WA 98109 USA
来源
INTERSPEECH 2020 | 2020年
关键词
speech recognition; acoustic modeling; multi-dialect; DEEP NEURAL-NETWORK;
D O I
10.21437/Interspeech.2020-1797
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Using data from multiple dialects has shown promise in improving neural network acoustic models. While such training can improve the performance of an acoustic model on a single dialect, it can also produce a model capable of good performance on multiple dialects. However, training an acoustic model on pooled data from multiple dialects takes a significant amount of time and computing resources, and it needs to be retrained every time a new dialect is added to the model. In contrast, sequential transfer learning (fine-tuning) does not require retraining using all data, but may result in catastrophic forgetting of previously-seen dialects. Using data from four english dialects, we demonstrate that by using loss functions that mitigate catastrophic forgetting, sequential transfer learning can be used to train multi-dialect acoustic models that narrow the WER gap between the best (combined training) and worst (fine-tuning) case by up to 65%. Continual learning shows great promise in minimizing training time while approaching the performance of models that require much more training time.
引用
收藏
页码:576 / 580
页数:5
相关论文
共 20 条
  • [1] Rusu AA, 2016, Arxiv, DOI arXiv:1606.04671
  • [2] Multi-dialect acoustic modeling using phone mapping and online i-vectors
    Arsikere, Harish
    Sapru, Ashtosh
    Garimella, Sri
    [J]. INTERSPEECH 2019, 2019, : 2125 - 2129
  • [3] Beringer N., 1998, 5 INT C SPOKEN LANGU
  • [4] Elfeky M, 2016, IEEE W SP LANG TECH, P624, DOI 10.1109/SLT.2016.7846328
  • [5] Ghorbani S, 2019, Arxiv, DOI arXiv:1910.00565
  • [6] Ghoshal A, 2013, INT CONF ACOUST SPEE, P7319, DOI 10.1109/ICASSP.2013.6639084
  • [7] Heigold G, 2013, INT CONF ACOUST SPEE, P8619, DOI 10.1109/ICASSP.2013.6639348
  • [8] Hinton G., 2015, Comput Sci, V1050
  • [9] Huang JT, 2013, INT CONF ACOUST SPEE, P7304, DOI 10.1109/ICASSP.2013.6639081
  • [10] Overcoming catastrophic forgetting in neural networks
    Kirkpatricka, James
    Pascanu, Razvan
    Rabinowitz, Neil
    Veness, Joel
    Desjardins, Guillaume
    Rusu, Andrei A.
    Milan, Kieran
    Quan, John
    Ramalho, Tiago
    Grabska-Barwinska, Agnieszka
    Hassabis, Demis
    Clopath, Claudia
    Kumaran, Dharshan
    Hadsell, Raia
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) : 3521 - 3526