Continual Learning for Multi-Dialect Acoustic Models

被引：11

作者：

Houston, Brady ^{[1
]}

Kirchhoff, Katrin ^{[1
]}

机构：

[1] Amazon, Seattle, WA 98109 USA

来源：

INTERSPEECH 2020 | 2020年

关键词：

speech recognition; acoustic modeling; multi-dialect; DEEP NEURAL-NETWORK;

D O I：

10.21437/Interspeech.2020-1797

中图分类号：

R36 [病理学]; R76 [耳鼻咽喉科学];

学科分类号：

100104 ; 100213 ;

摘要：

Using data from multiple dialects has shown promise in improving neural network acoustic models. While such training can improve the performance of an acoustic model on a single dialect, it can also produce a model capable of good performance on multiple dialects. However, training an acoustic model on pooled data from multiple dialects takes a significant amount of time and computing resources, and it needs to be retrained every time a new dialect is added to the model. In contrast, sequential transfer learning (fine-tuning) does not require retraining using all data, but may result in catastrophic forgetting of previously-seen dialects. Using data from four english dialects, we demonstrate that by using loss functions that mitigate catastrophic forgetting, sequential transfer learning can be used to train multi-dialect acoustic models that narrow the WER gap between the best (combined training) and worst (fine-tuning) case by up to 65%. Continual learning shows great promise in minimizing training time while approaching the performance of models that require much more training time.

引用

页码：576 / 580

页数：5

共 20 条

[1] Rusu AA, 2016, Arxiv, DOI arXiv:1606.04671
[2] Multi-dialect acoustic modeling using phone mapping and online i-vectors
Arsikere, Harish
Sapru, Ashtosh
Garimella, Sri
[J]. INTERSPEECH 2019, 2019, : 2125 - 2129
[3] Beringer N., 1998, 5 INT C SPOKEN LANGU
[4] Elfeky M, 2016, IEEE W SP LANG TECH, P624, DOI 10.1109/SLT.2016.7846328
[5] Ghorbani S, 2019, Arxiv, DOI arXiv:1910.00565
[6] Ghoshal A, 2013, INT CONF ACOUST SPEE, P7319, DOI 10.1109/ICASSP.2013.6639084
[7] Heigold G, 2013, INT CONF ACOUST SPEE, P8619, DOI 10.1109/ICASSP.2013.6639348
[8] Hinton G., 2015, Comput Sci, V1050
[9] Huang JT, 2013, INT CONF ACOUST SPEE, P7304, DOI 10.1109/ICASSP.2013.6639081
[10] Overcoming catastrophic forgetting in neural networks
Kirkpatricka, James
Pascanu, Razvan
Rabinowitz, Neil
Veness, Joel
Desjardins, Guillaume
Rusu, Andrei A.
Milan, Kieran
Quan, John
Ramalho, Tiago
Grabska-Barwinska, Agnieszka
Hassabis, Demis
Clopath, Claudia
Kumaran, Dharshan
Hadsell, Raia
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (13) : 3521 - 3526

← 1 2 →