Improving Under-Resourced Code-Switched Speech Recognition: Large Pre-trained Models or Architectural Interventions

被引:2
作者
van Vuren, Joshua Jansen [1 ]
Niesler, Thomas [1 ]
机构
[1] Stellenbosch Univ, Stellenbosch, South Africa
来源
INTERSPEECH 2023 | 2023年
关键词
speech recognition; under-resourced; code-switched; n-best rescoring; African languages;
D O I
10.21437/Interspeech.2023-1841
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present three approaches to improve language modelling of under-resourced code-switched speech. First, we challenge the practice of fine-tuning large pre-trained language models on small datasets. Secondly, we investigate the advantages of sub-word encodings for our multilingual code-switched speech. Thirdly, we propose an architectural innovation to the RNN language model that is specifically designed for code-switched text. We show a clear reduction in absolute word error rate of 0.17% for the adapted LSTM language model compared to M-BERT when employed in n-best rescoring experiments. Further, the LSTM models afford a seven-fold reduction in total number of parameters and reduces runtime during rescoring 100-fold. Contrary to recent research trends, our LSTM models do not outperform the word-level models when using sub-word vocabularies. Finally, the new architectural mechanism applied to the LSTM improves language prediction for a span of several words following a code-switch.
引用
收藏
页码:1439 / 1443
页数:5
相关论文
共 32 条
[1]  
Adel H., 2013, P IEEE INT C AC SPEE
[2]  
Akyurek A.F., 2020, P 58 ANN M ASS COMP
[3]  
[Anonymous], 2002, INTERSPEECH
[4]  
Ba J.L., 2016, Layer Normalization
[5]   Improved low-resource Somali speech recognition by semi-supervised acoustic and language model training [J].
Biswas, Astik ;
Menon, Raghav ;
van der Westhuizen, Ewald ;
Niesler, Thomas .
INTERSPEECH 2019, 2019, :3008-3012
[6]  
Brown T.B., 2020, ARXIV
[7]  
Devlin J., 2019, NAACL HLT 2019 2019, P1
[8]  
Gao Y., 2019, P INT GRAZ AUSTR
[9]  
Garg S., 2018, P C EMP METH NAT LAN
[10]  
Gupta A., 2021, P C N AM CHAPT ASS C