Synthesising isiZulu-English code-switch bigrams using word embeddings

被引:11
作者
van der Westhuizen, Ewald [1 ]
Niesler, Thomas [1 ]
机构
[1] Stellenbosch Univ, Dept Elect & Elect Engn, Stellenbosch, South Africa
来源
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION | 2017年
关键词
code-switching; word vectors; word embed-dings; Zulu; IsiZulu; spontaneous;
D O I
10.21437/Interspeech.2017-1437
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Code-switching is prevalent among South African speakers, and presents a challenge to automatic speech recognition systems. It is predominantly a spoken phenomenon. and generally does not occur in textual form. Therefore a particularly serious challenge is the extreme lack of training material for language modelling. We investigate the use of word embeddings to synthesise isiZulu-to-English code-switch bigrams with which to augment such sparse language model training data. A variety of word embeddings are trained on a monolingual English web text corpus, and subsequently queried to synthesise code-switch bigrams. Our evaluation is performed on language models trained on a new, although small. English-isiZulu code switch corpus compiled from South African soap operas. This data is characterised by fast. spontaneously spoken speech containing frequent code-switching. We show that the augmentation of the training data with code-switched bigrams synthesised in this way leads to a reduction in perplexity.
引用
收藏
页码:72 / 76
页数:5
相关论文
共 23 条
  • [11] Kurimo M., 2006, P MAIN C HUM LANG TE
  • [12] Levy O., 2015, Transactions of the Association for Computational Linguistics, V3, P211, DOI [DOI 10.1162/TACL_A_00134, DOI 10.1162/TACLA00134]
  • [13] Li Y, 2013, INT CONF ACOUST SPEE, P7368, DOI 10.1109/ICASSP.2013.6639094
  • [14] Li Y, 2011, INT CONF ACOUST SPEE, P5004
  • [15] Liang WB, 2013, INTERSPEECH, P1486
  • [16] Modipa T. I., 2013, PRASA 2013 P
  • [17] Pennington J., 2014, 2014 C EMP METH NAT, P43
  • [18] Poulos G., 1998, LINGUISTIC ANAL ZULU
  • [19] SHIA CJ, 2004, AC SPEECH SIGN PROC, V1
  • [20] Solorio T., 2008, P C EMP METH NAT LAN, P973