Multilingual speech recognition in seven languages

被引:23
作者
Uebler, U [1 ]
机构
[1] Bavarian Res Ctr Knowledge Based Syst, FORWISS, Res Grp Knowledge Proc, D-91058 Erlangen, Germany
关键词
dialect; non-native; bilingual; multilingual;
D O I
10.1016/S0167-6393(00)00095-9
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this study we present approaches to multilingual speech recognition. We first define different approaches, namely portation, cross-lingual and simultaneous multilingual speech recognition. We will show some experiments performed in the fields of multilingual speech recognition. In recent years we have ported our recognizer to other languages than German (Italian, Slovak, Slovenian, Czech, English, Japanese). We found that some languages achieve a higher recognition performance with comparable tasks, and are thus easier for automatic speech recognition than others. Furthermore, we present experiments which show the performance of cross-lingual speech recognition of an untrained language with a recognizer trained with other languages. The substitution of phones is important for cross-lingual and simultaneous multilingual recognition. We compared results in cross-lingual recognition for different baseline systems and found that the number of shared acoustic units is very important for the performance. With simultaneous multilingual recognition, performance usually decreases compared to monolingual recognition. In few cases, like in the case of non-native speech, however, the recognition can be improved. (C) 2001 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:53 / 69
页数:17
相关论文
共 29 条
[1]  
ACKERMANN U, 1996, 3 CRIM FORW WORKSH M
[2]  
ACKERMANN U, 1996, P INT C SPOK LANG PR
[3]  
BARNETT J, 1996, P INT C SPOK LANG PR
[4]  
BONAVENTURA P, 1997, P EUR C SPEECH COMM, V1, P355
[5]  
BUB U, 1997, P ICASSP97 MUN, V2, P1451
[6]  
CERFDANON H, 1991, P EUR C SPEECH COMM, V1, P183
[7]  
DALSGAARD P, 1991, P INT C AC SPEECH SI, P197
[8]  
DALSGAARD P, 1998, P INT C SPOK LANG PR, V6, P2627
[9]  
Delattre P., 1965, COMP PHONETIC FEATUR
[10]  
DENG L, 1997, P IEEE INT C AC SPEE, V2, P1007