Multilingual acoustic models for speech recognition and synthesis

被引:0
作者
Kunzmann, S [1 ]
Fischer, V [1 ]
Gonzalez, J [1 ]
Emam, O [1 ]
Günther, C [1 ]
Janke, E [1 ]
机构
[1] IBM Pervas Comp, European Voice Technol Dev, D-68165 Mannheim, Germany
来源
2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS | 2004年
关键词
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we review the design of a common phone alphabet for up to fifteen languages and describe its application in two important components of a seamless multilingual conversational system, namely speech recognition and synthesis. We report on experiments that demonstrate the advantages of multilingual acoustic models both for the recognition of foreign names and non-native speech, and describe the usefulness of a common phone alphabet for the construction of unit selection based mono- and bilingual speech synthesis systems.
引用
收藏
页码:745 / 748
页数:4
相关论文
共 13 条
[1]  
CAMBRA FP, 2000, P 6 INT C SPOK LANG
[2]  
DONOVAN RE, 1996, THESIS CAMBRIDGE U
[3]  
EIDE E, 2003, P IEEE INT C AC SPEE
[4]  
FISCHER V, 2000, P IEEE WORKSH MULT S
[5]  
FISCHER V, 2002, P 7 INT C SPOK LANG
[6]  
OSTENDORF M, 2002, P IEEE 2002 WORKSH S
[7]  
PFISTER B, 2003, P 8 EUR C SPEECH COM
[8]  
Schultz T., 2001, SPEECH COMMUNICATION, V35
[9]  
Siemund R., 2002, P LREC AR WORKSH, P1
[10]  
Stuker S., 2003, P IEEE INT C AC SPEE