Multilingual acoustic models for speech recognition and synthesis

Cited by: 0

Authors:

Kunzmann, S [1]

Fischer, V [1]

Gonzalez, J [1]

Emam, O [1]

Günther, C [1]

Janke, E [1]

Affiliations:

[1] IBM Pervas Comp, European Voice Technol Dev, D-68165 Mannheim, Germany

Source:

2004 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL III, PROCEEDINGS: IMAGE AND MULTIDIMENSIONAL SIGNAL PROCESSING SPECIAL SESSIONS | 2004

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification code:

0808; 0809

Abstract:

In this paper we review the design of a common phone alphabet for up to fifteen languages and describe its application in two important components of a seamless multilingual conversational system, namely speech recognition and synthesis. We report on experiments that demonstrate the advantages of multilingual acoustic models both for the recognition of foreign names and non-native speech, and describe the usefulness of a common phone alphabet for the construction of unit selection based mono- and bilingual speech synthesis systems.

Pages: 745-748

Page count: 4