Schwa-deletion in Hindi text-to-speech synthesis

被引：15

作者：

Narasimhan B. ^{[1
]}

Sproat R. ^{[2
]}

Kiraz G. ^{[3
]}

机构：

[1] Max Planck Inst. for Psycholing., Nijmegen

[2] University of Illinois, Urbana-Champaign

[3] AT and T Labs - Research, NJ

来源：

International Journal of Speech Technology | 2004年 / 7卷 / 4期

关键词：

Finite-state methods; Phonology; Text analysis; Text-to-speech;

D O I：

10.1023/B:IJST.0000037075.71599.62

中图分类号：

学科分类号：

摘要：

We describe the phenomenon of schwa-deletion in Hindi and how it is handled in the pronunciation component of a multilingual concatenative text-to-speech system. Each of the consonants in written Hindi is associated with an "inherent" schwa vowel which is not represented in the orthography. For instance, the Hindi word pronounced as [namak] ('salt') is represented in the orthography using the consonantal characters for [n], [m], and [k]. Two main factors complicate the issue of schwa pronunciation in Hindi. First, not every schwa following a consonant is pronounced within the word. Second, in multimorphemic words, the presence of a morpheme boundary can block schwa deletion where it might otherwise occur. We propose a model for schwa-deletion which combines a general purpose schwa-deletion rule proposed in the linguistics literature (Ohala, 1983), with additional morphological analysis necessitated by the high frequency of compounds in our database. The system is implemented in the framework of finite-state transducer technology.

引用

页码：319 / 333

页数：14

共 34 条

[1]

Bhaskararao P., Mathew S., Phonemic transcription rules for text-to-speech synthesis of Hindi, Com Puter Processing of Asian Languages, (1992)

[2]

Bhaskararao P., Peri V.N., Udpikar V., A text-to-speech system fo application by visually handicapped and illiterate, Proceedings of the International Conference on Spoken Language Processing, pp. 1239-1241, (1994)

[3]

Black A., Campbell N., Optimising selection of units from speech databases for concatenative synthesis, Proceedings of Eurospeech 95, 1, pp. 581-584, (1995)

[4]

Charpentier F., Moulines E., Pitch-synchronous waver-form processing techniques for text-to-speech synthesis using diphones, Speech Communication, 9, 5-6, pp. 453-467, (1990)

[5]

Furtado X.A., Sen A., Synthesis of unlimited speech in Indian languages using formant-based rules, Sãdhanã, 21, pp. 345-362, (1996)

[6]

Hopcroft J., Ullman J., Introduction to Automata Theory, Languages, and Computation, (1979)

[7]

Jannedy S., Mobius B., Name pronunciation in German text-to-speech synthesis, Proceedings of the 5th Conference on Applied Natural Language Processing, pp. 49-56, (1997)

[8]

Kachru Y., Hindi-Urdu, The World's Major Languages, (1987)

[9]

Kaplan R., Kay M., Regular models of phonological rule systems, Computational Linguistics, 20, pp. 331-378, (1994)

[10]

Kiraz G., Mobius B., Multilingual syllabification using weighted finite-state transducers, Proceedings of the Third International Workshop on Speech Synthesis, pp. 71-76, (1998)

← 1 2 3 4 →