SPEECH SYNTHESIS MODELS - A REVIEW

被引：5

作者：

BREEN, A

机构：

[1] BT Laboratories, Martlesham Heath

来源：

ELECTRONICS & COMMUNICATION ENGINEERING JOURNAL | 1992年 / 4卷 / 01期

关键词：

D O I：

10.1049/ecej:19920006

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Models of speech production and how these are used in text-to-speech conversion are reviewed. In the first part of the paper the foundation is laid for an explanation of present day speech synthesisers, and their limitations, through a phonetic description of speech production. The paper then presents a theorectical model of speech production which is the basis of most synthesisers. Next, a number of speech synthesisers are surveyed and their relative merits and shortcomings are considered. The paper ends with a brief look at techniques that are being used in place of the more traditional models of speech production in text-to-speech (TTS) systems and considers possible areas of progress in the future.

引用

页码：19 / 31

页数：13

共 23 条

[1] Allen J., 1987, TEXT SPEECH MITALK S
[2] CARLSON R, 1990, ADV SPEECH HEARING L, V1, P269
[3] Chan D. S. F., 1989, Eurospeech 89. European Conference on Speech Communication and Technology, P199
[4] THE SPEAKING MACHINE OF VONKEMPELEN,WOLFGANG
DUDLEY, H
TARNOCZY, TH
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1950, 22 (02) : 151 - 166
[5] A synthetic speaker.
Dudley, H
Riesz, RR
Watkins, SSA
[J]. JOURNAL OF THE FRANKLIN INSTITUTE, 1939, 227 : 0739 - 0764
[6] FANT G, 1985, SPEECH TRANSMISSION, P1
[7] Fant G., 1960, ACOUSTIC THEORY SPEE
[8] COMPUTER-MODEL TO CHARACTERIZE AIR VOLUME DISPLACED BY VIBRATING VOCAL CORDS
FLANAGAN, JL
ISHIZAKA, K
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 (05) : 1559 - 1565
[9] FOURCIN AJ, 1971, MED BIOL ILLUS, V21, P172
[10] FOURCIN AJ, 1989, SPEECH INPUT OUTPUT

← 1 2 3 →