SPEECH SYNTHESIS MODELS - A REVIEW

被引:5
作者
BREEN, A
机构
[1] BT Laboratories, Martlesham Heath
来源
ELECTRONICS & COMMUNICATION ENGINEERING JOURNAL | 1992年 / 4卷 / 01期
关键词
D O I
10.1049/ecej:19920006
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Models of speech production and how these are used in text-to-speech conversion are reviewed. In the first part of the paper the foundation is laid for an explanation of present day speech synthesisers, and their limitations, through a phonetic description of speech production. The paper then presents a theorectical model of speech production which is the basis of most synthesisers. Next, a number of speech synthesisers are surveyed and their relative merits and shortcomings are considered. The paper ends with a brief look at techniques that are being used in place of the more traditional models of speech production in text-to-speech (TTS) systems and considers possible areas of progress in the future.
引用
收藏
页码:19 / 31
页数:13
相关论文
共 23 条
  • [1] Allen J., 1987, TEXT SPEECH MITALK S
  • [2] CARLSON R, 1990, ADV SPEECH HEARING L, V1, P269
  • [3] Chan D. S. F., 1989, Eurospeech 89. European Conference on Speech Communication and Technology, P199
  • [4] THE SPEAKING MACHINE OF VONKEMPELEN,WOLFGANG
    DUDLEY, H
    TARNOCZY, TH
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1950, 22 (02) : 151 - 166
  • [5] A synthetic speaker.
    Dudley, H
    Riesz, RR
    Watkins, SSA
    [J]. JOURNAL OF THE FRANKLIN INSTITUTE, 1939, 227 : 0739 - 0764
  • [6] FANT G, 1985, SPEECH TRANSMISSION, P1
  • [7] Fant G., 1960, ACOUSTIC THEORY SPEE
  • [8] COMPUTER-MODEL TO CHARACTERIZE AIR VOLUME DISPLACED BY VIBRATING VOCAL CORDS
    FLANAGAN, JL
    ISHIZAKA, K
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1978, 63 (05) : 1559 - 1565
  • [9] FOURCIN AJ, 1971, MED BIOL ILLUS, V21, P172
  • [10] FOURCIN AJ, 1989, SPEECH INPUT OUTPUT