REPETITION AND RE-START STRATEGIES FOR PROSODY IN TEXT-TO-SPEECH CONVERSION SYSTEMS

Cited by: 1
Author
LAVER, J
Affiliation
[1] Centre for Speech Technology Research, University of Edinburgh, 80, South Bridge, Edinburgh EH1 1HN, Scotland
Keywords
SPEECH SYNTHESIS; TEXT-TO-SPEECH CONVERSION; PROSODY; NOISE; SPOKEN DIALOG
DOI
10.1016/0167-6393(93)90061-O
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Speakers in conversations between humans continually adapt the prosodic and structural aspects of their speech to the perceived needs of their listeners, in terms of judgments about the potentially masking effects of transient and ambient noise levels, and in response to explicit requests by listeners for repetition. Adaptive strategies for repetition include changing such prosodic aspects of utterances as pitch range and mean, intensity mean and overall tempo of speaking, together with intonational re-structuring. Such repetition also deploys re-start strategies based on structural linguistic knowledge. An outline is offered of principles for incorporating elements of such intelligent adaptivity in the operation of text-to-speech conversion systems, to improve their interactive ability with human partners in dialogue.
Pages: 75-85
Page count: 11