REPETITION AND RE-START STRATEGIES FOR PROSODY IN TEXT-TO-SPEECH CONVERSION SYSTEMS

Cited by: 1
Author
LAVER, J
Affiliation
[1] Centre for Speech Technology Research, University of Edinburgh, 80, South Bridge, Edinburgh EH1 1HN, Scotland
Keywords
SPEECH SYNTHESIS; TEXT-TO-SPEECH CONVERSION; PROSODY; NOISE; SPOKEN DIALOG
DOI
10.1016/0167-6393(93)90061-O
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Speakers in conversations between humans continually adapt the prosodic and structural aspects of their speech to the perceived needs of their listeners, in terms of judgments about the potentially masking effects of transient and ambient noise levels, and in response to explicit requests by listeners for repetition. Adaptive strategies for repetition include changing such prosodic aspects of utterances as pitch range and mean, intensity mean and overall tempo of speaking, together with intonational re-structuring. Such repetition also deploys re-start strategies based on structural linguistic knowledge. An outline is offered of principles for incorporating elements of such intelligent adaptivity in the operation of text-to-speech conversion systems, to improve their interactive ability with human partners in dialogue.
Pages: 75-85
Number of pages: 11