Cognitive factors in the evaluation of synthetic speech

Cited by: 31
Authors
Delogu, C
Conte, S
Sementina, C
Affiliations
[1] Fdn Ugo Bordoni, Multimedia Commun Div, Voice Commun Grp, I-00142 Rome, Italy
[2] Univ Palermo, Dipartimento Psicol, Palermo, Italy
[3] Univ Rome, Dipartimento Psicol, Rome, Italy
Keywords
cognitive factors; evaluation; perception; text-to-speech synthesis
DOI
10.1016/S0167-6393(98)00009-0
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This paper illustrates the importance of various cognitive factors involved in perceiving and comprehending synthetic speech. It includes findings drawn from the relevant psychological and psycholinguistic literature together with experimental results obtained at the Fondazione Ugo Bordoni laboratory. Overall, it is shown that listening to and comprehending synthetic voices is more difficult than listening to and comprehending a natural voice. However, and more importantly, this difficulty can and does decrease with the subjects' exposure to these synthetic voices. Furthermore, greater workload demands are associated with synthetic speech, and subjects listening to synthetic passages are required to pay more attention than those listening to natural passages. © 1998 Elsevier Science B.V. All rights reserved.
Pages: 153-168
Number of pages: 16