EFFECT OF SYNTHETIC VOICE INTELLIGIBILITY ON SPEECH COMPREHENSION

被引:8
作者
PARIS, CR
GILSON, RD
THOMAS, MH
SILVER, NC
机构
[1] UNIV CENT FLORIDA,DEPT PSYCHOL,ORLANDO,FL 32816
[2] UNIV NEVADA,LAS VEGAS,NV 89154
关键词
D O I
10.1518/001872095779064609
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
This research investigated the differential impact of synthetic voice quality and text difficulty on comprehension of extended prose. Sixty participants listened to five easy and five difficult passages in one of three speech modes: natural speech, VOTRAX (low intelligibility), or DECtalk (high intelligibility). Comprehension of DECtalk was equal to that of natural speech, whereas comprehension of VOTRAX was significantly poorer than with natural speech or DECtalk. Subjects were also asked to shadow passages of each speech type as a measure of resource processing demands. It was found that shadowing accuracy was significantly better for natural speech than for DECtalk and shadowing of DECtalk was markedly superior to that of VOTRAX. The results of this study suggest that resource-demand measures alone may not be appropriate to predict performance in practical applications. Specifically, overall comprehension may not suffer despite on-line losses in processing. These findings also point to a differential allocation of cognitive resources by speech synthesizers of differing intelligibility.
引用
收藏
页码:335 / 340
页数:6
相关论文
共 12 条
[1]   SURFACE INFORMATION LOSS IN COMPREHENSION [J].
GERNSBACHER, MA .
COGNITIVE PSYCHOLOGY, 1985, 17 (03) :324-363
[2]   SHORT-TERM STORAGE IN READING [J].
GLANZER, M ;
FISCHER, B ;
DORFMAN, D .
JOURNAL OF VERBAL LEARNING AND VERBAL BEHAVIOR, 1984, 23 (04) :467-486
[3]   PERCEPTION OF SYNTHETIC SPEECH PRODUCED AUTOMATICALLY BY RULE - INTELLIGIBILITY OF 8 TEXT-TO-SPEECH SYSTEMS [J].
GREENE, BG ;
LOGAN, JS ;
PISONI, DB .
BEHAVIOR RESEARCH METHODS INSTRUMENTS & COMPUTERS, 1986, 18 (02) :100-107
[4]   SYNTACTIC PROCESSING OF CONNECTED SPEECH [J].
JARVELLA, RJ .
JOURNAL OF VERBAL LEARNING AND VERBAL BEHAVIOR, 1971, 10 (04) :409-416
[5]   SEGMENTAL INTELLIGIBILITY OF SYNTHETIC SPEECH PRODUCED BY RULE [J].
LOGAN, JS ;
GREENE, BG ;
PISONI, DB .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1989, 86 (02) :566-581
[6]  
Pisoni David B, 1987, Comput Speech Lang, V2, P303, DOI 10.1016/0885-2308(87)90014-3
[7]   AUDITORY SHORT-TERM-MEMORY AND VOWEL PERCEPTION [J].
PISONI, DB .
MEMORY & COGNITION, 1975, 3 (01) :7-18
[8]   PERCEPTION OF SYNTHETIC SPEECH GENERATED BY RULE [J].
PISONI, DB ;
NUSBAUM, HC ;
GREENE, BG .
PROCEEDINGS OF THE IEEE, 1985, 73 (11) :1665-1676
[9]   COMPREHENSION OF SYNTHETIC SPEECH PRODUCED BY RULE - WORD MONITORING AND SENTENCE-BY-SENTENCE LISTENING TIMES [J].
RALSTON, JV ;
PISONI, DB ;
LIVELY, SE ;
GREENE, BG ;
MULLENNIX, JW .
HUMAN FACTORS, 1991, 33 (04) :471-491
[10]  
REPP BH, 1983, ADV BASIC RES PRACTI, V10