Cognitive factors in the evaluation of synthetic speech

被引:30
|
作者
Delogu, C
Conte, S
Sementina, C
机构
[1] Fdn Ugo Bordoni, Multimedia Commun Div, Voice Commun Grp, I-00142 Rome, Italy
[2] Univ Palermo, Dipartimento Psicol, Palermo, Italy
[3] Univ Rome, Dipartimento Psicol, Rome, Italy
关键词
cognitive factors; evaluation; perception; text-to-speech synthesis;
D O I
10.1016/S0167-6393(98)00009-0
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper illustrates the importance of various cognitive factors involved in perceiving and comprehending synthetic speech. It includes findings drawn from the relative psychological and psycholinguistic literature together with experimental results obtained at the Fondazione Ugo Bordoni laboratory. Overall, it is shown that listening to and comprehending synthetic voices is more difficult than with a natural voice. However, and more importantly, this difficulty can and does decrease with the subjects' exposure to said synthetic voices, Furthermore, greater workload demands are associated with synthetic speech and subjects listening to synthetic passages are required to pay more attention than those listening to natural passages. (C) 1998 Elsevier Science B.V. All rights reserved.
引用
收藏
页码:153 / 168
页数:16
相关论文
共 50 条
  • [21] ASSESSING EVALUATION METRICS FOR SPEECH-TO-SPEECH TRANSLATION
    Salesky, Elizabeth
    Maeder, Julian
    Klinger, Severin
    2021 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2021, : 733 - 740
  • [22] PROVIDING HUMAN-FACTORS KNOWLEDGE TO NONSPECIALISTS - A STRUCTURED METHOD FOR THE EVALUATION OF FUTURE SPEECH INTERFACES
    LIFE, MA
    LONG, JB
    LEE, BP
    ERGONOMICS, 1994, 37 (11) : 1801 - 1842
  • [23] AN EVALUATION OF MONGOLIAN DATA-DRIVEN TEXT-TO-SPEECH
    Altangerel, Chagnaa
    Purev, Jaimai
    Yesyenbyek, Kerey
    Hansakunbuntheung, Chatchawarn
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [24] Evaluation of Prosody in Text-to-Speech Synthesis System of Bangla
    Basu, Tulika
    Saha, Arup
    2013 INTERNATIONAL CONFERENCE ORIENTAL COCOSDA HELD JOINTLY WITH 2013 CONFERENCE ON ASIAN SPOKEN LANGUAGE RESEARCH AND EVALUATION (O-COCOSDA/CASLRE), 2013,
  • [25] Impaired categorical perception of synthetic speech sounds in schizophrenia
    Cienfuegos, A
    March, L
    Shelley, AM
    Javitt, DC
    BIOLOGICAL PSYCHIATRY, 1999, 45 (01) : 82 - 88
  • [26] AN EVALUATION OF THE VISUAL SPEECH APPARATUS
    ARENDS, N
    POVEL, DJ
    VANOS, E
    MICHIELSEN, S
    CLAASSEN, J
    FEITER, I
    SPEECH COMMUNICATION, 1991, 10 (04) : 405 - 414
  • [27] Measuring cognitive factors in speech comprehension: The value of using the Text Reception Threshold test as a visual equivalent of the SRT test
    Kramer, Sophia E.
    Zekveld, Adriana A.
    Houtgast, Tammo
    SCANDINAVIAN JOURNAL OF PSYCHOLOGY, 2009, 50 (05) : 507 - 515
  • [28] Ontology for perception in cognitive agents and synthetic environments
    Suliman, H
    Mehdi, QH
    Gough, NE
    GAME-ON 2003: 4TH INTERNATIONAL CONFERENCE ON INTELLIGENT GAMES AND SIMULATION, 2003, : 127 - 134
  • [29] Factors influencing recognition of interrupted speech
    Wang, Xin
    Humes, Larry E.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2010, 128 (04) : 2100 - 2111
  • [30] Individual differences in speech-on-speech masking are correlated with cognitive and visual task performance
    Byrne, Andrew J.
    Conroy, Christopher
    Kidd, Gerald
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2023, 154 (04) : 2137 - 2153