ON MEASURING THE INTELLIGIBILITY OF SYNTHETIC SPEECH IN NOISE - DO WE NEED A REALISTIC NOISE ENVIRONMENT?

Times cited: 0
Authors
Raitio, Tuomo [1]
Takanen, Marko [1]
Santala, Olli [1]
Suni, Antti [2]
Vainio, Martti [2]
Alku, Paavo [1]
Affiliations
[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland
[2] Univ Helsinki, Inst Behav Sci, Helsinki, Finland
Source
2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012
Funding
Academy of Finland;
Keywords
synthetic speech; speech in noise; intelligibility; multichannel reproduction; Lombard speech; COCKTAIL PARTY;
DOI
Not available
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Assessing the intelligibility of synthetic speech is important in creating synthetic voices to be used in real-life applications, especially those involving interfering noise. This raises the question of how to measure the intelligibility of synthetic speech so as to correctly simulate such conditions. Conventionally, this has been done using a simple listening test setup where diotic speech and noise are played to both ears with headphones. This is very different from a real noise environment, where speech and noise are spatially distributed. This paper addresses the question of whether a realistic noise environment should be used to test the intelligibility of synthetic speech. Three different test conditions are evaluated: one with multichannel reproduction of noise and speech, and two headphone setups. Tests are performed with natural and synthetic speech, including speech especially intended for noisy conditions. The results indicate a general trend in all setups but also some interesting differences.
Pages: 4025 - 4028
Page count: 4
Related papers
50 in total
  • [1] SEGMENTAL INTELLIGIBILITY AND SPEECH INTERFERENCE THRESHOLDS OF HIGH-QUALITY SYNTHETIC SPEECH IN PRESENCE OF NOISE
    KOUL, RK
    ALLEN, GD
    JOURNAL OF SPEECH AND HEARING RESEARCH, 1993, 36 (04) : 790 - 798
  • [2] Speech intelligibility improvement in car noise environment by voice transformation
    Nathwani, Karan
    Richard, Gael
    David, Bertrand
    Prablanc, Pierre
    Roussarie, Vincent
    SPEECH COMMUNICATION, 2017, 91 : 17 - 27
  • [3] Head orientation benefit to speech intelligibility in noise for cochlear implant users and in realistic listening conditions
    Grange, Jacques A.
    Culling, John F.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (06) : 4061 - 4072
  • [4] Effect of contralateral noise on energetic and informational masking on speech-in-speech intelligibility
    Dole, Marjorie
    Hoen, Michel
    Meunier, Fanny
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009 : 156 - +
  • [5] Prediction of Speech Intelligibility by Means of EEG Responses to Sentences in Noise
    Muncke, Jan
    Kuruvila, Ivine
    Hoppe, Ulrich
    FRONTIERS IN NEUROSCIENCE, 2022, 16
  • [6] Can Objective Measures Predict the Intelligibility of Modified HMM-based Synthetic Speech in Noise?
    Valentini-Botinhao, Cassia
    Yamagishi, Junichi
    King, Simon
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011 : 1848 - 1851
  • [7] Evaluating the intelligibility benefit of speech modifications in known noise conditions
    Cooke, Martin
    Mayo, Catherine
    Valentini-Botinhao, Cassia
    Stylianou, Yannis
    Sauert, Bastian
    Tang, Yan
    SPEECH COMMUNICATION, 2013, 55 (04) : 572 - 585
  • [8] Learning static spectral weightings for speech intelligibility enhancement in noise
    Tang, Yan
    Cooke, Martin
    COMPUTER SPEECH AND LANGUAGE, 2018, 49 : 1 - 16
  • [9] Test of Spanish sentences to measure speech intelligibility in noise conditions
    Cervera, Teresa
    Gonzalez-Alvarez, Julio
    BEHAVIOR RESEARCH METHODS, 2011, 43 (02) : 459 - 467
  • [10] Speech intelligibility among modulated and spatially distributed noise sources
    Culling, John F.
    Mansell, Elizabeth R.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2013, 133 (04) : 2254 - 2261