ON MEASURING THE INTELLIGIBILITY OF SYNTHETIC SPEECH IN NOISE - DO WE NEED A REALISTIC NOISE ENVIRONMENT?

Cited by: 0

Authors:

Raitio, Tuomo [1]

Takanen, Marko [1]

Santala, Olli [1]

Suni, Antti [2]

Vainio, Martti [2]

Alku, Paavo [1]

Affiliations:

[1] Aalto Univ, Dept Signal Proc & Acoust, Espoo, Finland

[2] Univ Helsinki, Inst Behav Sci, Helsinki, Finland

Source:

2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2012

Funding:

Academy of Finland;

Keywords:

synthetic speech; speech in noise; intelligibility; multichannel reproduction; Lombard speech; COCKTAIL PARTY;

DOI:

Not available

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

Assessing the intelligibility of synthetic speech is important in creating synthetic voices for real-life applications, especially those involving interfering noise. This raises the question of how to measure the intelligibility of synthetic speech so that such conditions are simulated correctly. Conventionally, this has been done using a simple listening test setup in which diotic speech and noise are played to both ears over headphones. This is very different from a real noise environment, where speech and noise are spatially distributed. This paper addresses the question of whether a realistic noise environment should be used to test the intelligibility of synthetic speech. Three test conditions are evaluated: one with multichannel reproduction of noise and speech, and two headphone setups. Tests are performed with natural and synthetic speech, including speech specifically intended for noisy conditions. The results indicate a general trend in all setups but also some interesting differences.

Pages: 4025 - 4028

Page count: 4

