Optimised spectral weightings for noise-dependent speech intelligibility enhancement

被引:0
作者
Tang, Yan [1 ]
Cooke, Martin [1 ]
机构
[1] Univ Basque Country, Language & Speech Lab, Vitoria, Spain
来源
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年
关键词
speech intelligibility; noise; optimisation; genetic algorithm; glimpse proportion;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural or synthetic speech is increasingly used in less-than-ideal listening conditions. Maximising the likelihood of correct message reception in such situations often leads to a strategy of loud and repetitive renditions of output speech. An alternative approach is to modify the speech signal in ways which increase intelligibility in noise without increasing signal level or duration. The current study focused on the design of stationary spectral modifications whose effect is to reallocate speech energy across frequency bands. Frequency band weights were selected using a genetic algorithm-based optimisation procedure, with glimpse proportion as the objective intelligibility metric, for a range of noise types and levels. As expected, a clear dependence of noise type and global signal-to-noise ratio on energy reallocation was found. One unanticipated outcome was the consistent discovery of sparse, highly-selective spectral energy weightings, particularly in high noise conditions. In a subjective test using stationary noise and competing speech maskers, listeners were able to identify significantly more words in sentences as a result of spectral weighting, with increases of up to 15 percentage points. These findings suggest that context-dependent speech output can be used to maintain intelligibility at lower sound output levels.
引用
收藏
页码:954 / 957
页数:4
相关论文
共 50 条
  • [31] The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise
    Lu, Youyi
    Cooke, Martin
    SPEECH COMMUNICATION, 2009, 51 (12) : 1253 - 1262
  • [32] Selective Frequency Enhancement of Speech Signal for Intelligibility Improvement in Presence of Near-end Noise
    Premananda, B. S.
    Uma, B., V
    PROCEEDINGS OF 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND CONTROL(ICAC3'15), 2015, 49 : 244 - 252
  • [33] SPECTRAL FEATURE ENHANCEMENT FOR PEOPLE WITH SENSORINEURAL HEARING IMPAIRMENT - EFFECTS ON SPEECH-INTELLIGIBILITY AND QUALITY
    STONE, MA
    MOORE, BCJ
    JOURNAL OF REHABILITATION RESEARCH AND DEVELOPMENT, 1992, 29 (02): : 39 - 56
  • [34] Fundamental frequency and speech intelligibility in background noise
    Brown, Christopher A.
    Bacon, Sid P.
    HEARING RESEARCH, 2010, 266 (1-2) : 52 - 59
  • [35] A comparative intelligibility study of speech enhancement algorithms
    Hu, Yi
    Loizou, Philipos C.
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 561 - +
  • [36] Effects of noise and filtering on the intelligibility of speech produced during simultaneous communication
    MacKenzie, DJ
    Schiavetti, N
    Whitehead, RL
    Metz, DE
    JOURNAL OF COMMUNICATION DISORDERS, 2004, 37 (06) : 505 - 515
  • [37] Intelligibility of speech spoken in noise/reverberation for older adults in reverberant environments
    Hodoshima, Nao
    Arai, Takayuki
    Kurisu, Kiyohiro
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 1462 - 1465
  • [38] Subjective intelligibility of deep neural network-based speech enhancement
    Gelderblom, Femke B.
    Tronstad, Tron V.
    Viggen, Erlend Magnus
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1968 - 1972
  • [39] Increasing speech intelligibility in monaural hearing by adding noise at the other ear
    Wang, Kang
    Wang, Peng
    Qiu, Xiaojun
    APPLIED ACOUSTICS, 2019, 146 : 50 - 55
  • [40] Effects on speech intelligibility of temporal jittering and spectral smearing of the high-frequency components of speech
    MacDonald, Ewen N.
    Pichora-Fuller, M. Kathleen
    Schneider, Bruce A.
    HEARING RESEARCH, 2010, 261 (1-2) : 63 - 66