Optimised spectral weightings for noise-dependent speech intelligibility enhancement

被引:0
作者
Tang, Yan [1 ]
Cooke, Martin [1 ]
机构
[1] Univ Basque Country, Language & Speech Lab, Vitoria, Spain
来源
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年
关键词
speech intelligibility; noise; optimisation; genetic algorithm; glimpse proportion;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Natural or synthetic speech is increasingly used in less-than-ideal listening conditions. Maximising the likelihood of correct message reception in such situations often leads to a strategy of loud and repetitive renditions of output speech. An alternative approach is to modify the speech signal in ways which increase intelligibility in noise without increasing signal level or duration. The current study focused on the design of stationary spectral modifications whose effect is to reallocate speech energy across frequency bands. Frequency band weights were selected using a genetic algorithm-based optimisation procedure, with glimpse proportion as the objective intelligibility metric, for a range of noise types and levels. As expected, a clear dependence of noise type and global signal-to-noise ratio on energy reallocation was found. One unanticipated outcome was the consistent discovery of sparse, highly-selective spectral energy weightings, particularly in high noise conditions. In a subjective test using stationary noise and competing speech maskers, listeners were able to identify significantly more words in sentences as a result of spectral weighting, with increases of up to 15 percentage points. These findings suggest that context-dependent speech output can be used to maintain intelligibility at lower sound output levels.
引用
收藏
页码:954 / 957
页数:4
相关论文
共 50 条
  • [21] A mapping model of spectral tilt in normal-to-Lombard speech conversion for intelligibility enhancement
    Li, Gang
    Hu, Ruimin
    Zhang, Rui
    Wang, Xiaochen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (27-28) : 19471 - 19491
  • [22] Effects of urgent speech and congruent/incongruent text on speech intelligibility in noise and reverberation
    Hodoshima, Nao
    INTERSPEECH 2019, 2019, : 3113 - 3117
  • [23] Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression
    Zorila, Tudor-Catalin
    Kandia, Varvara
    Stylianou, Yannis
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 634 - 637
  • [24] Effect of Spectral Contrast Enhancement on Speech-on-Speech Intelligibility and Voice Cue Sensitivity in Cochlear Implant Users
    El Boghdady, Nawal
    Langner, Florian
    Gaudrain, Etienne
    Baskent, Deniz
    Nogueira, Waldo
    EAR AND HEARING, 2021, 42 (02) : 271 - 289
  • [25] Preservation of Speech Spectral Dynamics Enhances Intelligibility
    Petkov, Petko N.
    Kleijn, W. Bastiaan
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3564 - 3568
  • [26] A Higher Intelligibility Speech-Enhancement Algorithm
    Liu, Peng
    Ma, Jianfen
    MECHATRONICS AND INDUSTRIAL INFORMATICS, PTS 1-4, 2013, 321-324 : 1075 - 1079
  • [27] On Improvement of Speech Intelligibility and Quality: A Survey of Unsupervised Single Channel Speech Enhancement Algorithms
    Saleem, Nasir
    Khattak, Muhammad Irfan
    Verdu, Elena
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2020, 6 (02): : 78 - 89
  • [28] On Speech Intelligibility Estimation of Phase-Aware Single-Channel Speech Enhancement
    Gaich, Andreas
    Mowlaee, Pejman
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 2553 - 2557
  • [29] Speech intelligibility enhancement: a hybrid wiener approach
    V. Srinivasarao
    Umesh Ghanekar
    International Journal of Speech Technology, 2020, 23 : 517 - 525
  • [30] Speech Intelligibility and Quality: A Comparative Study of Speech Enhancement Algorithms
    Xu, Xiaodong
    Flynn, Ronan
    Russell, Michael
    2017 28TH IRISH SIGNALS AND SYSTEMS CONFERENCE (ISSC), 2017,