Optimised spectral weightings for noise-dependent speech intelligibility enhancement

Cited by: 0
Authors
Tang, Yan [1]
Cooke, Martin [1]
Affiliation
[1] Univ Basque Country, Language & Speech Lab, Vitoria, Spain
Source
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012
Keywords
speech intelligibility; noise; optimisation; genetic algorithm; glimpse proportion
DOI
Not available
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Natural or synthetic speech is increasingly used in less-than-ideal listening conditions. Maximising the likelihood of correct message reception in such situations often leads to a strategy of loud and repetitive renditions of output speech. An alternative approach is to modify the speech signal in ways which increase intelligibility in noise without increasing signal level or duration. The current study focused on the design of stationary spectral modifications whose effect is to reallocate speech energy across frequency bands. Frequency band weights were selected using a genetic algorithm-based optimisation procedure, with glimpse proportion as the objective intelligibility metric, for a range of noise types and levels. As expected, a clear dependence of energy reallocation on noise type and global signal-to-noise ratio was found. One unanticipated outcome was the consistent discovery of sparse, highly selective spectral energy weightings, particularly in high noise conditions. In a subjective test using stationary noise and competing speech maskers, listeners were able to identify significantly more words in sentences as a result of spectral weighting, with increases of up to 15 percentage points. These findings suggest that context-dependent speech output can be used to maintain intelligibility at lower sound output levels.
Pages: 954-957
Page count: 4
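
The procedure summarised in the abstract (stationary band weights chosen by a genetic algorithm to maximise glimpse proportion, with no increase in overall level) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes speech and noise are already given as band-by-frame energy matrices, uses a simplified glimpse proportion (the fraction of time-frequency cells whose local SNR exceeds 3 dB) rather than one computed on an auditory-model representation, and uses a deliberately basic genetic algorithm. All function names, parameter values, and the synthetic data are hypothetical.

```python
import numpy as np


def apply_weights(speech, weights):
    """Scale each band's energy by its weight, then restore the original total energy."""
    weighted = speech * weights[:, None]
    return weighted * (speech.sum() / weighted.sum())


def glimpse_proportion(speech, noise, threshold_db=3.0):
    """Simplified metric: fraction of cells whose local SNR exceeds threshold_db."""
    eps = 1e-12  # avoids log of zero
    local_snr_db = 10.0 * np.log10((speech + eps) / (noise + eps))
    return float(np.mean(local_snr_db > threshold_db))


def optimise_weights(speech, noise, pop_size=40, generations=60, seed=0):
    """Basic genetic algorithm over per-band weights (illustrative settings only)."""
    rng = np.random.default_rng(seed)
    n_bands = speech.shape[0]

    def fitness(w):
        return glimpse_proportion(apply_weights(speech, w), noise)

    population = rng.uniform(0.01, 1.0, size=(pop_size, n_bands))
    population[0] = 1.0  # flat weighting, i.e. unmodified speech

    for _ in range(generations):
        scores = np.array([fitness(w) for w in population])
        order = np.argsort(scores)[::-1]
        parents = population[order[: pop_size // 2]]  # keep the fitter half

        # Uniform crossover between randomly paired parents.
        idx_a = rng.integers(0, len(parents), size=pop_size - 1)
        idx_b = rng.integers(0, len(parents), size=pop_size - 1)
        mask = rng.random((pop_size - 1, n_bands)) < 0.5
        children = np.where(mask, parents[idx_a], parents[idx_b])

        # Multiplicative mutation keeps weights positive.
        children = children * np.exp(rng.normal(0.0, 0.2, size=children.shape))
        children = np.clip(children, 1e-3, None)

        # Elitism: the best individual survives unchanged.
        population = np.vstack([parents[0][None, :], children])

    scores = np.array([fitness(w) for w in population])
    return population[np.argmax(scores)]


# Synthetic example: 8 bands, 200 frames, noise stronger in the lower 4 bands.
data_rng = np.random.default_rng(1)
speech = data_rng.gamma(shape=1.0, scale=1.0, size=(8, 200))
noise = np.full((8, 200), 2.0)
noise[:4] *= 4.0

best = optimise_weights(speech, noise)
baseline_gp = glimpse_proportion(speech, noise)
optimised_gp = glimpse_proportion(apply_weights(speech, best), noise)
print("baseline GP:", baseline_gp)
print("optimised GP:", optimised_gp)
print("normalised weights:", np.round(best / best.sum(), 3))
```

Because the flat weighting is seeded into the initial population and the best individual is carried over each generation, the optimised score in this sketch can never fall below the unweighted baseline. The renormalisation in `apply_weights` is what enforces the constraint that energy is reallocated across bands rather than added.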