Combining spectral and temporal modification techniques for speech intelligibility enhancement

被引:6
|
作者
Cooke, Martin [1 ,2 ]
Aubanel, Vincent [3 ]
Garcia Lecumberri, Maria Luisa [2 ]
机构
[1] Ikerbasque Basque Sci Fdn, Bilbao, Spain
[2] Univ Basque Country, Language & Speech Lab, Vitoria 01006, Spain
[3] Univ Grenoble Alpes, Ctr Natl Rech Sci, GIPSA Lab, Grenoble, France
关键词
Speech modification; Intelligibility; Retiming; Glimpsing; COCHLEA-SCALED ENTROPY; NOISE; CLEAR; INTENSITY;
D O I
10.1016/j.csl.2018.10.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Modifying clean speech prior to output in noisy conditions can lead to substantial intelligibility gains. Most algorithms operate by redistributing energy across the signal, leaving the timing of the underlying speech sounds intact. Other techniques do alter the timing of speech relative to the masker. Both classes of approach - spectral and temporal - lead to a reduction in energetic masking. The current study examines how their combination affects intelligibility. Arguments can be made for both synergy and redundancy, and the presence of distortions introduced by both spectral and temporal approaches might even lead to an antagonistic combination. A cohort of native Spanish listeners identified keywords in sentences in unmodified form and following spectral, temporal and spectro-temporal modification, in the presence of a fluctuating masker. Errors in the spectro-temporal condition were substantially lower than following spectral or temporal modification alone, with a three-fold reduction compared to unmodified speech. Spectro-temporal gains were observed for all phonemes. A glimpse-based model of energetic masking incorporating speech rate changes predicts intelligibility (r = .96), and a glimpsing analysis provides further insights into the distinct mechanisms through which spectral and temporal approaches lead to a release from energetic masking. (C) 2018 Elsevier Ltd. All rights reserved.
引用
收藏
页码:26 / 39
页数:14
相关论文
共 50 条
  • [1] Learning static spectral weightings for speech intelligibility enhancement in noise
    Tang, Yan
    Cooke, Martin
    COMPUTER SPEECH AND LANGUAGE, 2018, 49 : 1 - 16
  • [2] Effects of Enhancement of Spectral Changes on Speech Quality and Subjective Speech Intelligibility
    Chen, Jing
    Baer, Thomas
    Moore, Brian C. J.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1640 - 1643
  • [3] Real-Time Modulation Enhancement of Temporal Envelopes for Increasing Speech Intelligibility
    Koutsogiannaki, Maria
    Francois, Holly
    Choo, Kihyun
    Oh, Eunmi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1973 - 1977
  • [4] Modulation Enhancement of Temporal Envelopes for Increasing Speech Intelligibility in Noise
    Koutsogiannaki, Maria
    Stylianou, Yannis
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2508 - 2512
  • [5] Enhancement of cleft palate speech using temporal and spectral processing
    Sudro, Protima Nomo
    Prasanna, S. R. Mahadeva
    SPEECH COMMUNICATION, 2020, 123 : 70 - 82
  • [6] Optimised spectral weightings for noise-dependent speech intelligibility enhancement
    Tang, Yan
    Cooke, Martin
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 954 - 957
  • [7] Effects on speech intelligibility of temporal jittering and spectral smearing of the high-frequency components of speech
    MacDonald, Ewen N.
    Pichora-Fuller, M. Kathleen
    Schneider, Bruce A.
    HEARING RESEARCH, 2010, 261 (1-2) : 63 - 66
  • [8] Speech Intelligibility Enhancement on Android Platform by Consonant-Vowel-Ratio Modification
    Sarath, P. G.
    Jayan, A. R.
    2016 INTERNATIONAL CONFERENCE ON NEXT GENERATION INTELLIGENT SYSTEMS (ICNGIS), 2016, : 148 - 152
  • [9] Effect of enhancement of spectral changes on speech intelligibility and clarity preferences for the hearing impaired
    Chen, Jing
    Baer, Thomas
    Moore, Brian C. J.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2012, 131 (04) : 2987 - 2998
  • [10] The effects of speech intelligibility and temporal-spectral variability on performance and annoyance ratings
    Liebl, Andreas
    Assfalg, Alexander
    Schlittmeier, Sabine J.
    APPLIED ACOUSTICS, 2016, 110 : 170 - 175