Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression

Authors
Zorila, Tudor-Catalin
Kandia, Varvara
Stylianou, Yannis
Source
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012
Keywords
speech-in-noise enhancement; speech intelligibility; spectral shaping; dynamic range compression; CLEAR SPEECH; LISTENERS;
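DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract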
In this paper, we suggest a non-parametric way to improve the intelligibility of speech in noise. The signal is enhanced before being presented in a noisy environment, under the constraint of equal global signal power before and after modification. Two systems are combined in cascade to enhance the quality of the signal first in frequency (spectral shaping) and then in time (dynamic range compression). Experiments with speech-shaped noise (SSN) and competing-speaker (CS) noise at various low SNR values show that the suggested approach outperforms state-of-the-art methods in terms of the Speech Intelligibility Index (SII). In terms of SNR gain, there is an improvement of 7 dB (SSN) and 8 dB (CS) over these methods. A formal listening test confirms the efficiency of the suggested system in enhancing speech intelligibility in noise.
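The cascade described in the abstract (shaping in frequency, then compression in time, with global power held equal) can be illustrated with a minimal sketch. This is not the authors' system: the first-order pre-emphasis filter, the moving-average envelope, the power-law gain, and every parameter value (`coeff`, `win_ms`, `exponent`) are assumptions chosen only to show the overall structure.

```python
import numpy as np


def rms(x):
    """Root-mean-square level of a signal."""
    return float(np.sqrt(np.mean(np.square(x))))


def spectral_shaping(x, coeff=0.95):
    """Hypothetical stand-in for spectral shaping: a first-order
    pre-emphasis filter y[n] = x[n] - coeff * x[n-1], which tilts
    energy toward higher frequencies."""
    x = np.asarray(x, dtype=float)
    y = np.empty_like(x)
    y[0] = x[0]
    y[1:] = x[1:] - coeff * x[:-1]
    return y


def dynamic_range_compression(x, fs, win_ms=5.0, exponent=0.5, eps=1e-8):
    """Hypothetical stand-in for dynamic range compression: estimate a
    temporal envelope with a moving average of |x|, then apply a
    power-law gain so low-level segments are raised relative to
    high-level ones (exponent < 1 compresses)."""
    x = np.asarray(x, dtype=float)
    win = max(1, int(fs * win_ms / 1000.0))
    kernel = np.ones(win) / win
    env = np.convolve(np.abs(x), kernel, mode="same")
    env_ref = env.max() + eps
    gain = ((env + eps) / env_ref) ** (exponent - 1.0)
    return x * gain


def enhance(x, fs):
    """Cascade: shaping in frequency, then compression in time, then
    rescale so the output has the same global RMS power as the input."""
    x = np.asarray(x, dtype=float)
    y = dynamic_range_compression(spectral_shaping(x), fs)
    return y * (rms(x) / (rms(y) + 1e-12))
```

The final rescaling step is what enforces the equal-power constraint: any intelligibility benefit would then have to come from redistributing energy across frequency and time rather than from simply playing the signal louder.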
Pages: 634-637
Number of pages: 4
Related papers
50 in total
  • [21] Speech intelligibility improvement in car noise environment by voice transformation
    Nathwani, Karan
    Richard, Gael
    David, Bertrand
    Prablanc, Pierre
    Roussarie, Vincent
    SPEECH COMMUNICATION, 2017, 91 : 17 - 27
  • [22] Optimal Speech Intelligibility Improvement for Varying Car Noise Characteristics
    Ritujoy Biswas
    Karan Nathwani
    Faizal Hafiz
    Akshya Swain
    Journal of Signal Processing Systems, 2022, 94 : 1429 - 1446
  • [23] The dynamic range of speech, compression, and its effect on the speech reception threshold in stationary and interrupted noise
    Rhebergen, Koenraad S.
    Versfeld, Niek J.
    Dreschler, Wouter. A.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2009, 126 (06) : 3236 - 3245
  • [24] FORMANT SHIFTING FOR SPEECH INTELLIGIBILITY IMPROVEMENT IN CAR NOISE ENVIRONMENT
    Nathwani, Karan
    Daniel, Morgane
    Richard, Gael
    David, Bertrand
    Roussarie, Vincent
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 5375 - 5379
  • [25] SPEECH INTELLIGIBILITY IMPROVEMENT USING THE CONSTRAINTS ON SPEECH DISTORTION AND NOISE OVER-ESTIMATION
    Li, Na
    Bao, Chang-chun
    Xia, Bing-yin
    Bao, Feng
    2013 NINTH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION HIDING AND MULTIMEDIA SIGNAL PROCESSING (IIH-MSP 2013), 2013, : 602 - 606
  • [26] Optimised spectral weightings for noise-dependent speech intelligibility enhancement
    Tang, Yan
    Cooke, Martin
    13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 954 - 957
  • [27] SPEECH INTELLIGIBILITY ENHANCEMENT USING NON-PARALLEL SPEAKING STYLE CONVERSION WITH STARGAN AND DYNAMIC RANGE COMPRESSION
    Li, Gang
    Hu, Ruimin
    Ke, Shanfa
    Zhang, Rui
    Wang, Xiaochen
    Gao, Li
    2020 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), 2020,
  • [28] The contribution of changes in F0 and spectral tilt to increased intelligibility of speech produced in noise
    Lu, Youyi
    Cooke, Martin
    SPEECH COMMUNICATION, 2009, 51 (12) : 1253 - 1262
  • [29] Effects of Reverberation on the Relation Between Compression Speed and Working Memory for Speech-in-Noise Perception
    Reinhart, Paul
    Zahorik, Pavel
    Souza, Pamela
    EAR AND HEARING, 2019, 40 (05) : 1098 - 1105
  • [30] Frequency-lowering processing to improve speech-in-noise intelligibility in patients with age-related hearing loss
    Bruno, Rocco
    Freni, Francesco
    Portelli, Daniele
    Alberti, Giuseppe
    Gazia, Francesco
    Meduri, Alessandro
    Galletti, Francesco
    Galletti, Bruno
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2021, 278 (10) : 3697 - 3706