Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression

被引:0
作者
Zorila, Tudor-Catalin
Kandia, Varvara
Stylianou, Yannis
机构
来源
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年
关键词
speech-in-noise enhancement; speech intelligibility; spectral shaping; dynamic range compression; CLEAR SPEECH; LISTENERS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we suggest a non-parametric way to improve the intelligibility of speech in noise. The signal is enhanced before presented in a noisy environment, under the constraint of equal global signal power before and after modifications. Two systems are combined in a cascade form to enhance the quality of the signal first in frequency (spectral shaping) and then in time (dynamic range compression). Experiments with speech shaped (SSN) and competing speaker (CS) types of noise at various low SNR values, show that the suggested approach outperforms state-of-the art methods in terms of the Speech Intelligibility Index (SII). In terms of SNR gain there is an improvement of 7 dB (SSN) and 8 dB (CS) over these methods. A formal listening test confirm the efficiency of the suggested system in enhancing speech intelligibility in noise.
引用
收藏
页码:634 / 637
页数:4
相关论文
共 50 条
  • [31] Effects of noise and reverberation on speech recognition with variants of a multichannel adaptive dynamic range compression scheme
    Rallapalli, Varsha H.
    Alexander, Joshua M.
    INTERNATIONAL JOURNAL OF AUDIOLOGY, 2019, 58 (10) : 661 - 669
  • [32] A dynamic auditory-cognitive system supports speech-in-noise perception in older adults
    Anderson, Samira
    White-Schwoch, Travis
    Parbery-Clark, Alexandra
    Kraus, Nina
    HEARING RESEARCH, 2013, 300 : 18 - 32
  • [33] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
    Dong, Huan-Yu
    Lee, Chang-Myung
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2018,
  • [34] Speech intelligibility improvement in noisy reverberant environments based on speech enhancement and inverse filtering
    Huan-Yu Dong
    Chang-Myung Lee
    EURASIP Journal on Audio, Speech, and Music Processing, 2018
  • [35] Modeling the effects of dynamic range compression on signals in noise
    Corey, Ryan M.
    Singer, Andrew C.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2021, 150 (01) : 159 - 170
  • [36] Characterization of the Intelligibility of Vowel-Consonant-Vowel (VCV) Recordings in Five Languages for Application in Speech-in-Noise Screening in Multilingual Settings
    Rocco, Giulia
    Bernardi, Giuliano
    Ali, Randall
    van Waterschoot, Toon
    Polo, Edoardo Maria
    Barbieri, Riccardo
    Paglialonga, Alessia
    APPLIED SCIENCES-BASEL, 2023, 13 (09):
  • [37] A new binary mask based on noise constraints for improved speech intelligibility
    Kim, Gibak
    Loizou, Philipos C.
    11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 3 AND 4, 2010, : 1632 - 1635
  • [38] Effect of slow-acting wide dynamic range compression on measures of intelligibility and ratings of speech quality in simulated-loss listeners
    Rosengard, PS
    Payton, KL
    Braida, LD
    JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 2005, 48 (03): : 702 - 714
  • [39] Signal-to-Noise-Ratio-Aware Dynamic Range Compression in Hearing Aids
    May, Tobias
    Kowalewski, Borys
    Dau, Torsten
    TRENDS IN HEARING, 2018, 22
  • [40] Selective Frequency Enhancement of Speech Signal for Intelligibility Improvement in Presence of Near-end Noise
    Premananda, B. S.
    Uma, B., V
    PROCEEDINGS OF 4TH INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATION AND CONTROL(ICAC3'15), 2015, 49 : 244 - 252