Speech-in-noise intelligibility improvement based on spectral shaping and dynamic range compression

被引:0
作者
Zorila, Tudor-Catalin
Kandia, Varvara
Stylianou, Yannis
机构
来源
13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3 | 2012年
关键词
speech-in-noise enhancement; speech intelligibility; spectral shaping; dynamic range compression; CLEAR SPEECH; LISTENERS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we suggest a non-parametric way to improve the intelligibility of speech in noise. The signal is enhanced before presented in a noisy environment, under the constraint of equal global signal power before and after modifications. Two systems are combined in a cascade form to enhance the quality of the signal first in frequency (spectral shaping) and then in time (dynamic range compression). Experiments with speech shaped (SSN) and competing speaker (CS) types of noise at various low SNR values, show that the suggested approach outperforms state-of-the art methods in terms of the Speech Intelligibility Index (SII). In terms of SNR gain there is an improvement of 7 dB (SSN) and 8 dB (CS) over these methods. A formal listening test confirm the efficiency of the suggested system in enhancing speech intelligibility in noise.
引用
收藏
页码:634 / 637
页数:4
相关论文
共 50 条
  • [1] SPEECH-IN-NOISE INTELLIGIBILITY IMPROVEMENT BASED ON POWER RECOVERY AND DYNAMIC RANGE COMPRESSION
    Zorila, Tudor-Catalin
    Kandia, Varvara
    Stylianou, Yannis
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 2075 - 2079
  • [2] Increasing Speech Intelligibility via Spectral Shaping with Frequency Warping and Dynamic Range Compression plus Transient Enhancement
    Godoy, Elizabeth
    Stylianou, Yannis
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3539 - 3543
  • [3] End-to-End Neural Based Modification of Noisy Speech for Speech-in-Noise Intelligibility Improvement
    Shifas, Muhammed P., V
    Zorila, Catalin
    Stylianou, Yannis
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 162 - 173
  • [4] Near and Far Field Speech-in-Noise Intelligibility Improvements Based on a Time-Frequency Energy Reallocation Approach
    Zorila, Tudor-Catalin
    Stylianou, Yannis
    Ishihara, Tatsuma
    Akamine, Masami
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2016, 24 (10) : 1808 - 1818
  • [5] Auditory efferents involved in speech-in-noise intelligibility
    Giraud, AL
    Garnier, S
    Micheyl, C
    Lina, G
    Chays, A
    CheryCroze, S
    NEUROREPORT, 1997, 8 (07) : 1779 - 1783
  • [6] Improving speech intelligibility in noise by SII-dependent preprocessing using frequency-dependent amplification and dynamic range compression
    Schepker, Henning
    Rennies, Jan
    Doclo, Simon
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3544 - 3548
  • [7] Modeling Noise Influence to Speech Intelligibility Non-intrusively by Reduced Speech Dynamic Range
    Chen, Fei
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 1359 - 1362
  • [8] Glimpse-based estimation of speech intelligibility from speech-in-noise using artificial neural networks
    Tang, Yan
    COMPUTER SPEECH AND LANGUAGE, 2021, 69
  • [9] SII-based Speech Preprocessing for Intelligibility Improvement in Noise
    Taal, Cees H.
    Jensen, Jesper
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3549 - 3553
  • [10] Spontaneous Otoacoustic Emission Enhancement in Children with Reduced Speech-in-Noise Intelligibility
    Elgeti, Anja
    Zehnhoff-Dinnesen, Antoinette Gertrud am
    Matulat, Peter
    Schmidt, Claus-Michael
    Knief, Arne
    AUDIOLOGY AND NEURO-OTOLOGY, 2008, 13 (06) : 357 - 364