A Model-Based Spectral Envelope Wiener Filter for Perceptually Motivated Speech Enhancement

Cited by: 0
Authors
Hadir, Najib [1]
Faubel, Friedrich [1]
Klakow, Dietrich [1]
Affiliations
[1] Univ Saarland, D-66123 Saarbrucken, Germany
Source
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011
Keywords
speech enhancement; Bayesian estimation; signal reconstruction; NOISE; MIXTURE;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this work, we present a model-based Wiener filter whose frequency response is optimized in the dimensionally reduced log-Mel domain. This is achieved by making use of a reasonably novel speech feature enhancement approach that was originally developed in the area of speech recognition. Its combination with Wiener filtering is motivated by the fact that signal reconstruction from log-Mel features sounds very unnatural. Hence, we correct only the spectral envelope and preserve the fine spectral structure of the noisy signal. Experiments on a Wall Street Journal corpus showed relative improvements of up to 24% in PESQ and 45% in log spectral distance (LSD), compared to Ephraim and Malah's log spectral amplitude estimator.
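The idea of correcting only the spectral envelope can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how the log-Mel result is mapped back to a linear-frequency filter, so the weighted-average back-mapping below is an assumption, as are all function names, the gain floor, and the filterbank size. The model-based estimate of the clean log-Mel features is taken as a given input (`clean_logmel_est`) and is not implemented here.

```python
import numpy as np


def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)


def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)


def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular Mel filterbank of shape (n_mels, n_fft // 2 + 1)."""
    n_bins = n_fft // 2 + 1
    freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, n_bins))
    for m in range(n_mels):
        lo, mid, hi = edges[m], edges[m + 1], edges[m + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        fb[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb


def envelope_gain(noisy_power, clean_logmel_est, fb, floor=1e-3):
    """Per-bin Wiener-style gain from estimated-clean vs. noisy Mel energies."""
    noisy_mel = fb @ noisy_power + 1e-12
    # Ratio of clean to noisy band energy, limited to [floor, 1].
    mel_gain = np.clip(np.exp(clean_logmel_est) / noisy_mel, floor, 1.0)
    # Assumed back-mapping: each bin takes the filter-weighted average
    # of the gains of the Mel bands that cover it.
    col_sum = fb.sum(axis=0)
    covered = col_sum > 1e-8
    weights = fb / np.where(covered, col_sum, 1.0)
    # Bins outside every filter (DC, Nyquist) are left unchanged.
    return np.where(covered, weights.T @ mel_gain, 1.0)


def enhance_frame(noisy_spectrum, clean_logmel_est, fb):
    """Scale one complex STFT frame; phase and fine structure are kept."""
    gain = envelope_gain(np.abs(noisy_spectrum) ** 2, clean_logmel_est, fb)
    return gain * noisy_spectrum
```

Because the gain is real, positive, and smooth across frequency, the harmonic fine structure and phase of the noisy frame carry over to the output; only the coarse envelope is reshaped.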
Pages: 220 - 223
Page count: 4
Related papers
50 in total
  • [41] On using acoustic environment classification for statistical model-based speech enhancement
    Choi, Jae-Hun
    Chang, Joon-Hyuk
    SPEECH COMMUNICATION, 2012, 54 (03) : 477 - 490
  • [42] Speech enhancement based on perceptually comfortable residual noise
    Shin, Jong Won
    Chang, Joon-Hyuk
    Kim, Nam Soo
    IEICE TRANSACTIONS ON COMMUNICATIONS, 2007, E90B (11) : 3323 - 3326
  • [43] SPEECH ENHANCEMENT USING IMPROVED MAP ESTIMATION AND WIENER FILTER
    Chehrehsa, Sarang
    Moir, Tom
    PROCEEDINGS OF 2016 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2016, : 494 - 498
  • [44] SPECTRO-TEMPORAL SUBBAND WIENER FILTER FOR SPEECH ENHANCEMENT
    Hsu, Chung-Chien
    Lin, Tse-En
    Chen, Jian-Hueng
    Chi, Tai-Shih
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4001 - 4004
  • [45] A Model-Based Soft Decision Approach for Speech Enhancement
    Wang, Xianyun
    Bao, Changchun
    Bao, Feng
    China Communications, 2017, 14 (09) : 11 - 22
  • [46] ENSEMBLE INFERENCE FOR DIFFUSION MODEL-BASED SPEECH ENHANCEMENT
    Shi, Hao
    Kamo, Naoyuki
    Delcroix, Marc
    Nakatani, Tomohiro
    Araki, Shoko
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 735 - 739
  • [47] Subband Spectral-Subtraction Speech Enhancement Based on the DFT Modulated Filter Banks
    Cai, Yu
    Hou, Chaohuan
    PROCEEDINGS OF 2012 IEEE 11TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) VOLS 1-3, 2012, : 571 - 574
  • [48] Model-Based Feature Enhancement for Reverberant Speech Recognition
    Krueger, Alexander
    Haeb-Umbach, Reinhold
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07) : 1692 - 1707
  • [49] Speech Enhancement Based on Analysis Synthesis Framework With Improved Pitch Estimation and Spectral Envelope Enhancement
    Liu, Bin
    Mo, Fuyuan
    Tao, Jianhua
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 461 - 466
  • [50] Speech enhancement using long short term memory with trained speech features and adaptive wiener filter
    Garg, Anil
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (03) : 3647 - 3675