A Model-Based Spectral Envelope Wiener Filter for Perceptually Motivated Speech Enhancement

Cited by: 0
Authors
Hadir, Najib [1 ]
Faubel, Friedrich [1 ]
Klakow, Dietrich [1 ]
Affiliations
[1] Univ Saarland, D-66123 Saarbrucken, Germany
Source
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5 | 2011
Keywords
speech enhancement; Bayesian estimation; signal reconstruction; NOISE; MIXTURE;
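DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract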
In this work, we present a model-based Wiener filter whose frequency response is optimized in the dimensionally reduced log-Mel domain. This is achieved by making use of a reasonably novel speech feature enhancement approach that was originally developed in the area of speech recognition. Its combination with Wiener filtering is motivated by the fact that signal reconstruction from log-Mel features sounds very unnatural. Hence, we correct only the spectral envelope and preserve the fine spectral structure of the noisy signal. Experiments on a Wall Street Journal corpus showed relative improvements of up to 24% in PESQ and 45% in log spectral distance (LSD), compared to Ephraim and Malah's log spectral amplitude estimator.
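The idea of correcting only the spectral envelope while keeping the noisy fine structure can be illustrated with a minimal sketch. This is not the authors' implementation: the model-based log-Mel estimator itself is not reproduced, and the clean log-Mel estimate is simply assumed to be given. The triangular filterbank, the clipping of the gain to [0, 1], the way the Mel-domain gain is spread back to FFT bins, and all function names (`mel_filterbank`, `envelope_gain`, `enhance_frame`) are assumptions made for illustration.

```python
import numpy as np

def hz_to_mel(f):
    return 2595.0 * np.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft, sample_rate):
    """Triangular Mel filters, shape (n_mels, n_fft // 2 + 1)."""
    n_bins = n_fft // 2 + 1
    freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_mels + 2))
    fb = np.zeros((n_mels, n_bins))
    for m in range(n_mels):
        lo, mid, hi = edges[m], edges[m + 1], edges[m + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        fb[m] = np.clip(np.minimum(rising, falling), 0.0, None)
    return fb

def envelope_gain(noisy_power, clean_logmel_est, fb, floor=1e-10):
    """Per-bin power gain derived from a log-Mel envelope correction.

    clean_logmel_est is assumed to come from some external feature
    enhancement step (hypothetical input, not computed here).
    """
    noisy_logmel = np.log(np.maximum(fb @ noisy_power, floor))
    # Ratio of estimated clean to noisy Mel power; clipping to [0, 1]
    # is an assumption that keeps the filter purely attenuating.
    mel_gain = np.clip(np.exp(clean_logmel_est - noisy_logmel), 0.0, 1.0)
    # Spread the Mel-band gains back to FFT bins as a weighted average.
    col_sum = fb.sum(axis=0)
    covered = col_sum > 0
    gain = np.ones(fb.shape[1])  # bins no filter covers stay unchanged
    gain[covered] = (fb[:, covered].T @ mel_gain) / col_sum[covered]
    return gain

def enhance_frame(noisy_spectrum, clean_logmel_est, fb):
    """Scale the complex noisy spectrum by a smooth real gain.

    Only the envelope changes; the phase and the fine spectral
    structure of the noisy frame are kept.
    """
    gain = envelope_gain(np.abs(noisy_spectrum) ** 2, clean_logmel_est, fb)
    return np.sqrt(gain) * noisy_spectrum
```

Because the gain is real, non-negative, and smooth across frequency, the harmonic detail and phase of the noisy frame pass through unchanged, which is the property the abstract relies on to avoid resynthesizing speech directly from log-Mel features.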
Pages: 220-223
Number of pages: 4
Related papers
50 in total
  • [1] Perceptually Motivated Generalized Spectral Subtraction for Speech Enhancement
    Zoghlami, Novlene
    Lachiri, Zied
    Ellouze, Noureddine
    ADVANCES IN NONLINEAR SPEECH PROCESSING, 2010, 5933 : 136 - 143
  • [2] A Perceptually Motivated Estimator for Speech Enhancement
    Montazeri, Vahid
    Khoubrouy, Soudeh A.
    Panahi, Issa M. S.
    2013 8TH INTERNATIONAL SYMPOSIUM ON IMAGE AND SIGNAL PROCESSING AND ANALYSIS (ISPA), 2013, : 366 - 370
  • [3] A perceptually motivated approach for speech enhancement
    Hu, Y
    Loizou, PC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05): : 457 - 465
  • [4] A Perceptually Motivated Multi-Band Spectral Subtraction Algorithm for Enhancement of Degraded Speech
    Upadhyay, Navneet
    Karmakar, Abhijit
    2012 THIRD INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGY (ICCCT), 2012, : 340 - 345
  • [5] Speech enhancement based on perceptually motivated guided spectrogram filtering
    Wang, Jie
    Yan, Linhuang
    Yang, Qiaohe
    Yuan, Minmin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 40 (03) : 5443 - 5454
  • [6] Speech enhancement based on perceptually motivated Bayesian estimators of the magnitude spectrum
    Loizou, PC
    IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2005, 13 (05): : 857 - 869
  • [7] A Perceptually Motivated Approach for Speech Enhancement Based on Deep Neural Network
    Han, Wei
    Zhang, Xiongwei
    Min, Gang
    Sun, Meng
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (04): : 835 - 838
  • [8] Efficient beta-order Perceptually Motivated Spectral Amplitude Bayesian Estimator Based On Chi-distribution for Speech Enhancement
    Zhao, Huan
    Yang, Yong
    Lu, Zhiqiang
    Yu, Fei
    JOURNAL OF COMPUTERS, 2012, 7 (11) : 2829 - 2835
  • [9] Spectral difference for statistical model-based speech enhancement in speech recognition
    Lee, Soojeong
    Chang, Joon-Hyuk
    MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (23) : 24917 - 24929
  • [10] Spectral difference for statistical model-based speech enhancement in speech recognition
    Soojeong Lee
    Joon-Hyuk Chang
    Multimedia Tools and Applications, 2017, 76 : 24917 - 24929