AR Model-based Bayesian Speech Enhancement for Nonstationary Environment

被引:0
作者
Huang, Qinghua [1 ]
Liu, Kai [1 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai, Peoples R China
来源
INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL SCIENCES AND OPTIMIZATION, VOL 1, PROCEEDINGS | 2009年
关键词
D O I
10.1109/CSO.2009.171
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A new technique for enhancing audio signal from a noisy nonstationary environment is presented in the paper. Autoregressive (AR) model is used to efficiently exploit the temporally correlated information of audio and noise signals during a short stationary frame. The temporal models of signals and noisy process are combined to construct a state space. The state space appropriately describes that the observed noisy signal is generated from two underlying sources which evolve with Markovian dynamics across successive step times. In the state space, the clean speech and the noise are two hidden source signals. The recovery of clean speech and the estimation of all the model parameters are carried out within the variational Bayesian framework. The original speech can be estimated as a state using a variational Kalman smoother. The experimental results show that our approach can obtain better performance in terms of signal-to-noise ratio (SNR) measure.
引用
收藏
页码:918 / 921
页数:4
相关论文
共 9 条
  • [1] [Anonymous], 2001, DISCRETE TIME SPEECH
  • [2] Beal MJ, 2003, VARIATIONAL ALGORITH
  • [3] A new approach for speech enhancement based on the adaptive thresholding of the wavelet packets
    Ghanbari, Yasser
    Karami-Mollaei, Mohammad Reza
    [J]. SPEECH COMMUNICATION, 2006, 48 (08) : 927 - 940
  • [4] Hu Y, 2006, INT CONF ACOUST SPEE, P153
  • [5] Bayesian nonstationary source separation
    Huang, Qinghua
    Yang, Jie
    Zhou, Yue
    [J]. NEUROCOMPUTING, 2008, 71 (7-9) : 1714 - 1729
  • [6] Variational Bayesian method for speech enhancement
    Huang, Qinghua
    Yang, Jie
    Zhou, Yue
    [J]. NEUROCOMPUTING, 2007, 70 (16-18) : 3063 - 3067
  • [7] Noise power spectral density estimation based on optimal smoothing and minimum statistics
    Martin, R
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2001, 9 (05): : 504 - 512
  • [8] Martin R., 1994, Proc. Eur. Signal Process, P1182
  • [9] Variational Bayes for generalized autoregressive models
    Roberts, SJ
    Penny, WD
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (09) : 2245 - 2257