A Modified Speech Enhancement Algorithm Using A Universal Speaker Model

被引:0
|
作者
Guo, Li [1 ]
Jiang, Wenbin [1 ]
Ying, Rendong [1 ]
Liu, Peilin [1 ]
机构
[1] Shanghai Jiao Tong Univ, Sch elect informat & elect engn, Shanghai 200030, Peoples R China
关键词
GMMS; IMCRA; MFCCs; speech enhancement; STATISTICAL-MODEL;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we propose a statistical model-based speech enhancement algorithm using an improved minima controlled recursive averaging (IMCRA) noise estimation and a decision-directed (DD) priori SNR estimation. In the training stage, the Gaussian mixture model (GMM) of the Mel-frequency cepstral coefficients (MFCCs) of universal speaker is obtained. In speech enhancement stage, minima tracking process of IMCRA noise estimation is adjusted with the noisy power spectrum of current frame and an adjustment weighting factor. In addition, based on the universal GMM, some significant constant parameters are replaced by frequency-varying parameters, such as the weighting parameter in the DD priori SNR estimation and the adjustment weighting factor in the modified minima tracking process of IMCRA. The performance of proposed speech enhancement is evaluated by objective tests under various stationary and non-stationary noise environments. From experimental results, compared to the conventional approaches, the proposed scheme performs better and is suitable for being used as the pre-processing of speech processing systems.
引用
收藏
页码:521 / 526
页数:6
相关论文
共 50 条
  • [31] Speech/speaker recognition using a HMM/GMM hybrid model
    Rodriguez, E
    Ruiz, B
    Garcia-Crespo, A
    Garcia, F
    AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 227 - 234
  • [32] Speaker adaptive speech recognition using phone pair model
    Li, BJ
    Hirose, K
    2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 714 - 717
  • [33] A modified crosstalk resistant adaptive noise canceller algorithm for speech enhancement
    Lin, Jie
    Li, Jian Ping
    Zhan, Si Yu
    Liao, Jian Ming
    Fu, Yan
    WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING, VOL 1 AND 2, 2006, : 180 - +
  • [34] Modified Wiener Filtering Speech Enhancement Algorithm with Phase Spectrum Compensation
    Zhang Wenlu
    Peng Hua
    2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1075 - 1079
  • [35] A KIND OF MODIFIED SPEECH ENHANCEMENT ALGORITHM BASED ON WAVELET PACKAGE TRANSFORMATION
    Zhang, L. H.
    Rong, G. F.
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1 AND 2, 2008, : 421 - 425
  • [36] INVENTORY BASED SPEECH ENHANCEMENT FOR SPEAKER DEDICATED SPEECH COMMUNICATION SYSTEMS
    Xiao, Xiaoqiang
    Lee, Peng
    Nickel, Robert M.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3877 - +
  • [37] Speech recognition enhancement using beamforming and a genetic algorithm
    Chan, K. Y.
    Yiu, K. F. C.
    Low, S. Y.
    Nordholm, S.
    Ling, S. H.
    NSS: 2009 3RD INTERNATIONAL CONFERENCE ON NETWORK AND SYSTEM SECURITY, 2009, : 510 - +
  • [38] A Speech Enhancement Algorithm Based on a Chi MRF Model of the Speech STFT Amplitudes
    Andrianakis, Yiannis
    White, Paul R.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (08): : 1508 - 1517
  • [39] Speech Enhancement Algorithm Using Recursive Wavelet Shrinkage
    Lee, Gihyoun
    Na, Sung Dae
    Seong, KiWoong
    Cho, Jin-Ho
    Kim, Myoung Nam
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (07): : 1945 - 1948
  • [40] Speech Enhancement Using Affine Projection Algorithm in Subband
    Mahabadi, Ali Ameri
    Eshghi, Mohammad
    2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 222 - +