A Modified Speech Enhancement Algorithm Using A Universal Speaker Model

Cited by: 0

Authors:

Guo, Li [1]

Jiang, Wenbin [1]

Ying, Rendong [1]

Liu, Peilin [1]

Affiliations:

[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200030, Peoples R China

Source:

2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2014

Keywords:

GMMS; IMCRA; MFCCs; speech enhancement; STATISTICAL-MODEL;

DOI:

Not available

Chinese Library Classification (CLC) number:

TM [Electrical technology]; TN [Electronic technology, communication technology];

Discipline classification codes:

0808 ; 0809 ;

Abstract:

In this paper, we propose a statistical model-based speech enhancement algorithm that uses improved minima controlled recursive averaging (IMCRA) noise estimation and decision-directed (DD) a priori SNR estimation. In the training stage, a Gaussian mixture model (GMM) of the Mel-frequency cepstral coefficients (MFCCs) of a universal speaker is obtained. In the speech enhancement stage, the minima tracking process of the IMCRA noise estimation is adjusted using the noisy power spectrum of the current frame and an adjustment weighting factor. In addition, based on the universal GMM, some significant constant parameters are replaced by frequency-varying parameters, such as the weighting parameter in the DD a priori SNR estimation and the adjustment weighting factor in the modified minima tracking process of IMCRA. The performance of the proposed speech enhancement algorithm is evaluated by objective tests under various stationary and non-stationary noise environments. The experimental results show that the proposed scheme performs better than the conventional approaches and is suitable for use as a pre-processing stage in speech processing systems.
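
For illustration only, the decision-directed a priori SNR estimate mentioned in the abstract can be sketched with a per-bin weighting parameter in place of a single constant. This is a minimal sketch of the standard DD recursion, not the authors' implementation: the function names, the `xi_min` floor, the use of the current noise estimate in both terms, and the `alpha` values are all assumptions, and the abstract does not say how the universal GMM sets the frequency-varying weights.

```python
def dd_a_priori_snr(prev_clean_power, noise_power, noisy_power, alpha, xi_min=1e-3):
    """Decision-directed a priori SNR estimate for one frame.

    All arguments except xi_min are lists with one value per frequency bin:
      prev_clean_power: |S_hat(k, l-1)|^2, clean-speech power estimated in the previous frame
      noise_power:      noise power estimate for the current frame (e.g. from IMCRA)
      noisy_power:      |Y(k, l)|^2, noisy power spectrum of the current frame
      alpha:            per-bin weighting parameter in [0, 1] (hypothetical values;
                        a constant in the conventional DD approach)
    """
    xi = []
    for s_prev, lam, y, a in zip(prev_clean_power, noise_power, noisy_power, alpha):
        gamma = y / lam                    # a posteriori SNR
        ml_term = max(gamma - 1.0, 0.0)    # instantaneous estimate, clipped at zero
        xi_k = a * (s_prev / lam) + (1.0 - a) * ml_term
        xi.append(max(xi_k, xi_min))       # floor (assumed) to avoid a zero SNR
    return xi


def wiener_gain(xi):
    """Wiener gain per bin, used here only as a simple example of a gain rule."""
    return [x / (1.0 + x) for x in xi]


# Two-bin toy frame with different weights per bin.
xi = dd_a_priori_snr(
    prev_clean_power=[4.0, 0.0],
    noise_power=[1.0, 1.0],
    noisy_power=[5.0, 0.5],
    alpha=[0.5, 0.98],
)
gain = wiener_gain(xi)
```

A weight close to 1 smooths the estimate heavily across frames, while a smaller weight lets the estimate follow the current frame more closely; making the weight depend on the frequency bin allows that trade-off to differ across the spectrum.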


Pages: 521-526

Page count: 6

50 records in total

[31] Speech/speaker recognition using a HMM/GMM hybrid model
Rodriguez, E
Ruiz, B
Garcia-Crespo, A
Garcia, F
AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 227 - 234
[32] Speaker adaptive speech recognition using phone pair model
Li, BJ
Hirose, K
2000 5TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS, VOLS I-III, 2000, : 714 - 717
[33] A modified crosstalk resistant adaptive noise canceller algorithm for speech enhancement
Lin, Jie
Li, Jian Ping
Zhan, Si Yu
Liao, Jian Ming
Fu, Yan
WAVELET ACTIVE MEDIA TECHNOLOGY AND INFORMATION PROCESSING, VOL 1 AND 2, 2006, : 180 - +
[34] Modified Wiener Filtering Speech Enhancement Algorithm with Phase Spectrum Compensation
Zhang Wenlu
Peng Hua
2017 IEEE 9TH INTERNATIONAL CONFERENCE ON COMMUNICATION SOFTWARE AND NETWORKS (ICCSN), 2017, : 1075 - 1079
[35] A KIND OF MODIFIED SPEECH ENHANCEMENT ALGORITHM BASED ON WAVELET PACKAGE TRANSFORMATION
Zhang, L. H.
Rong, G. F.
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON WAVELET ANALYSIS AND PATTERN RECOGNITION, VOLS 1 AND 2, 2008, : 421 - 425
[36] INVENTORY BASED SPEECH ENHANCEMENT FOR SPEAKER DEDICATED SPEECH COMMUNICATION SYSTEMS
Xiao, Xiaoqiang
Lee, Peng
Nickel, Robert M.
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3877 - +
[37] Speech recognition enhancement using beamforming and a genetic algorithm
Chan, K. Y.
Yiu, K. F. C.
Low, S. Y.
Nordholm, S.
Ling, S. H.
NSS: 2009 3RD INTERNATIONAL CONFERENCE ON NETWORK AND SYSTEM SECURITY, 2009, : 510 - +
[38] A Speech Enhancement Algorithm Based on a Chi MRF Model of the Speech STFT Amplitudes
Andrianakis, Yiannis
White, Paul R.
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (08): : 1508 - 1517
[39] Speech Enhancement Algorithm Using Recursive Wavelet Shrinkage
Lee, Gihyoun
Na, Sung Dae
Seong, KiWoong
Cho, Jin-Ho
Kim, Myoung Nam
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2016, E99D (07): : 1945 - 1948
[40] Speech Enhancement Using Affine Projection Algorithm in Subband
Mahabadi, Ali Ameri
Eshghi, Mohammad
2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 222 - +
