A Modified Speech Enhancement Algorithm Using A Universal Speaker Model

被引：0

作者：

Guo, Li ^{[1
]}

Jiang, Wenbin ^{[1
]}

Ying, Rendong ^{[1
]}

Liu, Peilin ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch elect informat & elect engn, Shanghai 200030, Peoples R China

来源：

2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2014年

关键词：

GMMS; IMCRA; MFCCs; speech enhancement; STATISTICAL-MODEL;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we propose a statistical model-based speech enhancement algorithm using an improved minima controlled recursive averaging (IMCRA) noise estimation and a decision-directed (DD) priori SNR estimation. In the training stage, the Gaussian mixture model (GMM) of the Mel-frequency cepstral coefficients (MFCCs) of universal speaker is obtained. In speech enhancement stage, minima tracking process of IMCRA noise estimation is adjusted with the noisy power spectrum of current frame and an adjustment weighting factor. In addition, based on the universal GMM, some significant constant parameters are replaced by frequency-varying parameters, such as the weighting parameter in the DD priori SNR estimation and the adjustment weighting factor in the modified minima tracking process of IMCRA. The performance of proposed speech enhancement is evaluated by objective tests under various stationary and non-stationary noise environments. From experimental results, compared to the conventional approaches, the proposed scheme performs better and is suitable for being used as the pre-processing of speech processing systems.

引用

页码：521 / 526

页数：6

共 50 条

[1] SPEAKER DEPENDENT SPEECH ENHANCEMENT USING SINUSOIDAL MODEL
Mowlaee, Pejman
Nachbar, Christian
2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2014, : 80 - 84
[2] Speech Enhancement Regularized by a Speaker Verification Model
Lay, Bunlong
Gerkmann, Timo
2022 IEEE 24TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2022,
[3] Speech Enhancement Using Modified Phase Opponency Model
Deshmukh, Om D.
Espy-Wilson, Carol Y.
INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 269 - +
[4] Speech enhancement using the modified phase-opponency model
Deshmukh, Om D.
Espy-Wilson, Carol Y.
Carney, Laurel H.
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (06): : 3886 - 3898
[5] A Modified Speech Enhancement Algorithm Based on the Subspace
Jia, Hairong
Zhang, Xueying
Jin, Chensheng
2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 3, 2009, : 344 - 347
[6] Speech Enhancement for Speaker Identification
Mahesh, R.
2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
[7] Personalized Speech Enhancement Without a Separate Speaker Embedding Model
Parnamaa, Tanel
Saabas, Ando
INTERSPEECH 2024, 2024, : 4863 - 4867
[8] Modified adaptive algorithm and its use for speech enhancement
Zhang, Ling-Hua
Deng, Li-Xin
Zheng, Bao-Yu
Nanjing Youdian Xueyuan Xuebao/Journal of Nanjing Institute of Posts and Telecommunications, 2005, 25 (02): : 44 - 47
[9] EXPLORING UNIVERSAL SPEECH ATTRIBUTES FOR SPEAKER VERIFICATION
Zhang, Sheng
Guo, Wu
Hu, Guoping
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5355 - 5359
[10] A New Speech Enhancement Algorithm with Generalized Gamma Speech Model
Zhao, Gaihua
Zhou, Bin
Zhang, Xiongwei
Sui Lu-ying
2012 INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP 2012), 2012,

← 1 2 3 4 5 →