A Modified Speech Enhancement Algorithm Using A Universal Speaker Model

被引：0

作者：

Guo, Li ^{[1
]}

Jiang, Wenbin ^{[1
]}

Ying, Rendong ^{[1
]}

Liu, Peilin ^{[1
]}

机构：

[1] Shanghai Jiao Tong Univ, Sch elect informat & elect engn, Shanghai 200030, Peoples R China

来源：

2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP) | 2014年

关键词：

GMMS; IMCRA; MFCCs; speech enhancement; STATISTICAL-MODEL;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In this paper, we propose a statistical model-based speech enhancement algorithm using an improved minima controlled recursive averaging (IMCRA) noise estimation and a decision-directed (DD) priori SNR estimation. In the training stage, the Gaussian mixture model (GMM) of the Mel-frequency cepstral coefficients (MFCCs) of universal speaker is obtained. In speech enhancement stage, minima tracking process of IMCRA noise estimation is adjusted with the noisy power spectrum of current frame and an adjustment weighting factor. In addition, based on the universal GMM, some significant constant parameters are replaced by frequency-varying parameters, such as the weighting parameter in the DD priori SNR estimation and the adjustment weighting factor in the modified minima tracking process of IMCRA. The performance of proposed speech enhancement is evaluated by objective tests under various stationary and non-stationary noise environments. From experimental results, compared to the conventional approaches, the proposed scheme performs better and is suitable for being used as the pre-processing of speech processing systems.

引用

页码：521 / 526

页数：6

共 50 条

[21] SPEECH ENHANCEMENT USING ARCH MODEL
Atkins, Aviva
Cohen, Israel
2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
[22] VoiceID Loss: Speech Enhancement for Speaker Verification
Shon, Suwon
Tang, Hao
Glass, James
INTERSPEECH 2019, 2019, : 2888 - 2892
[23] First Investigation of Universal Speech Attributes for Speaker Verification
Zhang, Sheng
Guo, Wu
Hu, Guoping
2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
[24] Speaker Adaptive Model for Hindi Speech using Kaldi Speech Recognition toolkit
Upadhyaya, Prashant
Mittal, Sanjeev Kumar
Varshney, Yash Vardhan
Farooq, Omar
Abidi, Musiur Raza
PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2017, : 222 - 226
[25] Performance enhancement of speaker identification systems using speech encryption and cancelable features
Soliman N.F.
Mostfa Z.
El-Samie F.E.A.
Abdalla M.I.
Soliman, Naglaa F. (nagla_soliman@yahoo.com), 1600, Springer Science and Business Media, LLC (20): : 977 - 1004
[26] Speaker adaptation using codebook integrated deep neural networks for speech enhancement
Chidambar, B.
Naidu, D. Hanumanth Rao
JASA EXPRESS LETTERS, 2024, 4 (11):
[27] An Approach for Speech Enhancement in Low SNR Environments using Granular Speaker Embedding
Saha, Jayasree
Mukhopadhyay, Rudrabha
Agrawal, Aparna
Jain, Surabhi
Jawahar, C. V.
PROCEEDINGS OF 7TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA, CODS-COMAD 2024, 2024, : 325 - 331
[28] Speech enhancement using modified IMCRA and OMLSA methods
Tien Dung Tran
Quoc Cuong Nguyen
Dang Khoa Nguyen
2010 THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS (ICCE), 2010, : 195 - 200
[29] Esophageal Speech Enhancement using Modified Voicing Source
Ishaq, Rizwan
Zapirain, Begona Garcia
2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 210 - 214
[30] Speech Enhancement Using Modified Magnitude and Phase Spectra
Hossain, Sk. Imran
Chowdhury, Md. Fahim Hossain
Amin, Md. Faijul
Murase, Kazuyuki
2013 INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2013,

← 1 2 3 4 5 →