A Modified Speech Enhancement Algorithm Using A Universal Speaker Model

Cited by: 0
Authors
Guo, Li [1]
Jiang, Wenbin [1]
Ying, Rendong [1]
Liu, Peilin [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Sch Elect Informat & Elect Engn, Shanghai 200030, Peoples R China
Keywords
GMMs; IMCRA; MFCCs; speech enhancement; statistical model
DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
In this paper, we propose a statistical model-based speech enhancement algorithm that uses improved minima controlled recursive averaging (IMCRA) noise estimation and decision-directed (DD) a priori SNR estimation. In the training stage, a Gaussian mixture model (GMM) of the Mel-frequency cepstral coefficients (MFCCs) of a universal speaker is obtained. In the speech enhancement stage, the minima tracking process of the IMCRA noise estimation is adjusted using the noisy power spectrum of the current frame and an adjustment weighting factor. In addition, based on the universal GMM, some significant constant parameters are replaced by frequency-varying parameters, such as the weighting parameter in the DD a priori SNR estimation and the adjustment weighting factor in the modified minima tracking process of IMCRA. The performance of the proposed speech enhancement algorithm is evaluated by objective tests under various stationary and non-stationary noise environments. The experimental results show that the proposed scheme performs better than the conventional approaches and is suitable for use as a pre-processing stage in speech processing systems.
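The decision-directed estimator mentioned in the abstract can be illustrated with a minimal sketch. It assumes the textbook DD form, in which the a priori SNR is a weighted sum of the previous frame's clean-speech estimate and the current instantaneous SNR, and it allows one weight per frequency bin to mirror the frequency-varying parameter the abstract describes. The function names, the per-bin `alpha` list, and the Wiener gain used to close the loop are assumptions made for illustration; the abstract does not state how the paper derives the weights from the GMM or which gain function it uses.

```python
def dd_a_priori_snr(noisy_power, noise_power, prev_clean_power, alpha):
    """One frame of decision-directed a priori SNR estimation.

    All arguments are per-frequency-bin lists of equal length.
    alpha holds one weight per bin (hypothetical; the conventional
    estimator uses a single constant for every bin).
    """
    xi = []
    for y2, d2, s2, a in zip(noisy_power, noise_power, prev_clean_power, alpha):
        gamma = y2 / d2                   # a posteriori SNR of the current frame
        ml_term = max(gamma - 1.0, 0.0)   # instantaneous SNR, floored at zero
        xi.append(a * s2 / d2 + (1.0 - a) * ml_term)
    return xi


def wiener_gain(xi):
    """Wiener gain, used here only as a simple stand-in gain function."""
    return [x / (1.0 + x) for x in xi]


def enhance_frame(noisy_power, noise_power, prev_clean_power, alpha):
    """Return (xi, gain, clean_power) for one frame."""
    xi = dd_a_priori_snr(noisy_power, noise_power, prev_clean_power, alpha)
    gain = wiener_gain(xi)
    # Clean-speech power estimate, fed back as prev_clean_power next frame.
    clean_power = [g * g * y2 for g, y2 in zip(gain, noisy_power)]
    return xi, gain, clean_power


if __name__ == "__main__":
    xi, gain, clean = enhance_frame(
        noisy_power=[4.0, 1.0],
        noise_power=[1.0, 1.0],
        prev_clean_power=[2.0, 0.0],
        alpha=[0.5, 0.9],
    )
    print(xi, gain, clean)
```

A weight close to 1 leans on the previous frame and gives a smoother estimate, while a smaller weight follows the current frame more closely, which is why letting the weight vary per bin can matter.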
Pages: 521-526
Page count: 6
Related papers
50 in total
  • [21] SPEECH ENHANCEMENT USING ARCH MODEL
    Atkins, Aviva
    Cohen, Israel
    2016 IEEE INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), 2016,
  • [22] VoiceID Loss: Speech Enhancement for Speaker Verification
    Shon, Suwon
    Tang, Hao
    Glass, James
    INTERSPEECH 2019, 2019, : 2888 - 2892
  • [23] First Investigation of Universal Speech Attributes for Speaker Verification
    Zhang, Sheng
    Guo, Wu
    Hu, Guoping
    2016 10TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2016,
  • [24] Speaker Adaptive Model for Hindi Speech using Kaldi Speech Recognition toolkit
    Upadhyaya, Prashant
    Mittal, Sanjeev Kumar
    Varshney, Yash Vardhan
    Farooq, Omar
    Abidi, Musiur Raza
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2017, : 222 - 226
  • [25] Performance enhancement of speaker identification systems using speech encryption and cancelable features
    Soliman N.F.
    Mostfa Z.
    El-Samie F.E.A.
    Abdalla M.I.
    Soliman, Naglaa F. (nagla_soliman@yahoo.com), 1600, Springer Science and Business Media, LLC (20): : 977 - 1004
  • [26] Speaker adaptation using codebook integrated deep neural networks for speech enhancement
    Chidambar, B.
    Naidu, D. Hanumanth Rao
    JASA EXPRESS LETTERS, 2024, 4 (11):
  • [27] An Approach for Speech Enhancement in Low SNR Environments using Granular Speaker Embedding
    Saha, Jayasree
    Mukhopadhyay, Rudrabha
    Agrawal, Aparna
    Jain, Surabhi
    Jawahar, C. V.
    PROCEEDINGS OF 7TH JOINT INTERNATIONAL CONFERENCE ON DATA SCIENCE AND MANAGEMENT OF DATA, CODS-COMAD 2024, 2024, : 325 - 331
  • [28] Speech enhancement using modified IMCRA and OMLSA methods
    Tien Dung Tran
    Quoc Cuong Nguyen
    Dang Khoa Nguyen
    2010 THIRD INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND ELECTRONICS (ICCE), 2010, : 195 - 200
  • [29] Esophageal Speech Enhancement using Modified Voicing Source
    Ishaq, Rizwan
    Zapirain, Begona Garcia
    2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 210 - 214
  • [30] Speech Enhancement Using Modified Magnitude and Phase Spectra
    Hossain, Sk. Imran
    Chowdhury, Md. Fahim Hossain
    Amin, Md. Faijul
    Murase, Kazuyuki
    2013 INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2013,