Speech enhancement using modified IMCRA and OMLSA methods

被引:0
|
作者
Tien Dung Tran [1 ]
Quoc Cuong Nguyen [1 ]
Dang Khoa Nguyen [1 ]
机构
[1] Hanoi Univ Technol, Int Res Ctr MICA, Hanoi, Vietnam
关键词
speech enhancement; Mean-Square Error Log-Spectral Amplitude; Improved Minimal Controlled Recursive Averaging; SPECTRAL AMPLITUDE ESTIMATOR;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper we present a speech enhancement method in highly non-stationary noise environments based on modified Improved Minimal Controlled Recursive Averaging (IMCRA) method and Optimal Modified Minimum Mean-Square Error Log-Spectral Amplitude (OMLSA) method. The original OMLSA method, the spectral gain function, which minimizes the mean-square error of the log-spectral amplitude, is obtained as a weighted geometric mean of the hypothetical gain associated with the presence uncertainty. Whereas in IMCRA method, noise estimation is given by averaging past spectral value of noisy speech using a smoothing parameter that is adjusted by speech presence probability in frequency domain. A new method is proposed, in which the minimum spectral power value of noisy speech is adjusted by past speech presence probability. In addition, a noise estimation algorithm is proposed for highly non-stationary noise environment. The noise estimate is updated by averaging the noise spectral power estimate of IMCRA method with the past noise spectral power. Evaluations under various environment conditions, especially highly non-stationary noise environment, confirm that the modification of IMCRA and OMLSA method improved the speech quality.
引用
收藏
页码:195 / 200
页数:6
相关论文
共 50 条
  • [1] Microphone array speech enhancement based on optimized IMCRA
    Li, Qiuying
    Zhang, Tao
    Geng, Yanzhang
    Gao, Zhen
    NOISE CONTROL ENGINEERING JOURNAL, 2021, 69 (06) : 468 - 476
  • [2] Speech enhancement algorithm of improved OMLSA based on bilateral spectrogram filtering
    Wang, Jie
    Yan, Linhuang
    Tian, Jiayi
    Yuan, Minmin
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 39 (05) : 6881 - 6889
  • [3] Modified Magnitude Spectral Subtraction Methods for Speech Enhancement
    Naik, D. C.
    Murthy, A. Sreenivasa
    Nuthakki, Ramesh
    2017 INTERNATIONAL CONFERENCE ON ELECTRICAL, ELECTRONICS, COMMUNICATION, COMPUTER, AND OPTIMIZATION TECHNIQUES (ICEECCOT), 2017, : 274 - 279
  • [4] Using Deep Speech Recognition to Evaluate Speech Enhancement Methods
    Siddiqui, Shamoon
    Rasool, Ghulam
    Ramachandran, Ravi P.
    Bouaynaya, Nidhal C.
    2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2020,
  • [5] Esophageal Speech Enhancement using Modified Voicing Source
    Ishaq, Rizwan
    Zapirain, Begona Garcia
    2013 IEEE INTERNATIONAL SYMPOSIUM ON SIGNAL PROCESSING AND INFORMATION TECHNOLOGY (IEEE ISSPIT 2013), 2013, : 210 - 214
  • [6] Speech Enhancement Using Modified Magnitude and Phase Spectra
    Hossain, Sk. Imran
    Chowdhury, Md. Fahim Hossain
    Amin, Md. Faijul
    Murase, Kazuyuki
    2013 INTERNATIONAL CONFERENCE ON ELECTRICAL INFORMATION AND COMMUNICATION TECHNOLOGY (EICT), 2013,
  • [7] Speech Enhancement Using Modified Phase Opponency Model
    Deshmukh, Om D.
    Espy-Wilson, Carol Y.
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 269 - +
  • [8] Radar speech signal enhancement based on modified Compressed Sensing methods
    Wang, Yuanhao
    Yang, Qi
    Zeng, Yang
    Deng, Bin
    Wang, Hongqiang
    2020 13TH UK-EUROPE-CHINA WORKSHOP ON MILLIMETRE-WAVES AND TERAHERTZ TECHNOLOGIES (UCMMT2020), 2020,
  • [9] A Modified Speech Enhancement Algorithm Using A Universal Speaker Model
    Guo, Li
    Jiang, Wenbin
    Ying, Rendong
    Liu, Peilin
    2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 521 - 526
  • [10] Speech enhancement using the modified phase-opponency model
    Deshmukh, Om D.
    Espy-Wilson, Carol Y.
    Carney, Laurel H.
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2007, 121 (06): : 3886 - 3898