Single channel speech enhancement using iterative constrained NMF based adaptive wiener gain

被引:1
|
作者
Yechuri, Sivaramakrishna [1 ]
Vanambathina, Sunnydayal [1 ]
机构
[1] VIT AP Univ, SENSE, Amaravati, India
关键词
NMF; Adaptive wiener gain; Inverse nakagami; Erlang; Inverse gamma; Students-t probability density functions; SDR; PESQ; STOI; NONNEGATIVE MATRIX FACTORIZATION; ALGORITHMS; EXTRACTION; MACHINE; FILTER;
D O I
10.1007/s11042-023-16480-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a novel single channel speech enhancement algorithm using iterative constrained Non-negative matrix factorization (NMF) based adaptive Wiener gain for non-stationary noise. In the recent past, NMF-based Wiener filtering methods were used for speech enhancement. The Wiener filter performance depends on the adaptive gain factor value. The adaptive gain factor (alpha) value is constant regardless of noise type and signal to noise ratio (SNR), so it will affect speech enhancement performance. To overcome this, the adaptive factor value is calculated using a genetic algorithm (GA). Here, the GA adjusts the adaptive Wiener gain based on noise type and SNR level. The GA-based adaptive Wiener gain minimizes Wiener filter estimation errors and improves speech quality by adjusting the base vector weights of noise and speech. Additionally, we use the iterative constraints NMF (IC-NMF) method for calculating the priors from noisy speech magnitudes. We select the Erlang, Inverse Gamma, Students-t, and Inverse Nakagami distributions for speech priors and Gaussian distributions for noise priors. Noise and speech samples are well correlated with those distributions. This provides accurate estimation of the necessary statistics of these distributions to regularize the NMF criterion. So, we combine an iterative constrained NMF and a genetic algorithm-based adaptive Wiener filtering method for speech enhancement. The proposed method outperforms other benchmark algorithms in terms of source to distortion ratio (SDR), short-time objective intelligibility (STOI), and perceptual evaluation of speech quality (PESQ).
引用
收藏
页码:26233 / 26254
页数:22
相关论文
共 50 条
  • [21] Single channel source separation using graph sparse NMF and adaptive dictionary learning
    Pham, Tuan
    Lee, Yuan-Shan
    Lin, Yan-Bo
    Li, Yung-Hui
    Tai, Tzu-Chiang
    Wang, Jia-Ching
    INTELLIGENT DATA ANALYSIS, 2017, 21 : S5 - S19
  • [22] Speech Enhancement Using NMF based on Hierarchical Deep Neural Networks with Joint Learning
    Mirjalili, Mohammad Mahdi
    Seyedin, Sanaz
    2020 28TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2020, : 1411 - 1415
  • [23] Speech Enhancement Based on the Wiener Filter and Wavelet Entropy
    Jiao, Mingke
    Lou, Lin
    Geng, Xiliang
    Wang, Zhongming
    Zhang, Peng
    Liao, Xijiang
    Zhang, Wenyuan
    2015 12TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2015, : 1956 - 1960
  • [24] Discriminative Training of NMF Model Based on Class Probabilities for Speech Enhancement
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    IEEE SIGNAL PROCESSING LETTERS, 2016, 23 (04) : 502 - 506
  • [25] An adaptive autoregressive pre-whitener for speech and acoustic signals based on parametric NMF
    Jaramillo, Alfredo Esquivel
    Nielsen, Jesper Kjaer
    Christensen, Mads Graesboll
    SPEECH COMMUNICATION, 2023, 151 : 9 - 23
  • [26] Speech Enhancement Based on NMF Under Electric Vehicle Noise Condition
    Wang, Minghe
    Zhang, Erhua
    Tang, Zhenmin
    IEEE ACCESS, 2018, 6 : 9147 - 9159
  • [27] NOISE-ADAPTIVE DEEP NEURAL NETWORK FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Chung, Hanwook
    Kim, Taesup
    Plourde, Eric
    Champagne, Benoit
    2018 IEEE 28TH INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2018,
  • [28] Spectro-temporal post-enhancement using MMSE estimation in NMF based single-channel source separation
    Grais, Emad M.
    Erdogan, Hakan
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3278 - 3282
  • [29] DNN-Based Speech Enhancement via Integrating NMF and CASA
    Yan, Bofang
    Bao, Changchun
    Bai, Zhigang
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 435 - 439
  • [30] Speech Enhancement Based on Codebook Constrained Nonnegative Matrix Factorization
    Bai, Zhigang
    Bao, Changchun
    Yan, Bofang
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 361 - 365