Single channel speech enhancement using iterative constrained NMF based adaptive wiener gain

被引:1
|
作者
Yechuri, Sivaramakrishna [1 ]
Vanambathina, Sunnydayal [1 ]
机构
[1] VIT AP Univ, SENSE, Amaravati, India
关键词
NMF; Adaptive wiener gain; Inverse nakagami; Erlang; Inverse gamma; Students-t probability density functions; SDR; PESQ; STOI; NONNEGATIVE MATRIX FACTORIZATION; ALGORITHMS; EXTRACTION; MACHINE; FILTER;
D O I
10.1007/s11042-023-16480-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a novel single channel speech enhancement algorithm using iterative constrained Non-negative matrix factorization (NMF) based adaptive Wiener gain for non-stationary noise. In the recent past, NMF-based Wiener filtering methods were used for speech enhancement. The Wiener filter performance depends on the adaptive gain factor value. The adaptive gain factor (alpha) value is constant regardless of noise type and signal to noise ratio (SNR), so it will affect speech enhancement performance. To overcome this, the adaptive factor value is calculated using a genetic algorithm (GA). Here, the GA adjusts the adaptive Wiener gain based on noise type and SNR level. The GA-based adaptive Wiener gain minimizes Wiener filter estimation errors and improves speech quality by adjusting the base vector weights of noise and speech. Additionally, we use the iterative constraints NMF (IC-NMF) method for calculating the priors from noisy speech magnitudes. We select the Erlang, Inverse Gamma, Students-t, and Inverse Nakagami distributions for speech priors and Gaussian distributions for noise priors. Noise and speech samples are well correlated with those distributions. This provides accurate estimation of the necessary statistics of these distributions to regularize the NMF criterion. So, we combine an iterative constrained NMF and a genetic algorithm-based adaptive Wiener filtering method for speech enhancement. The proposed method outperforms other benchmark algorithms in terms of source to distortion ratio (SDR), short-time objective intelligibility (STOI), and perceptual evaluation of speech quality (PESQ).
引用
收藏
页码:26233 / 26254
页数:22
相关论文
共 50 条
  • [31] Supervised speech enhancement using online Group-Sparse Convolutive NMF
    Yousefi, Midia
    Savoji, Mohammad Hassan
    2016 8TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2016, : 494 - 499
  • [32] A SOURCE/FILTER MODEL WITH ADAPTIVE CONSTRAINTS FOR NMF-BASED SPEECH SEPARATION
    Bouvier, Damien
    Obin, Nicolas
    Liuni, Marco
    Roebel, Axel
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 131 - 135
  • [33] An NMF-HMM Speech Enhancement Method based on Kullback-Leibler Divergence
    Xiang, Yang
    Shi, Liming
    Hojvang, Jesper Lisby
    Rasmussen, Morten Hojfeldt
    Christensen, Mads Graesboll
    INTERSPEECH 2020, 2020, : 2667 - 2671
  • [34] A NEW COST FUNCTION FOR DNN-BASED SPEECH ENHANCEMENT COMBINING NMF AND CASA
    Yan, Bofang
    Bao, Changchun
    Bai, Zhigang
    PROCEEDINGS OF 2018 14TH IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2018, : 255 - 259
  • [35] Environment-adaptive speech enhancement for bilateral cochlear implants using a single processor
    Mirzahasanloo, Taher S.
    Kehtarnavaz, Nasser
    Gopalakrishna, Vanishree
    Loizou, Philipos C.
    SPEECH COMMUNICATION, 2013, 55 (04) : 523 - 534
  • [36] REGULARIZED NMF-BASED SPEECH ENHANCEMENT WITH SPECTRAL COMPONENTS MODELED BY GAUSSIAN MIXTURES
    Chung, Hanwook
    Plourde, Eric
    Champagne, Benoit
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [37] A review of supervised learning algorithms for single channel speech enhancement
    Saleem, Nasir
    Khattak, Muhammad Irfan
    INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2019, 22 (04) : 1051 - 1075
  • [38] UNSUPERVISED MONAURAL SPEECH ENHANCEMENT USING ROBUST NMF WITH LOW-RANK AND SPARSE CONSTRAINTS
    Li, Yinan
    Zhang, Xiongwei
    Sun, Meng
    Min, Gang
    2015 IEEE CHINA SUMMIT & INTERNATIONAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING, 2015, : 1 - 4
  • [39] Improved Semi-Supervised NMF Based Real-Time Capable Speech Enhancement
    Hu, Yonggang
    Zhang, Xiongwei
    Zou, Xia
    Sun, Meng
    Min, Gang
    Li, Yinan
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2016, E99A (01) : 402 - 406
  • [40] A NEW LINEAR MMSE FILTER FOR SINGLE CHANNEL SPEECH ENHANCEMENT BASED ON NONNEGATIVE MATRIX FACTORIZATION
    Mohammadiha, Nasser
    Gerkmann, Timo
    Leijon, Arne
    2011 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2011, : 45 - 48