Single channel speech enhancement using iterative constrained NMF based adaptive wiener gain

被引:1
|
作者
Yechuri, Sivaramakrishna [1 ]
Vanambathina, Sunnydayal [1 ]
机构
[1] VIT AP Univ, SENSE, Amaravati, India
关键词
NMF; Adaptive wiener gain; Inverse nakagami; Erlang; Inverse gamma; Students-t probability density functions; SDR; PESQ; STOI; NONNEGATIVE MATRIX FACTORIZATION; ALGORITHMS; EXTRACTION; MACHINE; FILTER;
D O I
10.1007/s11042-023-16480-w
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a novel single channel speech enhancement algorithm using iterative constrained Non-negative matrix factorization (NMF) based adaptive Wiener gain for non-stationary noise. In the recent past, NMF-based Wiener filtering methods were used for speech enhancement. The Wiener filter performance depends on the adaptive gain factor value. The adaptive gain factor (alpha) value is constant regardless of noise type and signal to noise ratio (SNR), so it will affect speech enhancement performance. To overcome this, the adaptive factor value is calculated using a genetic algorithm (GA). Here, the GA adjusts the adaptive Wiener gain based on noise type and SNR level. The GA-based adaptive Wiener gain minimizes Wiener filter estimation errors and improves speech quality by adjusting the base vector weights of noise and speech. Additionally, we use the iterative constraints NMF (IC-NMF) method for calculating the priors from noisy speech magnitudes. We select the Erlang, Inverse Gamma, Students-t, and Inverse Nakagami distributions for speech priors and Gaussian distributions for noise priors. Noise and speech samples are well correlated with those distributions. This provides accurate estimation of the necessary statistics of these distributions to regularize the NMF criterion. So, we combine an iterative constrained NMF and a genetic algorithm-based adaptive Wiener filtering method for speech enhancement. The proposed method outperforms other benchmark algorithms in terms of source to distortion ratio (SDR), short-time objective intelligibility (STOI), and perceptual evaluation of speech quality (PESQ).
引用
收藏
页码:26233 / 26254
页数:22
相关论文
共 50 条
  • [41] SPEAKER AND NOISE INDEPENDENT ONLINE SINGLE-CHANNEL SPEECH ENHANCEMENT
    Germain, Francois G.
    Mysore, Gautham J.
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 71 - 75
  • [42] Speech Enhancement Control Design Algorithm for Dual-Microphone Systems Using β-NMF in a Complex Environment
    Wang, Dong-xia
    Jiang, Mao-song
    Niu, Fang-lin
    Cao, Yu-dong
    Zhou, Cheng-xu
    COMPLEXITY, 2018,
  • [43] Adaptive Wiener Gain to Improve Sound Quality on Nonnegative Matrix Factorization-Based Noise Reduction System
    Lai, Ying-Hui
    Wang, Syu-Siang
    Chen, Chien-Hsun
    Jhang, Sin-Hua
    IEEE ACCESS, 2019, 7 : 43286 - 43297
  • [44] Two-Stage Temporal Processing for Single-Channel Speech Enhancement
    Samui, Sunzan
    Chakrabarti, Indrajit
    Ghosh, Soumya Kanti
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3723 - 3727
  • [45] ROTATIONAL RESET STRATEGY FOR ONLINE SEMI-SUPERVISED NMF-BASED SPEECH ENHANCEMENT FOR LONG RECORDINGS
    Zhou, Jun
    Chen, Shuo
    Duan, Zhiyao
    2015 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS (WASPAA), 2015,
  • [46] Single-Channel Speech Separation Based on Deep Clustering with Local Optimization
    Fu, Taotao
    Yu, Ge
    Guo, Lili
    Wang, Yan
    Liang, Ji
    2017 3RD INTERNATIONAL CONFERENCE ON FRONTIERS OF SIGNAL PROCESSING (ICFSP), 2017, : 44 - 49
  • [47] Single Channel Source Separation Using Non-Gaussian NMF and Modified Hilbert Spectrum
    Sharafinezhad, Seyyed Reza
    Alizadeh, Habib
    Eshghi, Mohammad
    2014 22ND IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE), 2014, : 1673 - 1677
  • [48] Frequency Selection Based Separation of Speech Signals with Reduced Computational Time Using Sparse NMF
    Varshney, Yash Vardhan
    Abbasi, Zia Ahmad
    Abidi, Musiur Raza
    Farooq, Omar
    ARCHIVES OF ACOUSTICS, 2017, 42 (02) : 287 - 295
  • [49] An evaluation of the perceptual quality of phase-aware single-channel speech enhancement
    Krawczyk-Becker, Martin
    Gerkmann, Timo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2016, 140 (04) : EL364 - EL369
  • [50] NOISE ROBUST EXEMPLAR MATCHING WITH COUPLED DICTIONARIES FOR SINGLE-CHANNEL SPEECH ENHANCEMENT
    Yilmaz, Emre
    Baby, Deepak
    Van Hamme, Hugo
    2015 23RD EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2015, : 874 - 878