SDW-SWF: Speech Distortion Weighted Single-Channel Wiener Filter for Noise Reduction

Times cited: 2
Authors
Zhang, Jie [1]
Tao, Rui [1]
Du, Jun [1]
Dai, Li-Rong [1]
Affiliations
[1] Univ Sci & Technol China USTC, Dept Elect Engn & Informat Sci, Hefei 230026, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Speech enhancement; speech distortion; mean-square error; GEVD; low-rank approximation; Wiener filter; ENHANCEMENT;
DOI
10.1109/TASLP.2023.3304474
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
Speech enhancement is essential in many audio applications, particularly in noisy environments where speech quality must be improved. In this work, we consider the single-channel noise reduction (NR) problem from the conventional signal processing perspective. Because conventional single-channel NR filters suffer from serious speech distortion (SD), we propose an SD-weighted single-channel Wiener filter (SDW-SWF) in the short-time Fourier transform (STFT) domain, obtained by minimizing the mean-square error (MSE) of the clean speech plus a mu-weighted residual noise variance. Based on the generalized eigenvalue decomposition (GEVD) and a rank-r approximation of the speech correlation matrix, the SDW-SWF can be written as a linear combination of eigenpairs, and special cases of it reduce to existing single-channel NR filters. The proposed SDW-SWF thus has two parameters (mu and r) that trade off the MSE against the SD. We then theoretically analyze the impact of these tradeoff parameters on NR performance in terms of SD, residual noise variance, and output signal-to-noise ratio (SNR). In addition, we show that the STFT-domain SDW-SWF can be extended to the time domain, where the derived theorems still hold. Numerical results from several perspectives validate the effectiveness of the proposed method.
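As a rough illustration of how a GEVD-based, rank-r, mu-weighted Wiener filter can be formed as a combination of eigenpairs, the sketch below uses the textbook speech-distortion-weighted form h = (Phi_x + mu * Phi_v)^(-1) Phi_x e_ref. This is an assumption made for illustration and not necessarily the exact expression derived in the paper. The function name sdw_filter_gevd, the toy correlation matrices, and the reference index ref are all hypothetical; the vectors are assumed to stack a few consecutive STFT frames of one channel.

```python
import numpy as np

def sdw_filter_gevd(phi_x, phi_v, mu=1.0, r=None, ref=0):
    """Illustrative rank-r SDW Wiener filter built from GEVD eigenpairs.

    phi_x: speech correlation matrix (Hermitian, PSD)
    phi_v: noise correlation matrix (Hermitian, positive definite)
    mu:    tradeoff weight on the residual noise (assumed > 0)
    r:     number of dominant eigenpairs kept (defaults to all)
    """
    n = phi_x.shape[0]
    r = n if r is None else r
    # GEVD via Cholesky whitening: phi_v = L L^H
    L = np.linalg.cholesky(phi_v)
    L_inv = np.linalg.inv(L)
    C = L_inv @ phi_x @ L_inv.conj().T
    C = 0.5 * (C + C.conj().T)          # enforce Hermitian symmetry numerically
    lam, U = np.linalg.eigh(C)          # eigenvalues in ascending order
    order = np.argsort(lam)[::-1]       # sort descending
    lam, U = lam[order], U[:, order]
    # Generalized eigenvectors: B^H phi_v B = I and B^H phi_x B = diag(lam)
    B = L_inv.conj().T @ U
    B_r = B[:, :r]
    gains = lam[:r] / (lam[:r] + mu)
    # h = sum_i gains_i * b_i * (b_i^H phi_v e_ref)
    h = B_r @ (gains * (B_r.conj().T @ phi_v[:, ref]))
    return h, lam

# Toy data (hypothetical): rank-2 speech correlation, full-rank noise correlation
rng = np.random.default_rng(0)
n = 4
A = rng.standard_normal((n, 2)) + 1j * rng.standard_normal((n, 2))
phi_x = A @ A.conj().T
N = rng.standard_normal((n, n)) + 1j * rng.standard_normal((n, n))
phi_v = N @ N.conj().T + n * np.eye(n)

h_full, lam = sdw_filter_gevd(phi_x, phi_v, mu=2.0)
h_rank2, _ = sdw_filter_gevd(phi_x, phi_v, mu=2.0, r=2)
h_direct = np.linalg.solve(phi_x + 2.0 * phi_v, phi_x[:, 0])
```

With all eigenpairs kept, the eigenpair form coincides with the direct solve. Because the toy speech matrix has rank 2, truncating to r = 2 leaves the filter unchanged here. In general a larger mu or a smaller r suppresses more noise at the cost of more speech distortion.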
Pages: 3176-3189
Number of pages: 14
Related papers
44 in total
[1]  
Adler A, 2018, 2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE)
[2]   Robust Speech-Distortion Weighted Interframe Wiener Filters for Single-Channel Noise Reduction [J].
Andersen, Kristian Timm ;
Moonen, Marc .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (01) :97-107
[3]  
[Anonymous], 2005, Speech Enhancement
[4]  
Benesty J, 2009, SPRINGER TOP SIGN PR, V2, P1, DOI 10.1007/978-3-642-00296-0_1
[5]  
Benesty J, 2008, SPRINGER TOP SIGN PR, V1, P1
[6]   GSVD-based optimal filtering for single and multimicrophone speech enhancement [J].
Doclo, S ;
Moonen, M .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2002, 50 (09) :2230-2244
[7]  
Doclo S., 2014, SIGNALS COMMUNICATIO
[8]   Frequency-domain criterion for the speech distortion weighted multichannel Wiener filter for robust noise reduction [J].
Doclo, Simon ;
Spriet, Ann ;
Wouters, Jan ;
Moonen, Marc .
SPEECH COMMUNICATION, 2007, 49 (7-8) :636-656
[9]   SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].
EPHRAIM, Y ;
MALAH, D .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445
[10]   Robust Constrained MFMVDR Filters for Single-Channel Speech Enhancement Based on Spherical Uncertainty Set [J].
Fischer, Dorte ;
Doclo, Simon .
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 29 :618-631