Combination of MVDR beamforming and single-channel spectral processing for enhancing noisy and reverberant speech

Cited by: 48
Authors
Cauchi, Benjamin [1, 3]
Kodrasi, Ina [2, 3]
Rehr, Robert [2, 3]
Gerlach, Stephan [1, 3]
Jukic, Ante [2, 3]
Gerkmann, Timo [2, 3]
Doclo, Simon [1, 2, 3]
Goetze, Stefan [1, 3]
Affiliations
[1] Fraunhofer IDMT Hearing Speech & Audio Technol, D-26129 Oldenburg, Germany
[2] Carl von Ossietzky Univ Oldenburg, Dept Med Phys & Acoust, D-26111 Oldenburg, Germany
[3] Cluster Excellence Hearing4all, Oldenburg, Germany
Source
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2015
Keywords
REVERB challenge; Dereverberation; Noise reduction; Beamforming; Spectral enhancement; ENHANCEMENT; QUALITY
DOI
10.1186/s13634-015-0242-x
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract
This paper presents a system for joint dereverberation and noise reduction that combines a beamformer with a single-channel spectral enhancement scheme. First, a minimum variance distortionless response (MVDR) beamformer with an online estimated noise coherence matrix is used to suppress noise and reverberation. The output of this beamformer is then processed by a single-channel spectral enhancement scheme, based on statistical room acoustics, minimum statistics, and temporal cepstrum smoothing, to suppress residual noise and reverberation. The evaluation is conducted using the REVERB challenge corpus, which is designed to evaluate speech enhancement algorithms in the presence of both reverberation and noise. The proposed system is evaluated using instrumental speech quality measures, the performance of an automatic speech recognition system, and a subjective evaluation of the speech quality based on a MUSHRA test. The performance achieved by beamforming, by single-channel spectral enhancement, and by their combination is compared, and experimental results show that the proposed system is effective in suppressing both reverberation and noise while improving the speech quality. The achieved improvements are particularly significant in conditions with high reverberation times.
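As a rough illustration of the first stage described in the abstract, the textbook MVDR weight formula for a single frequency bin, w = Γ⁻¹d / (dᴴΓ⁻¹d), can be sketched as follows. This is not the authors' implementation: the array geometry, the frequency, the diagonal loading, and the use of a theoretical diffuse-field coherence matrix (in place of the online estimate the paper describes) are all assumptions made for the example, and the function name `mvdr_weights` is hypothetical.

```python
import numpy as np

def mvdr_weights(noise_coherence, steering, diag_load=1e-6):
    """Textbook MVDR weights for one frequency bin:
    w = G^{-1} d / (d^H G^{-1} d)."""
    m = noise_coherence.shape[0]
    # Diagonal loading is an assumption added here for numerical stability.
    g = noise_coherence + diag_load * np.eye(m)
    g_inv_d = np.linalg.solve(g, steering)
    return g_inv_d / (steering.conj() @ g_inv_d)

# Hypothetical setup: 4-microphone uniform linear array, one bin at 1 kHz.
num_mics = 4
freq_hz, spacing_m, speed_of_sound = 1000.0, 0.05, 343.0
angle = np.deg2rad(30.0)  # assumed direction of arrival

# Far-field steering vector for the assumed geometry.
delays = np.arange(num_mics) * spacing_m * np.cos(angle) / speed_of_sound
d = np.exp(-2j * np.pi * freq_hz * delays)

# Assumed diffuse noise field: coherence sin(x)/x with x = 2*pi*f*dist/c.
# np.sinc(t) computes sin(pi*t)/(pi*t), hence the argument 2*f*dist/c.
idx = np.arange(num_mics)
dist = spacing_m * np.abs(np.subtract.outer(idx, idx))
gamma = np.sinc(2.0 * freq_hz * dist / speed_of_sound)

w = mvdr_weights(gamma, d)
response = w.conj() @ d  # gain toward the target direction
```

Under these assumptions `response` equals 1 up to rounding, which is the distortionless constraint: the target direction passes unchanged while the output noise power is minimized. In a full system the weights would be computed per frequency bin and applied to the multichannel STFT before the single-channel post-processing stage.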
Pages: 12