Multiframe maximum a posteriori estimators for single-microphone speech enhancement

被引：2

作者：

Ranjbaryan, Raziyeh ^{[1
]}

Abutalebi, Hamid Reza ^{[1
]}

机构：

[1] Yazd Univ, Elect Engn Dept, Yazd, Iran

来源：

IET SIGNAL PROCESSING | 2021年 / 15卷 / 07期

基金：

美国国家科学基金会;

关键词：

SPECTRAL AMPLITUDE ESTIMATION; NOISE-REDUCTION; PHASE;

D O I：

10.1049/sil2.12045

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Multiframe maximum a posteriori (MAP) estimators are applied to a single-microphone noise reduction problem. Several attempts have been made to exploit the interframe correlation (IFC) between speech coefficients in the short-time Fourier transform domain. In a noise-reduction algorithm, all available information of recorded signals should be optimally utilized in the estimation process. Single-microphone multiframe minimum variance distortion-less response and single-microphone multiframe Wiener filters (MFWFs) have been presented in this approach. Incorporating the concept of IFC in the MAP estimator leads to multiframe MAP estimators in a single-microphone case. In each time-frequency unit, the current and a finite number of past noisy signals are utilized to develop the estimators. A complex factor is adopted to model the IFC between speech signals, which allows the application of multiframe MAP estimators. The noise reduction performance is compared for the proposed estimators with the joint MAP estimator (which ignores the correlation between successive frames) and benchmark MFWFs and speech-distortion weighted interframe Wiener filters for different input noise types. These evaluations verify that the proposed methods exhibit good performance.

引用

页码：467 / 481

页数：15

共 30 条

[1] Simultaneous detection and estimation approach for speech enhancement [J].

Abramson, Ari ;

Cohen, Israel .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08) :2348-2359

[2] Robust Speech-Distortion Weighted Interframe Wiener Filters for Single-Channel Noise Reduction [J].

Andersen, Kristian Timm ;

Moonen, Marc .

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (01) :97-107

[3]

Benesty J, 2008, SPRINGER TOP SIGN PR, V1, P1

[4] Noise spectrum estimation in adverse environments: Improved minima controlled recursive averaging [J].

Cohen, I .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (05) :466-475

[5] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02) :443-445

[6] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR [J].

EPHRAIM, Y ;

MALAH, D .

IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06) :1109-1121

[7]

Fischer D, 2017, EUR SIGNAL PR CONF, P603, DOI 10.23919/EUSIPCO.2017.8081278

[8]

Gaich A, 2015, 16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, P2553

[9]

Garofolo J.S., 1993, GETTING STARTED DARP, DOI [10.6028/nist.ir.4930, DOI 10.6028/NIST.IR.4930]

[10]

Gemmeke JF, 2017, INT CONF ACOUST SPEE, P776, DOI 10.1109/ICASSP.2017.7952261

← 1 2 3 →