THEORETICAL ANALYSIS OF BIASED MMSE SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR AND ITS EXTENSION TO MUSICAL-NOISE-FREE SPEECH ENHANCEMENT

被引：0

作者：

Nakai, Shunsuke ^{[1
]}

Saruwatari, Hiroshi ^{[1
]}

Miyazaki, Ryoichi ^{[1
]}

Nakamura, Satoshi ^{[1
]}

Kondo, Kazunobu ^{[2
]}

机构：

[1] Nara Inst Sci & Technol, Nara 6300101, Japan

[2] Yamaha Corp, Shizuoka, Japan

来源：

2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA) | 2014年

关键词：

MMSE-STSA estimator; musical noise; musical-noise-free speech enhancement; higher-order statistics;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we provide a theoretical analysis of the minimum mean-square error short-time spectral amplitude (MMSE-STSA) estimator with the biased a priori SNR estimation and its extension to musical-noise-free speech enhancement. Recently, musical-noise-free speech enhancement has been proposed, where no musical noise is generated in iterative spectral subtraction. However, no existence of the musical-noise-free condition in the MMSE-STSA estimator has been reported. Therefore, in this paper, we show that the musical-noise-free condition exists in the biased MMSE-STSA estimator via the theoretical analysis. In addition, we perform comparative experiments and clarify the efficacy of the proposed musical-noise-free speech enhancement.

引用

页码：122 / 126

页数：5

共 18 条

[1] [Anonymous], 2007, Speech Enhancement: Theory and Practice
[2] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
BOLL, SF
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
[3] Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions
Breithaupt, Colin
Martin, Rainer
[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 277 - 289
[4] Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor
Cappe, Olivier
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02): : 345 - 349
[5] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR
EPHRAIM, Y
MALAH, D
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02): : 443 - 445
[6] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
EPHRAIM, Y
MALAH, D
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
[7] Postprocessing method for suppressing musical noise generated by spectral subtraction
Goh, Z
Tan, KC
Tan, BTG
[J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03): : 287 - 292
[8] Inoue Takayuki, 2010, Proceedings of the 2010 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), P220, DOI 10.1109/MLSP.2010.5589167
[9] Iterative Noise Power Subtraction Technique for Improved Speech Quality
Khan, M. Ryyan
Hasan, Taufiq
Khan, M. Rezwan
[J]. PROCEEDINGS OF ICECE 2008, VOLS 1 AND 2, 2008, : 391 - +
[10] Improved Voice Activity Detection Based on Iterative Spectral Subtraction and Double Thresholds for CVR
Li, Xiangbin
Li, Guo
Li, Xueren
[J]. 2008 WORKSHOP ON POWER ELECTRONICS AND INTELLIGENT TRANSPORTATION SYSTEM, PROCEEDINGS, 2008, : 153 - +

← 1 2 →