THEORETICAL ANALYSIS OF BIASED MMSE SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR AND ITS EXTENSION TO MUSICAL-NOISE-FREE SPEECH ENHANCEMENT

被引:0
作者
Nakai, Shunsuke [1 ]
Saruwatari, Hiroshi [1 ]
Miyazaki, Ryoichi [1 ]
Nakamura, Satoshi [1 ]
Kondo, Kazunobu [2 ]
机构
[1] Nara Inst Sci & Technol, Nara 6300101, Japan
[2] Yamaha Corp, Shizuoka, Japan
来源
2014 4TH JOINT WORKSHOP ON HANDS-FREE SPEECH COMMUNICATION AND MICROPHONE ARRAYS (HSCMA) | 2014年
关键词
MMSE-STSA estimator; musical noise; musical-noise-free speech enhancement; higher-order statistics;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we provide a theoretical analysis of the minimum mean-square error short-time spectral amplitude (MMSE-STSA) estimator with the biased a priori SNR estimation and its extension to musical-noise-free speech enhancement. Recently, musical-noise-free speech enhancement has been proposed, where no musical noise is generated in iterative spectral subtraction. However, no existence of the musical-noise-free condition in the MMSE-STSA estimator has been reported. Therefore, in this paper, we show that the musical-noise-free condition exists in the biased MMSE-STSA estimator via the theoretical analysis. In addition, we perform comparative experiments and clarify the efficacy of the proposed musical-noise-free speech enhancement.
引用
收藏
页码:122 / 126
页数:5
相关论文
共 18 条
  • [1] [Anonymous], 2007, Speech Enhancement: Theory and Practice
  • [2] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [3] Analysis of the Decision-Directed SNR Estimator for Speech Enhancement With Respect to Low-SNR and Transient Conditions
    Breithaupt, Colin
    Martin, Rainer
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (02): : 277 - 289
  • [4] Elimination of the Musical Noise Phenomenon with the Ephraim and Malah Noise Suppressor
    Cappe, Olivier
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (02): : 345 - 349
  • [5] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR LOG-SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1985, 33 (02): : 443 - 445
  • [6] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
  • [7] Postprocessing method for suppressing musical noise generated by spectral subtraction
    Goh, Z
    Tan, KC
    Tan, BTG
    [J]. IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1998, 6 (03): : 287 - 292
  • [8] Inoue Takayuki, 2010, Proceedings of the 2010 IEEE International Workshop on Machine Learning for Signal Processing (MLSP), P220, DOI 10.1109/MLSP.2010.5589167
  • [9] Iterative Noise Power Subtraction Technique for Improved Speech Quality
    Khan, M. Ryyan
    Hasan, Taufiq
    Khan, M. Rezwan
    [J]. PROCEEDINGS OF ICECE 2008, VOLS 1 AND 2, 2008, : 391 - +
  • [10] Improved Voice Activity Detection Based on Iterative Spectral Subtraction and Double Thresholds for CVR
    Li, Xiangbin
    Li, Guo
    Li, Xueren
    [J]. 2008 WORKSHOP ON POWER ELECTRONICS AND INTELLIGENT TRANSPORTATION SYSTEM, PROCEEDINGS, 2008, : 153 - +