MMSE STSA ESTIMATOR WITH NONSTATIONARY NOISE ESTIMATION BASED ON ICA FOR HIGH-QUALITY SPEECH ENHANCEMENT

被引:3
作者
Okamoto, Ryoi [1 ]
Takahashi, Yu [1 ]
Saruwatari, Hiroshi [1 ]
Shikano, Kiyohiro [1 ]
机构
[1] Nara Inst Sci & Technol, Grad Sch Informat Sci, Ikoma, Nara 6300192, Japan
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
Blind source separation; microphone array signal processing; independent component analysis; spectral subtraction; MMSE STSA estimator; INDEPENDENT COMPONENT ANALYSIS; SUBTRACTION;
D O I
10.1109/ICASSP.2010.5495162
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a new blind speech extraction method consisting of a minimum mean-square error short-time spectral amplitude (MMSE STSA) estimator and noise estimation based on independent component analysis (ICA). First, we perform a computer simulation using the artificial noise whose stationarity could be controlled parametrically, and the obtained results indicate that the proposed method is superior to conventional methods, such as blind spatial subtraction array (BSSA) and the original MMSE STSA estimator under the non-point-source and nonstationary noise condition. Finally, we conduct an experiment in an actual railway-station environment, and objective and subjective evaluations to confirm the advantage of the proposed method in the real world.
引用
收藏
页码:4778 / 4781
页数:4
相关论文
共 9 条
  • [1] SUPPRESSION OF ACOUSTIC NOISE IN SPEECH USING SPECTRAL SUBTRACTION
    BOLL, SF
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1979, 27 (02): : 113 - 120
  • [2] BRANDSTEIN, 2001, MICROPHONE ARRAYS SI
  • [3] INDEPENDENT COMPONENT ANALYSIS, A NEW CONCEPT
    COMON, P
    [J]. SIGNAL PROCESSING, 1994, 36 (03) : 287 - 314
  • [4] SPEECH ENHANCEMENT USING A MINIMUM MEAN-SQUARE ERROR SHORT-TIME SPECTRAL AMPLITUDE ESTIMATOR
    EPHRAIM, Y
    MALAH, D
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (06): : 1109 - 1121
  • [5] MURATA N, 1998, NOLTA, P923
  • [6] Rabiner L. R., 1993, Fundamentals of Speech Recognition
  • [7] Blind source separation combining independent component analysis and beamforming
    Saruwatari, H
    Kurita, S
    Takeda, K
    Itakura, F
    Nishikawa, T
    Shikano, K
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2003, 2003 (11) : 1135 - 1146
  • [8] Takahashi Y., 2008, JOINT WORKSH HANDS F, P164
  • [9] Blind Spatial Subtraction Array for Speech Enhancement in Noisy Environment
    Takahashi, Yu
    Takatani, Tomoya
    Osako, Keiichi
    Saruwatari, Hiroshi
    Shikano, Kiyohiro
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 650 - 664