Close speaker cancellation for suppression of non-stationary background noise for hands-free speech interface

Cited by: 0
Authors
Even, Jani
Ishi, Carlos
Saruwatari, Hiroshi
Hagita, Norihiro
Affiliations
Source
11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2 | 2010
Keywords
speech enhancement; hands-free speech interface; noise cancellation; SUBTRACTION; ENHANCEMENT; ENVIRONMENT; SIGNALS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents a noise cancellation method based on the ability to efficiently cancel the contribution of a close target speaker from the signals observed at a microphone array. The proposed method exploits this property in the hands-free speech interface setting, where the target user is close to the microphone array and the noise is a diffuse background noise. In particular, the method is able to deal with non-stationary noise. It can be divided into three steps. First, the steering vector pointing at the target user is estimated from the covariance of the observed signals. Then the noise estimate is obtained by canceling the user's contribution; the speech pauses are also estimated during this step. Finally, a post-filter suppresses the estimated noise from the observed signals. The post-filter strength is controlled by using the noise estimated during the speech pauses as a reference. A 20k-word dictation task in the presence of non-stationary diffuse background noise at different SNR levels illustrates the effectiveness of the proposed method.
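The three steps named in the abstract can be illustrated with a minimal NumPy sketch for a single frequency bin. This is not the authors' implementation: the abstract does not give the estimators, so the sketch assumes the steering vector is the principal eigenvector of the sample covariance, the user's contribution is canceled by an orthogonal projection, and the post-filter is a spectral-subtraction-style gain. The function names, the fixed `strength` and `floor` parameters, and the choice of microphone 0 as reference are all hypothetical; speech-pause estimation and the pause-based control of the post-filter strength are omitted.

```python
import numpy as np

def estimate_steering_vector(X):
    """X: (n_mics, n_frames) complex STFT coefficients of one frequency bin.
    Assumption: the close speaker dominates the covariance, so the principal
    eigenvector is taken as the steering vector (unit norm)."""
    R = X @ X.conj().T / X.shape[1]        # sample spatial covariance (Hermitian)
    _, eigvecs = np.linalg.eigh(R)         # eigenvalues in ascending order
    return eigvecs[:, -1]

def cancel_target(X, a):
    """Noise estimate: remove the component of X along the unit-norm vector a."""
    P = np.eye(len(a)) - np.outer(a, a.conj())   # projector orthogonal to a
    return P @ X

def post_filter(X, N, strength=1.0, floor=0.1):
    """Spectral-subtraction-style gain applied to microphone 0 (assumed reference).
    `strength` is fixed here; the paper adapts it using speech pauses."""
    noise_pow = np.mean(np.abs(N) ** 2, axis=0)  # per-frame noise power estimate
    obs_pow = np.abs(X[0]) ** 2 + 1e-12          # avoid division by zero
    gain = np.sqrt(np.maximum(1.0 - strength * noise_pow / obs_pow, floor ** 2))
    return gain * X[0]
```

Applied in sequence to a matrix `X` of observed coefficients, `a = estimate_steering_vector(X)`, `N = cancel_target(X, a)`, and `Y = post_filter(X, N)` give an enhanced reference-channel signal under the assumptions stated above.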
Pages: 977-980
Number of pages: 4