Two-Microphone Binary Mask Speech Enhancement in Diffuse and Directional Noise Fields

Authors
Abdipour, Roohollah [1]
Akbari, Ahmad [1]
Rahmani, Mohsen [2]
Affiliations
[1] Iran Univ Sci & Technol, Sch Comp Engn, Tehran, Iran
[2] Arak Univ, Fac Engn, Dept Comp Engn, Arak, Iran
Keywords
Two-microphone speech enhancement; source separation; binary mask; diffuse noise; directional noise; CROSS-PSD ESTIMATION; SOURCE SEPARATION; SOUND FIELDS; INTELLIGIBILITY; COHERENCE; LOCALIZATION; STATISTICS
DOI
10.4218/etrij.14.0113.0917
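Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract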
Two-microphone binary mask speech enhancement (2mBMSE) has been of particular interest in recent literature and has shown promising results. Current 2mBMSE systems rely on spatial cues of speech and noise sources. Although these cues are helpful for directional noise sources, they lose their efficiency in diffuse noise fields. We propose a new system that is effective in both directional and diffuse noise conditions. The system exploits two features. The first determines whether a given time-frequency (T-F) unit of the input spectrum is dominated by a diffuse or directional source. A diffuse signal is certainly a noise signal, but a directional signal could correspond to a noise or speech source. The second feature discriminates between T-F units dominated by speech or directional noise signals. Speech enhancement is performed using a binary mask, calculated based on the proposed features. In both directional and diffuse noise fields, the proposed system segregates speech T-F units with hit rates above 85%. It outperforms previous solutions in terms of signal-to-noise ratio and perceptual evaluation of speech quality improvement, especially in diffuse noise conditions.
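The two-stage decision described in the abstract can be illustrated with a minimal sketch. The abstract does not name the two features, so this example substitutes common stand-ins as assumptions: magnitude-squared coherence between the microphones for the diffuse/directional decision, and the inter-microphone phase of the cross-spectrum for the speech/directional-noise decision, with the target speaker assumed to sit at broadside (zero inter-microphone delay). All function names, thresholds, and the smoothing constant are hypothetical and are not taken from the paper.

```python
import numpy as np

def stft(x, frame=256, hop=128):
    """Hann-windowed short-time spectrum, shape (n_frames, frame // 2 + 1)."""
    window = np.hanning(frame)
    n_frames = 1 + (len(x) - frame) // hop
    frames = np.stack([x[i * hop:i * hop + frame] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def smooth(a, alpha=0.8):
    """First-order recursive averaging along the time (frame) axis."""
    out = np.empty_like(a)
    acc = a[0]
    for t in range(a.shape[0]):
        acc = alpha * acc + (1.0 - alpha) * a[t]
        out[t] = acc
    return out

def binary_mask(X1, X2, coh_thresh=0.7, phase_thresh=0.5, alpha=0.8):
    """Keep T-F units that look directional AND arrive from broadside.

    Assumed stand-in features (not the paper's):
      1. magnitude-squared coherence > coh_thresh  -> directional source
      2. |cross-spectrum phase| < phase_thresh rad -> target direction
    """
    p11 = smooth(np.abs(X1) ** 2, alpha)
    p22 = smooth(np.abs(X2) ** 2, alpha)
    p12 = smooth(X1 * np.conj(X2), alpha)
    msc = np.abs(p12) ** 2 / (p11 * p22 + 1e-12)
    directional = msc > coh_thresh                 # feature 1
    frontal = np.abs(np.angle(p12)) < phase_thresh  # feature 2
    return directional & frontal

def hit_rate(estimated, ideal):
    """Fraction of speech-dominated units (ideal mask) that were kept."""
    estimated = np.asarray(estimated, dtype=bool)
    ideal = np.asarray(ideal, dtype=bool)
    return np.count_nonzero(estimated & ideal) / max(np.count_nonzero(ideal), 1)

# Toy check: a 1 kHz tone at 8 kHz sampling falls exactly on bin 32.
n = np.arange(2048)
tone = np.sin(2 * np.pi * 1000 * n / 8000)
mask_same = binary_mask(stft(tone), stft(tone))    # identical at both mics
mask_flip = binary_mask(stft(tone), stft(-tone))   # phase-inverted at mic 2
```

In this toy check the tone that is identical at both microphones is retained at its frequency bin, while the phase-inverted copy is rejected by the second feature. Enhancement would then multiply the mask with one microphone's spectrum before resynthesis; that step, and any threshold tuning, is left out of the sketch.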
Pages: 772-782
Number of pages: 11