Speech Enhancement With Robust Beamforming for Spatially Overlapped and Distributed Sources

被引:5
|
作者
Xiong Wenmeng [1 ]
Bao Changchun [1 ]
Jia Maoshen [1 ]
Picheral, Jose [2 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[2] Univ Paris Saclay, CNRS, Cent Supelec, Signal & Syst Lab, F-91190 Gif Sur Yvette, France
基金
中国国家自然科学基金;
关键词
Array signal processing; Interference; Microphone arrays; Signal to noise ratio; Speech enhancement; Correlation; Arrays; DOA; microphone array; MVDR; speech enhancement; ALTERNATING DIRECTION METHOD; LOCALIZATION;
D O I
10.1109/TASLP.2022.3201391
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Most of the existing Beamforming methods are based on the assumptions that the sources are all point sources and the angular separation between the direction of arrival (DOA) of the source and the interference is large enough to assure good performance. In this paper, we consider a tough scenario where the target source and the interference are simultaneously spatially distributed and overlapped. To improve the performance of Beamforming in this scenario, we propose two approaches: the first approach exploits the non-Gaussianity as well as the spectrogram sparsity of the output of the microphone array; the second approach exploits the generalized sparsity with overlapped groups of the Beampattern. The proposed criteria are solved by methods based on linearized preconditioned alternating direction method of multipliers (LPADMM) with high accuracy and high computational efficiency. Numerical simulations and real data experiments show the advantages of the proposed approaches compared to previously proposed Beamforming methods for signal enhancement.
引用
收藏
页码:2778 / 2790
页数:13
相关论文
共 50 条
  • [21] Binaural Deep Neural Network for Robust Speech Enhancement
    Jiang, Yi
    Liu, Runsheng
    2014 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2014, : 692 - 695
  • [22] A Front-End Speech Enhancement System for Robust Automotive Speech Recognition
    Wang, Haikun
    Ye, Zhongfu
    Chen, Jingdong
    2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 1 - 5
  • [23] Signal enhancement using beamforming and nonstationarity with applications to speech
    Gannot, S
    Burshtein, D
    Weinstein, E
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2001, 49 (08) : 1614 - 1626
  • [24] A SUBBAND HYBRID BEAMFORMING FOR IN-CAR SPEECH ENHANCEMENT
    Fox, Charles
    Vitte, Guillaume
    Charbit, Maurice
    Prado, Jacques
    Badeau, Roland
    David, Bertrand
    2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 11 - 15
  • [25] SPEECH ENHANCEMENT FOR ROBUST SPEECH RECOGNITION IN MOTORCYCLE ENVIRONMENT
    Mporas, Iosif
    Ganchev, Todor
    Kocsis, Otilia
    Fakotakis, Nikos
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2010, 19 (02) : 159 - 173
  • [26] Compensation of speech enhancement distortion for robust speech recognition
    Ding, P
    Cao, ZG
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 449 - 452
  • [27] Far Field Speech Enhancement at Low SNR in Presence of Nonstationary Noise Based on Spectral Masking and MVDR Beamforming
    Astapov, Sergei
    Lavrentyev, Aleksandr
    Shuranov, Evgeniy
    SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 21 - 31
  • [28] Distributed Sensor Selection for Speech Enhancement With Acoustic Sensor Networks
    Hu, De
    Si, Qintuya
    Liu, Rui
    Bao, Feilong
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 985 - 999
  • [29] Speech enhancement using nonlinear microphone array based on noise adaptive complementary beamforming
    Sabuwatari, H
    Kajita, S
    Takeda, K
    Itakura, F
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2000, E83A (05) : 866 - 876
  • [30] Bind subband beamforming with time-delay constraints for moving source speech enhancement
    Yermeche, Zohra
    Grbic, Nedelko
    Claesson, Ingvar
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2360 - 2372