Speech Enhancement With Robust Beamforming for Spatially Overlapped and Distributed Sources

被引：5

作者：

Xiong Wenmeng ^{[1
]}

Bao Changchun ^{[1
]}

Jia Maoshen ^{[1
]}

Picheral, Jose ^{[2
]}

机构：

[1] Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

[2] Univ Paris Saclay, CNRS, Cent Supelec, Signal & Syst Lab, F-91190 Gif Sur Yvette, France

来源：

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING | 2022年 / 30卷

基金：

中国国家自然科学基金;

关键词：

Array signal processing; Interference; Microphone arrays; Signal to noise ratio; Speech enhancement; Correlation; Arrays; DOA; microphone array; MVDR; speech enhancement; ALTERNATING DIRECTION METHOD; LOCALIZATION;

D O I：

10.1109/TASLP.2022.3201391

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Most of the existing Beamforming methods are based on the assumptions that the sources are all point sources and the angular separation between the direction of arrival (DOA) of the source and the interference is large enough to assure good performance. In this paper, we consider a tough scenario where the target source and the interference are simultaneously spatially distributed and overlapped. To improve the performance of Beamforming in this scenario, we propose two approaches: the first approach exploits the non-Gaussianity as well as the spectrogram sparsity of the output of the microphone array; the second approach exploits the generalized sparsity with overlapped groups of the Beampattern. The proposed criteria are solved by methods based on linearized preconditioned alternating direction method of multipliers (LPADMM) with high accuracy and high computational efficiency. Numerical simulations and real data experiments show the advantages of the proposed approaches compared to previously proposed Beamforming methods for signal enhancement.

引用

页码：2778 / 2790

页数：13

共 50 条

[21] Binaural Deep Neural Network for Robust Speech Enhancement
Jiang, Yi
Liu, Runsheng
2014 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (ICSPCC), 2014, : 692 - 695
[22] A Front-End Speech Enhancement System for Robust Automotive Speech Recognition
Wang, Haikun
Ye, Zhongfu
Chen, Jingdong
2018 11TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2018, : 1 - 5
[23] Signal enhancement using beamforming and nonstationarity with applications to speech
Gannot, S
Burshtein, D
Weinstein, E
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2001, 49 (08) : 1614 - 1626
[24] A SUBBAND HYBRID BEAMFORMING FOR IN-CAR SPEECH ENHANCEMENT
Fox, Charles
Vitte, Guillaume
Charbit, Maurice
Prado, Jacques
Badeau, Roland
David, Bertrand
2012 PROCEEDINGS OF THE 20TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2012, : 11 - 15
[25] SPEECH ENHANCEMENT FOR ROBUST SPEECH RECOGNITION IN MOTORCYCLE ENVIRONMENT
Mporas, Iosif
Ganchev, Todor
Kocsis, Otilia
Fakotakis, Nikos
INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2010, 19 (02) : 159 - 173
[26] Compensation of speech enhancement distortion for robust speech recognition
Ding, P
Cao, ZG
2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 449 - 452
[27] Far Field Speech Enhancement at Low SNR in Presence of Nonstationary Noise Based on Spectral Masking and MVDR Beamforming
Astapov, Sergei
Lavrentyev, Aleksandr
Shuranov, Evgeniy
SPEECH AND COMPUTER (SPECOM 2018), 2018, 11096 : 21 - 31
[28] Distributed Sensor Selection for Speech Enhancement With Acoustic Sensor Networks
Hu, De
Si, Qintuya
Liu, Rui
Bao, Feilong
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 985 - 999
[29] Speech enhancement using nonlinear microphone array based on noise adaptive complementary beamforming
Sabuwatari, H
Kajita, S
Takeda, K
Itakura, F
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2000, E83A (05) : 866 - 876
[30] Bind subband beamforming with time-delay constraints for moving source speech enhancement
Yermeche, Zohra
Grbic, Nedelko
Claesson, Ingvar
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (08): : 2360 - 2372

← 1 2 3 4 5 →