LOW LATENCY TWO STAGE BEAMFORMING WITH DISTRIBUTED MICROPHONE ARRAYS USING A PLANE WAVE DECOMPOSITION

Cited by: 0

Authors:

Mittal, Manan [1]

Corey, Ryan M. [2,3]

Zhuang, Yongjie [1]

Singer, Andrew C. [1]

Affiliations:

[1] SUNY Stony Brook, Stony Brook, NY 11794 USA

[2] Univ Illinois, Chicago, IL USA

[3] Discovery Partners Inst, Chicago, IL USA

Source:

2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024 | 2024

Keywords:

Plane Wave Decomposition; Beamforming; Microphone Arrays; Sensor Networks; SPEECH ENHANCEMENT;

DOI:

10.1109/IWAENC61483.2024.10694131

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

Many rooms are equipped with conferencing systems with a large number of microphones. However, these systems are often limited to a fixed number of delay-and-sum beamformers, and typical automix software will pick the beamformer output with the most energy. A more constructive combination is possible if we have access to the output of multiple beams constructed by such commercial systems. This article proposes a beamspace projection that may effectively view such a commercial conferencing system as a low-latency dimensionality reduction operation. Such a projection can be formulated as a plane wave decomposition of the received signals. Experiments conducted in simulation show that beamspace projection can rapidly approach the SINR gain achievable with access to all the microphones, at lower computational complexity. Finally, we use commercial hardware and its beamformer outputs to separate numerous talkers in a real-world environment.


Pages: 180-184

Page count: 5

