LOW LATENCY TWO STAGE BEAMFORMING WITH DISTRIBUTED MICROPHONE ARRAYS USING A PLANE WAVE DECOMPOSITION

被引:0
作者
Mittal, Manan [1 ]
Corey, Ryan M. [2 ,3 ]
Zhuang, Yongjie [1 ]
Singer, Andrew C. [1 ]
机构
[1] SUNY Stony Brook, Stony Brook, NY 11794 USA
[2] Univ Illinois, Chicago, IL USA
[3] Discovery Partners Inst, Chicago, IL USA
来源
2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024 | 2024年
关键词
Plane Wave Decomposition; Beamforming; Microphone Arrays; Sensor Networks; SPEECH ENHANCEMENT;
D O I
10.1109/IWAENC61483.2024.10694131
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Many rooms are equipped with conferencing systems with a large number of microphones. However, these systems are often limited to a fixed number of delay-and-sum beamformers and typical automix software will pick the beamformer output with the most energy. A more constructive combination is possible if we have access to the output of multiple beams constructed by such commercial systems. This article proposes a beamspace projection that may effectively view such a commercial conferencing system as a low-latency dimensionality reduction operation. Such a projection can be formulated as a plane wave decomposition of the received signals. Experiments conducted in simulation show that beamspace projection can rapidly approach the SINR gain achievable with access to all the microphones with lower computational complexity. Finally, we use commercial hardware and its beamformer outputs to separate numerous talkers in a real world environment.
引用
收藏
页码:180 / 184
页数:5
相关论文
共 21 条
  • [1] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS
    ALLEN, JB
    BERKLEY, DA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) : 943 - 950
  • [2] [Anonymous], 2004, 5 ISCA WORKSH SPEECH
  • [3] Bertrand A., 2011, 2011 18 IEEE S COMM
  • [4] Distributed Adaptive Node-Specific Signal Estimation in Fully Connected Sensor Networks-Part I: Sequential Node Updating
    Bertrand, Alexander
    Moonen, Marc
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2010, 58 (10) : 5277 - 5291
  • [5] Cherkassky D, 2015, EUR SIGNAL PR CONF, P245, DOI 10.1109/EUSIPCO.2015.7362382
  • [6] Cooperative Speech Separation With a Microphone Array and Asynchronous Wearable Devices
    Corey, Ryan M.
    Mittal, Manan
    Sarkar, Kanad
    Singer, Andrew C.
    [J]. INTERSPEECH 2022, 2022, : 5398 - 5402
  • [7] Corey RM, 2019, 2019 IEEE 8TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2019), P296, DOI [10.1109/camsap45676.2019.9022475, 10.1109/CAMSAP45676.2019.9022475]
  • [8] A Consolidated Perspective on Multimicrophone Speech Enhancement and Source Separation
    Gannot, Sharon
    Vincent, Emmanuel
    Markovich-Golan, Shmulik
    Ozerov, Alexey
    [J]. IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (04) : 692 - 730
  • [9] Golan SM, 2010, INT CONF ACOUST SPEE, P201, DOI 10.1109/ICASSP.2010.5496044
  • [10] Hioka Y, 2014, 2014 14TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT (IWAENC), P85, DOI 10.1109/IWAENC.2014.6953343