Inference-Adaptive Steering of Neural Networks for Real-Time Area-Based Sound Source Separation

被引:0
|
作者
Strauss, Martin [1 ]
Mack, Wolfgang [2 ]
Valero, Maria Luis [3 ]
Koepueklue, Okan [3 ]
机构
[1] Joint Inst Friedrich Alexander Univ Erlangen Nurnb, Int Audio Labs Erlangen, D-91058 Erlangen, Germany
[2] Friedrich Alexander Univ Erlangen Nurnberg, D-91058 Erlangen, Germany
[3] Microsoft Appl Sci Grp, D-80807 Munich, Germany
关键词
Microphone arrays; Source separation; Noise; Training; Real-time systems; Indexes; Mathematical models; Electronic mail; Background noise; Artificial neural networks; Neural steering; real-time DNNs; source separation;
D O I
10.1109/LSP.2025.3543454
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
We propose a novel adaptive steering technique that changes the target area of a spatial-aware multi-microphone sound source separation algorithm during inference without the necessity of retraining the deep neural network (DNN). To achieve this, we first train a DNN aiming to retain speech within a target region, defined by an angular span, while suppressing sound sources stemming from other directions. Afterward, a phase shift is applied to the microphone signals, allowing us to shift the center of the target area during inference at negligible additional cost in computational complexity. Further, we show that the proposed approach performs well in a wide variety of acoustic scenarios, including several speakers inside and outside the target area and additional noise. More precisely, the proposed approach performs on par with DNNs trained explicitly for the steered target area in terms of DNSMOS and SI-SDR.
引用
收藏
页码:1041 / 1045
页数:5
相关论文
共 50 条
  • [21] Scalable real-time sound source localization method based on TDOA
    Heydari, Zahra
    Mahabadi, Aminollah
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (15) : 23333 - 23372
  • [22] Real-time sound source localization based on audiovisual frequency integration
    Tsuji, Tokuo
    Yamamoto, Kenkichi
    Ishii, Idaku
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 4, PROCEEDINGS, 2006, : 322 - +
  • [23] Design of a Real-Time Automatic Source Monitoring Framework Based on Sound Source Localization
    Dey, Spandan
    Boppu, Srinivas
    Manikandan, M. Sabarimalai
    2019 SEVENTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS (ICDIPC 2019), 2019, : 35 - 40
  • [24] Real-time area-based haptic rendering and the augmented tactile display device for a palpation simulator
    Kim, Sang-Youn
    Kyung, Ki-Uk
    Park, Jinah
    Kwon, Dong-Soo
    ADVANCED ROBOTICS, 2007, 21 (09) : 961 - 981
  • [25] Real-time convolutive blind source separation based on a broadband approach
    Aichner, R
    Buchner, H
    Fei, Y
    Kellermann, W
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 840 - 848
  • [26] Real-time steering of curved sound beams in a feedback-based topological acoustic metamaterial
    Sirota, Lea
    Sabsovich, Daniel
    Lahini, Yoav
    Ilan, Roni
    Shokef, Yair
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2021, 153
  • [27] New approach to real-time adaptive learning control of neural networks based on an evolutionary algorithm (I)
    Chang, SO
    Lee, JK
    ISIE 2001: IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS PROCEEDINGS, VOLS I-III, 2001, : 1871 - 1876
  • [28] New approach to real-time adaptive learning control of neural networks based on an evolutionary algorithm (II)
    Chang, SO
    Lee, JK
    ISIE 2001: IEEE INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS PROCEEDINGS, VOLS I-III, 2001, : 1877 - 1880
  • [29] Real-time TDOA-based stationary sound source direction finding
    Zahra Heydari
    Aminollah Mahabadi
    Multimedia Tools and Applications, 2023, 82 : 39929 - 39960
  • [30] Real-time TDOA-based stationary sound source direction finding
    Heydari, Zahra
    Mahabadi, Aminollah
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (26) : 39929 - 39960