Inference-Adaptive Steering of Neural Networks for Real-Time Area-Based Sound Source Separation

Cited by: 0
Authors
Strauss, Martin [1]
Mack, Wolfgang [2]
Valero, Maria Luis [3]
Koepueklue, Okan [3]
Affiliations
[1] Joint Inst Friedrich Alexander Univ Erlangen Nurnb, Int Audio Labs Erlangen, D-91058 Erlangen, Germany
[2] Friedrich Alexander Univ Erlangen Nurnberg, D-91058 Erlangen, Germany
[3] Microsoft Appl Sci Grp, D-80807 Munich, Germany
Keywords
Microphone arrays; Source separation; Noise; Training; Real-time systems; Indexes; Mathematical models; Electronic mail; Background noise; Artificial neural networks; Neural steering; real-time DNNs; source separation
DOI
10.1109/LSP.2025.3543454
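Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic and communication technology]
Discipline classification code
0808; 0809
Abstract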
We propose a novel adaptive steering technique that changes the target area of a spatial-aware multi-microphone sound source separation algorithm during inference without the necessity of retraining the deep neural network (DNN). To achieve this, we first train a DNN aiming to retain speech within a target region, defined by an angular span, while suppressing sound sources stemming from other directions. Afterward, a phase shift is applied to the microphone signals, allowing us to shift the center of the target area during inference at negligible additional cost in computational complexity. Further, we show that the proposed approach performs well in a wide variety of acoustic scenarios, including several speakers inside and outside the target area and additional noise. More precisely, the proposed approach performs on par with DNNs trained explicitly for the steered target area in terms of DNSMOS and SI-SDR.
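The phase-shift step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the record does not give the array geometry or the exact steering formula, so the code below assumes a far-field, free-field plane-wave model, known 2-D microphone positions, and processing in the STFT domain. All function names and the speed-of-sound constant are hypothetical choices made for illustration. Under these assumptions, each microphone channel is multiplied by a frequency-dependent unit-magnitude phase term so that a source at the new target center appears to the (unchanged) network as if it arrived from the center used during training.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s; assumed value, not taken from the record


def plane_wave_delays(mic_xy, azimuth_rad, c=SPEED_OF_SOUND):
    """Relative far-field arrival delays in seconds, one per microphone.

    mic_xy: array of shape (mics, 2) with microphone positions in metres.
    azimuth_rad: direction of the source as seen from the array origin.
    """
    direction = np.array([np.cos(azimuth_rad), np.sin(azimuth_rad)])
    # Microphones closer to the source receive the wavefront earlier.
    return -(mic_xy @ direction) / c


def steering_phase_shift(mic_xy, freqs_hz, trained_center_rad,
                         new_center_rad, c=SPEED_OF_SOUND):
    """Per-microphone, per-frequency phase factors of shape (mics, freqs).

    Multiplying the microphone spectra by these factors maps a plane wave
    from new_center_rad onto one from trained_center_rad.
    """
    extra_delay = (plane_wave_delays(mic_xy, trained_center_rad, c)
                   - plane_wave_delays(mic_xy, new_center_rad, c))
    return np.exp(-2j * np.pi * np.outer(extra_delay, freqs_hz))


def steer(stft, shift):
    """Apply the phase factors to an STFT of shape (mics, freqs, frames)."""
    return stft * shift[:, :, None]
```

Because the operation is one complex multiplication per time-frequency bin and channel, its cost is small compared with a DNN forward pass, which is consistent with the abstract's remark about negligible additional complexity. Note that in this simplified model the mapping is exact only for a plane wave arriving from the new center; how other directions inside the angular span are handled depends on the array geometry and is not covered by this sketch.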
Pages: 1041-1045
Number of pages: 5
Related papers
50 in total
  • [1] Real-time source separation based on sound localization in a reverberant environment
    Aoki, M
    Furuya, K
    NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, 2002, : 475 - 484
  • [2] Real-time area-based haptic rendering for a palpation simulator
    Kyung, Ki-Uk
    Park, Jinah
    Kwon, Dong-Soo
    Kim, Sang-Youn
    BIOMEDICAL SIMULATION, PROCEEDINGS, 2006, 4072 : 132 - 141
  • [3] Comparing Optimization Methods of Neural Networks for Real-time Inference
    Khan, Mir
    Lunnikivi, Henri
    Huttunen, Heikki
    Boutellier, Jani
    2019 27TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2019,
  • [4] Real-time Multi Source Speech Enhancement based on Sound Source Separation using Microphone Array
    Jeyasingh, P.
    Ismail, M. Mohamed
    2018 CONFERENCE ON EMERGING DEVICES AND SMART SYSTEMS (ICEDSS), 2018, : 183 - 187
  • [5] Real-Time Microphone Array Processing for Sound Source Separation and Localization
    Sun, Longji
    Cheng, Qi
    2013 47TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2013,
  • [6] Expected Area-Based Real-Time Routing Protocol for Supporting Mobile Sinks in Wireless Sensor Networks
    Nam, Youngju
    Choi, Hyunseok
    Shin, Yongje
    Park, Soochang
    Lee, Euisin
    ELECTRONICS, 2022, 11 (20)
  • [7] Real-time sound source localization and separation based on active audio-visual integration
    Okuno, HG
    Nakadai, K
    COMPUTATIONAL METHODS IN NEURAL MODELING, PT 1, 2003, 2686 : 118 - 125
  • [8] Real-Time Inference of Neural Networks on FPGAs for Motor Control Applications
    Schindler, Tobias
    Dietz, Armin
    2020 10TH INTERNATIONAL ELECTRIC DRIVES PRODUCTION CONFERENCE (EDPC), 2020, : 318 - 323
  • [9] Adaptive real-time road detection using neural networks
    Foedisch, M
    Takeuchi, A
    ITSC 2004: 7TH INTERNATIONAL IEEE CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, PROCEEDINGS, 2004, : 167 - 172
  • [10] Mixed-signal real-time adaptive blind source separation
    Celik, A
    Stanacevic, M
    Cauwenberghs, G
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 5, PROCEEDINGS, 2004, : 760 - 763