MULTI-CHANNEL SPEECH ENHANCEMENT USING BEAMFORMING AND NULLFORMING FOR SEVERELY ADVERSE DRONE ENVIRONMENT

Times cited: 0
Authors
Kim, Seokhyun [1]
Jeong, Won [2]
Park, Hyung-Min [1, 2]
Affiliations
[1] Sogang Univ, Dept Elect Engn, Seoul, South Korea
[2] Sogang Univ, Dept Artificial Intelligence, Seoul, South Korea
Source
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024 | 2024
Keywords
Multi-channel Speech Enhancement; Drone Audition Environments; Neural Beamforming
DOI
10.1109/ICASSPW62465.2024.10626577
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
In this paper, we present an end-to-end neural beamforming method for multi-channel speech enhancement in drone environments with severe noise levels. When a flying drone records with microphones attached to its body, the proximity and intensity of propeller and motor noise result in a low signal-to-noise ratio (SNR) for the target speech. EaBNet is a deep neural network (DNN) based beamformer that uses embedding and beamforming modules to address the lack of interpretability in previous end-to-end beamforming models. EaBNet exploits spatial information for speech enhancement and seeks additional improvement through a PostNet. Building upon EaBNet, we incorporate a module inspired by the structure of the Generalized Sidelobe Canceller (GSC) algorithm to estimate the spatial information of ego-noise through nullforming. Additionally, we propose estimating spectral features from the nullforming outputs and supplying them, together with the beamforming output, as input to the PostNet to achieve higher performance. The results confirm that both methods improve performance.
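The GSC structure mentioned in the abstract pairs a beamforming branch, which passes the target direction, with a nullforming branch, which blocks it and so yields noise-only references. The following is a minimal sketch of the classical narrowband version, not the neural model of the paper. The uniform linear array, the function names `steering_vector` and `gsc_branches`, and all numeric parameters are assumptions made only for illustration.

```python
import numpy as np


def steering_vector(num_mics, spacing_m, angle_rad, freq_hz, c=343.0):
    """Far-field steering vector of a uniform linear array (assumed geometry)."""
    delays = np.arange(num_mics) * spacing_m * np.sin(angle_rad) / c
    return np.exp(-2j * np.pi * freq_hz * delays)


def gsc_branches(d):
    """Return delay-and-sum weights and a blocking (nullforming) matrix for d."""
    num_mics = d.shape[0]
    # Beamforming branch: w^H d = 1 because every entry of d has unit modulus.
    w_bf = d / num_mics
    # Projector onto the subspace orthogonal to the target steering vector.
    proj = np.eye(num_mics) - np.outer(d, d.conj()) / num_mics
    # The first M-1 left singular vectors span that subspace (singular value 1).
    u, _, _ = np.linalg.svd(proj)
    blocking = u[:, : num_mics - 1]
    return w_bf, blocking


# Hypothetical setup: 4 microphones, 5 cm spacing, target at 30 degrees, 1 kHz bin.
d = steering_vector(num_mics=4, spacing_m=0.05,
                    angle_rad=np.deg2rad(30.0), freq_hz=1000.0)
w_bf, blocking = gsc_branches(d)

# One frequency-bin snapshot: target component plus random noise.
rng = np.random.default_rng(0)
noise = rng.standard_normal(4) + 1j * rng.standard_normal(4)
x = 2.0 * d + noise

target_gain = np.vdot(w_bf, d)            # w^H d, distortionless response
target_leak = blocking.conj().T @ d       # B^H d, target leakage into the nulls
noise_refs = blocking.conj().T @ x        # nullforming outputs
noise_only = blocking.conj().T @ noise    # same branch applied to the noise alone
```

Because the blocking matrix is orthogonal to the target steering vector, `noise_refs` matches `noise_only` up to rounding: the nullforming branch carries information about the noise but not about the target, which is the property the abstract relies on for estimating ego-noise.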
Pages: 755-759
Number of pages: 5