MULTI-CHANNEL SPEECH ENHANCEMENT USING BEAMFORMING AND NULLFORMING FOR SEVERELY ADVERSE DRONE ENVIRONMENT

被引：0

作者：

Kim, Seokhyun ^{[1
]}

Jeong, Won ^{[2
]}

Park, Hyung-Min ^{[1
,2
]}

机构：

[1] Sogang Univ, Dept Elect Engn, Seoul, South Korea

[2] Sogang Univ, Dept Artificial Intelligence, Seoul, South Korea

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024 | 2024年

关键词：

Multi-channel Speech Enhancement; Drone Audition Environments; Neural Beamforming;

D O I：

10.1109/ICASSPW62465.2024.10626577

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we present an end-to-end neural beamforming method for multi-channel speech enhancement in drone environments with severe noise levels. In flying drone environment, recording with microphones attached to the drone creates a situation where the proximity and intensity of propeller and motor noise result in a low signal-to-noise ratio (SNR) compared to the target speech. EaBNet is a Deep Neural Network (DNN) based beamforming that utilizes embedding and beamforming modules to address the inherent un-interpretability in previous end-to-end beamforming models. EaBNet utilizes spatial information for speech enhancement and seeks additional improvement through PostNet. Building upon EaBNet, we incorporate a module inspired by the structure of the Generalized Sidelobe Canceller (GSC) algorithm to estimate the spatial information of ego-noise through nullforming. Additionally, we suggest estimating the spectral features of nullforming estimation outputs in addition to the beamforming output and for the input of the PostNet to achieve higher performance. As a result, it was confirmed that performance improved through these two methods.

引用

页码：755 / 759

页数：5

共 32 条

[31] A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-channel Speech Recognition in the CHiME-6 Challenge
Tu, Yan-Hui
Du, Jun
Sun, Lei
Ma, Feng
Pan, Jia
Lee, Chin-Hui
INTERSPEECH 2020, 2020, : 96 - 100
[32] MULTI-CHANNEL SPEAKER LOCALIZATION AND SEPARATION USING A MODEL-BASED GSC AND AN INERTIAL MEASUREMENT UNIT
Zohourian, Mehdi
Archer-Boyd, Alan
Martin, Rainer
2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 5615 - 5619

← 1 2 3 4 →