NEURAL NETWORK BASED SPECTRAL MASK ESTIMATION FOR ACOUSTIC BEAMFORMING

被引:0
|
作者
Heymann, Jahn [1 ]
Drude, Lukas [1 ]
Haeb-Umbach, Reinhold [1 ]
机构
[1] Univ Paderborn, Dept Commun Engn, Paderborn, Germany
来源
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS | 2016年
关键词
Robust Speech Recognition; Acoustic Beamforming; Feature Enhancement; Deep Neural Network; SOURCE SEPARATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
We present a neural network based approach to acoustic beamforming. The network is used to estimate spectral masks from which the Cross-Power Spectral Density matrices of speech and noise are estimated, which in turn are used to compute the beamformer coefficients. The network training is independent of the number and the geometric configuration of the microphones. We further show that it is possible to train the network on clean speech only, avoiding the need for stereo data with separated speech and noise. Two types of networks are evaluated. One small feed-forward network with only one hidden layer and one more elaborated bi-directional Long Short-Term Memory network. We compare our system with different parametric approaches to mask estimation and using different beamforming algorithms. We show that our system yields superior results, both in terms of perceptual speech quality and with respect to speech recognition error rate. The results for the simple feed-forward network are especially encouraging considering its low computational requirements.
引用
收藏
页码:196 / 200
页数:5
相关论文
共 50 条
  • [1] ROBUST MASK ESTIMATION BY INTEGRATING NEURAL NETWORK-BASED AND CLUSTERING-BASED APPROACHES FOR ADAPTIVE ACOUSTIC BEAMFORMING
    Zhou, Ying
    Qian, Yanmin
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 536 - 540
  • [2] DNN-BASED MASK ESTIMATION INTEGRATING SPECTRAL AND SPATIAL FEATURES FOR ROBUST BEAMFORMING
    Deng, Chengyun
    Song, Hui
    Zhang, Yi
    Sha, Yongtao
    Li, Xiangang
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4647 - 4651
  • [3] A New Neural Network DOA Estimation Technique Based on Subarray Beamforming
    Caylar, S.
    ICEAA: 2009 INTERNATIONAL CONFERENCE ON ELECTROMAGNETICS IN ADVANCED APPLICATIONS, VOLS 1 AND 2, 2009, : 732 - 734
  • [4] Unsupervised training of neural mask-based beamforming
    Drude, Lukas
    Heymann, Jahn
    Haeb-Umbach, Reinhold
    INTERSPEECH 2019, 2019, : 1253 - 1257
  • [5] Acoustic neuroma classification algorithm based on mask region convolution neural network
    Li, Xiaojun
    Li, Cheng
    Zhou, Rong
    Wei, Lijie
    Wang, Yanping
    JOURNAL OF RADIATION RESEARCH AND APPLIED SCIENCES, 2024, 17 (01)
  • [6] DNN-BASED SPEECH MASK ESTIMATION FOR EIGENVECTOR BEAMFORMING
    Pfeifenberger, Lukas
    Zoehrer, Matthias
    Pernkopf, Franz
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 66 - 70
  • [7] Accelerating MRI Uncertainty Estimation with Mask-based Bayesian Neural Network
    Zhang, Zehuan
    Genci, Matej
    Fan, Hongxiang
    Wetscherek, Andreas
    Luk, Wayne
    2024 IEEE 35TH INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES AND PROCESSORS, ASAP 2024, 2024, : 107 - 115
  • [8] Multi-window spectral estimation based on neural network techniques
    Ma, Wei
    Wu, Mu-Qing
    Li, Guan-Nan
    Zhang, Ning
    Dianbo Kexue Xuebao/Chinese Journal of Radio Science, 2009, 24 (06): : 1154 - 1157
  • [9] A Neural Network based local SNR estimation for estimating spectral masks
    Hadjahmadi, Amir Hossein
    Homayounpour, Mohammad Mehdi
    Ahadi, Seyed Mohammad
    2008 INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS, VOLS 1 AND 2, 2008, : 608 - +
  • [10] Adaptive beamforming technique based on neural network
    Fan, Zehua
    Wilson, Jackson
    Grey, Alex
    ENGINEERING TECHNOLOGY AND APPLICATIONS, 2014, : 213 - 217