DNN-BASED DISTRIBUTED MULTICHANNEL MASK ESTIMATION FOR SPEECH ENHANCEMENT IN MICROPHONE ARRAYS

Cited by: 0
Authors
Furnon, Nicolas [1 ]
Serizel, Romain [1 ]
Illina, Irina [1 ]
Essid, Slim [2 ]
Affiliations
[1] Univ Lorraine, CNRS, INRIA, Loria, F-54000 Nancy, France
[2] Inst Polytech Paris, Telecom Paris, LTCI, Palaiseau, France
Source
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020
Keywords
Speech enhancement; microphone arrays; distributed processing; Wiener filter
DOI
10.1109/icassp40776.2020.9054643
Chinese Library Classification number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
Multichannel processing is widely used for speech enhancement, but several limitations appear when deploying these solutions in the real world. Distributed sensor arrays, made up of several devices that each carry a few microphones, are a viable solution that exploits the many microphone-equipped devices we use in everyday life. In this context, we propose to extend the distributed adaptive node-specific signal estimation approach to a neural network framework. At each node, local filtering is performed to send one signal to the other nodes, where a mask is estimated by a neural network in order to compute a global multichannel Wiener filter. In an array of two nodes, we show that this additional signal can be leveraged to predict the masks and leads to better speech enhancement performance than when the mask estimation relies only on the local signals.
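The processing chain described in the abstract (local filtering at one node, one signal sent to the other node, a mask-driven multichannel Wiener filter there) can be illustrated with a rough NumPy sketch. This is not the authors' implementation: the function names `mask_based_mwf` and `two_node_enhance`, the covariance estimators, the regularisation constant `eps`, and the choice of reference channel are all assumptions made for illustration, and the masks are simply passed in as arrays where the paper uses a neural network to estimate them.

```python
import numpy as np


def mask_based_mwf(Y, mask, ref=0, eps=1e-8):
    """Mask-driven multichannel Wiener filter (illustrative sketch).

    Y    : complex STFT, shape (channels, freqs, frames)
    mask : speech presence mask in [0, 1], shape (freqs, frames)
    ref  : index of the reference channel (assumed choice)
    Returns the filtered STFT of shape (freqs, frames).
    """
    C, F, T = Y.shape
    out = np.empty((F, T), dtype=complex)
    for f in range(F):
        Yf = Y[:, f, :]                      # (C, T) observations in this bin
        m = mask[f]                          # (T,) mask values in this bin
        # Mask-weighted covariance estimates of speech and noise.
        Rs = (m * Yf) @ Yf.conj().T / max(m.sum(), eps)
        Rn = ((1.0 - m) * Yf) @ Yf.conj().T / max((1.0 - m).sum(), eps)
        # w = (Rs + Rn)^{-1} Rs e_ref, with a small diagonal loading.
        w = np.linalg.solve(Rs + Rn + eps * np.eye(C), Rs[:, ref])
        out[f] = w.conj() @ Yf               # w^H y for every frame
    return out


def two_node_enhance(Y_a, Y_b, mask_a, mask_b):
    """Two-node sketch: node A sends one filtered signal to node B.

    Node B stacks its local channels with the signal received from A
    and applies the same filter on the extended channel set.
    """
    z_a = mask_based_mwf(Y_a, mask_a)                   # single signal sent by A
    Y_b_ext = np.concatenate([Y_b, z_a[None]], axis=0)  # local mics + z_a
    return mask_based_mwf(Y_b_ext, mask_b)
```

In this sketch node B simply treats the received signal as one extra channel, which is the property the abstract relies on: the additional signal is available both to the mask estimator and to the filter.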
Pages: 4672-4676
Number of pages: 5