Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter

被引:0
|
作者
Wang, Dujuan [1 ]
Bao, Changchun [1 ]
机构
[1] Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing, Peoples R China
来源
2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020) | 2020年
基金
中国国家自然科学基金;
关键词
beamforming; speech enhancement; residual neural network; real and imaginary masks; postfilter;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep neural network (DNN) based ideal ratio mask (IRM) estimation methods have yielded good performance in monaural speech enhancement. Meanwhile, these methods have also shown considerable potential for beamforming and multichannel speech enhancement. It is crucial for minimum variance distortionless response (MVDR) beamformer to estimate the covariance matrix of the speech and noise accurately. The accurate estimation of time-frequency (T-F) mask has significant impact on the estimation of the covariance matrices. So, in this paper, a complex real and imaginary ratio mask (CRIRM) based MVDR beamformer for speech enhancement using residual network is proposed. First, the real and imaginary masks of speech and noise are estimated by taking advantage of a residual neural network. After that, the estimations of speech and noise are obtained by using the estimated masks. Finally, the covariance matrices of speech and noise are estimated, and applied into the MVDR beamformer. In addition, in order to further reduce residual noise interference, the output of the MVDR beamformer is further processed by an end-to-end monaural speech enhancement module. Experiments show that, the proposed method can better improve the quality and intelligibility of the enhanced speech.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] ENHANCEMENT OF REVERBERANT SPEECH USING THE CELP POSTFILTER
    Jeub, Marco
    Vary, Peter
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 3993 - 3996
  • [42] Real-time Multi-channel Speech Enhancement Based on Neural Network Masking with Attention Model
    Xue, Cheng
    Huang, Weilong
    Chen, Weiguang
    Feng, Jinwei
    INTERSPEECH 2021, 2021, : 1862 - 1866
  • [43] UTILIZING HEAD ROTATION DATA IN DNN-BASED MULTI-CHANNEL SPEECH ENHANCEMENT FOR HEARING AIDS
    Lentz, Benjamin
    Martin, Rainer
    2024 18TH INTERNATIONAL WORKSHOP ON ACOUSTIC SIGNAL ENHANCEMENT, IWAENC 2024, 2024, : 165 - 169
  • [44] Speech Enhancement Algorithm Based on Microphone Array and Multi-Channel Parallel GRU-CNN Network
    Xi, Ji
    Xu, Zhe
    Zhang, Weiqi
    Xie, Yue
    Zhao, Li
    ELECTRONICS, 2025, 14 (04):
  • [45] TENSOR-TO-VECTOR REGRESSION FOR MULTI-CHANNEL SPEECH ENHANCEMENT BASED ON TENSOR-TRAIN NETWORK
    Qi, Jun
    Hu, Hu
    Wang, Yannan
    Yang, Chao-Han Huck
    Siniscalchi, Sabato Marco
    Lee, Chin-Hui
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 7504 - 7508
  • [46] MASS: Microphone Array Speech Simulator in Room Acoustic Environment for Multi-Channel Speech Coding and Enhancement
    Cheng, Rui
    Bao, Changchun
    Cui, Zihao
    APPLIED SCIENCES-BASEL, 2020, 10 (04):
  • [47] DIFFUSION-BASED DISTRIBUTED MVDR BEAMFORMER
    O'Connor, Matt
    Kleijn, W. Bastiaan
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [48] A delay-based constrained beamformer for blind speech enhancement and dereverberation
    Yermeche, Zohra
    Grbic, Nedelko
    PROCEEDINGS ELMAR 2007, 2007, : 159 - 162
  • [49] Insights Into Deep Non-Linear Filters for Improved Multi-Channel Speech Enhancement
    Tesch, Kristina
    Gerkmann, Timo
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 31 : 563 - 575
  • [50] Multi-channel speech enhancement using early and late fusion convolutional neural networks
    S. Siva Priyanka
    T. Kishore Kumar
    Signal, Image and Video Processing, 2023, 17 : 973 - 979