Multi-channel Speech Enhancement Based on the MVDR Beamformer and Postfilter

Authors
Wang, Dujuan [1]
Bao, Changchun [1]
Affiliations
[1] Beijing Univ Technol, Fac Informat Technol, Speech & Audio Signal Proc Lab, Beijing, Peoples R China
Source
2020 IEEE INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATIONS AND COMPUTING (IEEE ICSPCC 2020) | 2020
Funding
National Natural Science Foundation of China;
Keywords
beamforming; speech enhancement; residual neural network; real and imaginary masks; postfilter;
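DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract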
Deep neural network (DNN) based ideal ratio mask (IRM) estimation methods have yielded good performance in monaural speech enhancement, and they have also shown considerable potential for beamforming and multichannel speech enhancement. The minimum variance distortionless response (MVDR) beamformer depends on accurate estimates of the speech and noise covariance matrices, and the accuracy of the estimated time-frequency (T-F) mask has a significant impact on those estimates. This paper therefore proposes a complex real and imaginary ratio mask (CRIRM) based MVDR beamformer for speech enhancement that uses a residual network. First, the real and imaginary masks of speech and noise are estimated by a residual neural network. Next, estimates of the speech and noise are obtained by applying the estimated masks. Finally, the covariance matrices of speech and noise are estimated from these signals and used in the MVDR beamformer. To further reduce residual noise, the output of the MVDR beamformer is then processed by an end-to-end monaural speech enhancement module that acts as a postfilter. Experiments show that the proposed method yields better quality and intelligibility of the enhanced speech.
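The mask-to-beamformer steps described in the abstract can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the residual network and the postfilter are not reproduced, oracle complex masks stand in for the network output, the masks are assumed to be per-channel arrays of shape (F, T, M), and the weights use the common reference-channel MVDR form w = (Phi_n^{-1} Phi_s) u / tr(Phi_n^{-1} Phi_s), which the abstract does not confirm. The function names `spatial_covariance`, `mvdr_weights`, and `beamform` are hypothetical.

```python
import numpy as np

def spatial_covariance(X):
    """Time-averaged spatial covariance. X: (F, T, M) complex -> (F, M, M)."""
    T = X.shape[1]
    return np.einsum("ftm,ftn->fmn", X, X.conj()) / T

def mvdr_weights(phi_s, phi_n, ref=0, loading=1e-6):
    """Reference-channel MVDR weights per frequency bin, shape (F, M).

    Assumed form: w = (Phi_n^{-1} Phi_s) u_ref / trace(Phi_n^{-1} Phi_s).
    """
    M = phi_n.shape[-1]
    # Light diagonal loading keeps the noise covariance well conditioned.
    scale = np.trace(phi_n, axis1=1, axis2=2).real / M
    phi_n = phi_n + loading * scale[:, None, None] * np.eye(M)
    ratio = np.linalg.solve(phi_n, phi_s)          # Phi_n^{-1} Phi_s, (F, M, M)
    tr = np.trace(ratio, axis1=1, axis2=2)
    return ratio[:, :, ref] / tr[:, None]

def beamform(w, X):
    """Apply w^H x in every T-F bin. w: (F, M), X: (F, T, M) -> (F, T)."""
    return np.einsum("fm,ftm->ft", w.conj(), X)

# Synthetic demo: one point source plus spatially white noise.
rng = np.random.default_rng(0)
F, T, M = 8, 400, 4

def crandn(*shape):
    """Unit-variance circular complex Gaussian samples."""
    return (rng.standard_normal(shape) + 1j * rng.standard_normal(shape)) / np.sqrt(2)

d = crandn(F, 1, M)              # hypothetical steering vector per frequency
s = crandn(F, T, 1)              # source STFT coefficients
speech = d * s                   # speech image at the microphones, (F, T, M)
noise = 0.5 * crandn(F, T, M)
Y = speech + noise               # noisy multichannel STFT

# Oracle complex masks stand in for the network's real/imaginary mask outputs.
mask_s = speech / Y
mask_n = noise / Y

# Masks -> signal estimates -> covariance matrices -> MVDR weights.
s_hat = mask_s * Y
n_hat = mask_n * Y
w = mvdr_weights(spatial_covariance(s_hat), spatial_covariance(n_hat))

out_speech = beamform(w, speech)  # speech component at the beamformer output
out_noise = beamform(w, noise)    # residual noise at the beamformer output
```

With oracle masks the speech component at the output matches the reference-channel speech, which is the distortionless property, while the residual noise power falls below that of the reference microphone. With estimated masks both properties hold only approximately, which is the gap a postfilter is meant to address.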
Pages: 5