A Hybrid Reverberation Model and Its Application to Joint Speech Dereverberation and Separation

被引:0
|
作者
Liu, Tongzheng [1 ]
Lu, Zhihua [1 ]
da Costa, Joao Paulo J. [2 ]
Fei, Tai [3 ]
机构
[1] Ningbo Univ, Coll Informat Sci & Engn, Ningbo 315211, Peoples R China
[2] Hamm Lippstadt Univ Appl Sci HSHL, Dept Lippstadt 2, D-59063 Hamm, Germany
[3] HELLA GmbH & Co KGaA, D-59552 Lippstadt, Germany
基金
中国国家自然科学基金;
关键词
Reverberation model; dereverberation; speech separation; blind source separation; multichannel nonnegative matrix factorization; microphone array; BLIND SOURCE SEPARATION; NONNEGATIVE MATRIX FACTORIZATION; INDEPENDENT VECTOR EXTRACTION; NOISE-REDUCTION; ALGORITHMS; CANCELLATION; ENHANCEMENT; MIXTURES;
D O I
10.1109/TASLP.2023.3301227
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article proposes a hybrid reverberation model by integrating two conventional models, namely, the multichannel linear prediction (MCLP) model and the spatial coherence model. The late reverberation is divided into two components. One component is modeled using an MCLP model, and the other is modeled using the spatial coherence model. In contrast with the conventional models, the proposed hybrid model increases modeling capacity, especially in the case of long reverberation time. In order to optimally estimate model parameters, joint speech dereverberation and separation is taken into account. The hybrid reverberation model is then used in conjunction with the multichannel nonnegative matrix factorization (MNMF). The method called Hybrid-FastMNMF is proposed by treating the reverberation component modeled by the spatial coherence model as a noise source and estimating its parameters similarly to speech sources. Furthermore, prior knowledge of the spatial coherence matrix is employed to whiten the observations, resulting in another method called Hybrid-FastMNMF-W. Experimental findings demonstrate the proposed methods' superior performance in terms of joint speech dereverberation and separation, and they further justify the efficiency of the proposed hybrid reverberation model.
引用
收藏
页码:3000 / 3014
页数:15
相关论文
共 50 条
  • [21] JOINT TRAINING OF DEEP NEURAL NETWORKS FOR MULTI-CHANNEL DEREVERBERATION AND SPEECH SOURCE SEPARATION
    Togami, Masahito
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3032 - 3036
  • [22] Multichannel speech enhancement algorithm based on hybrid reverberation model
    Xie, Yuan
    Zou, Tao
    Sun, Weijun
    Xie, Shengli
    Tongxin Xuebao/Journal on Communications, 2024, 45 (11): : 15 - 26
  • [23] Over-determined Speech Source Separation and Dereverberation
    Togami, Masahito
    Scheibler, Robin
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 705 - 710
  • [24] A Bayesian Hierarchical Model for Speech Dereverberation
    Laufer, Yaron
    Gannot, Sharon
    2018 IEEE INTERNATIONAL CONFERENCE ON THE SCIENCE OF ELECTRICAL ENGINEERING IN ISRAEL (ICSEE), 2018,
  • [25] Blind Speech Separation and Dereverberation using neural beamforming
    Pfeifenberger, Lukas
    Pernkopf, Franz
    SPEECH COMMUNICATION, 2022, 140 : 29 - 41
  • [26] Blind speech separation using a joint model of speech production
    Smith, D
    Lukasiak, J
    Burnett, I
    IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (11) : 784 - 787
  • [27] A Reverberation-Time-Aware Approach to Speech Dereverberation Based on Deep Neural Networks
    Wu, Bo
    Li, Kehuang
    Yang, Minglei
    Lee, Chin-Hui
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (01) : 102 - 111
  • [28] JOINT AUDIO SOURCE SEPARATION AND DEREVERBERATION BASED ON MULTICHANNEL FACTORIAL HIDDEN MARKOV MODEL
    Higuchi, Takuya
    Kameoka, Hirokazu
    2014 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP), 2014,
  • [29] A blind separation algorithm for speech in strong reverberation
    Li, Huxiong
    Gu, Fan
    Journal of Computational Information Systems, 2010, 6 (11): : 3613 - 3622
  • [30] Defeating reverberation: Advanced dereverberation and recognition techniques for hands-free speech recognition
    Delcroix, Marc
    Yoshioka, Takuya
    Ogawa, Atsunori
    Kubo, Yotaro
    Fujimoto, Masakiyo
    Ito, Nobutaka
    Kinoshita, Keisuke
    Espi, Miquel
    Araki, Shoko
    Hori, Takaaki
    Nakatani, Tomohiro
    2014 IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (GLOBALSIP), 2014, : 522 - 526