Deficient-basis-complementary rank-constrained spatial covariance matrix estimation based on multivariate generalized Gaussian distribution for blind speech extraction

被引:3
|
作者
Kondo, Yuto [1 ]
Kubo, Yuki [1 ]
Takamune, Norihiro [1 ]
Kitamura, Daichi [2 ]
Saruwatari, Hiroshi [1 ]
机构
[1] Univ Tokyo, Bunkyo Ku, 7-3-1 Hongo, Tokyo 1138654, Japan
[2] Kagawa Coll, Natl Inst Technol, 355 Chokushi, Takamatsu, Kagawa 7618058, Japan
关键词
Blind speech extraction; Blind source separation; Diffuse noise; Spatial covariance matrix; Rank-constrained spatial covariance matrix estimation; SOURCE SEPARATION; CONVOLUTIVE MIXTURES; FACTORIZATION; ENHANCEMENT; ALGORITHMS; ICA;
D O I
10.1186/s13634-022-00905-z
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Rank-constrained spatial covariance matrix estimation (RCSCME) is a blind speech extraction method utilized under the condition that one-directional target speech and diffuse background noise are mixed. In this paper, we propose a new model extension of RCSCME. RCSCME simultaneously conducts both the deficient rank-1 component complementation of the diffuse noise spatial covariance matrix, which is incompletely estimated by preprocessing methods such as independent low-rank matrix analysis, and the estimation of the source model parameters. In the conventional RCSCME, between the two parameters constituting the deficient rank-1 component, only the scale is estimated, whereas the other parameter, the deficient basis, is fixed in advance; however, how to choose the fixed deficient basis is not unique. In the proposed RCSCME model, we also regard the deficient basis as a parameter to estimate. As the generative model of an observed signal, we utilized the super-Gaussian generalized Gaussian distribution, which achieves better separation performance than the Gaussian distribution in the conventional RCSCME. Assuming the model, we derive new majorization-minimization (MM)- and majorization-equalization (ME)-algorithm-based update rules for the deficient basis. In particular, among innumerable ME-algorithm-based update rules, we successfully find an ME-algorithm-based update rule with a mathematical proof supporting the fact that the step of the update rule is larger than that of the MM-algorithm-based update rule. We confirm that the proposed method outperforms conventional methods under several simulated noise conditions and a real noise condition.
引用
收藏
页数:24
相关论文
共 8 条
  • [1] Deficient-basis-complementary rank-constrained spatial covariance matrix estimation based on multivariate generalized Gaussian distribution for blind speech extraction
    Yuto Kondo
    Yuki Kubo
    Norihiro Takamune
    Daichi Kitamura
    Hiroshi Saruwatari
    EURASIP Journal on Advances in Signal Processing, 2022
  • [2] Blind Speech Extraction Based on Rank-Constrained Spatial Covariance Matrix Estimation With Multivariate Generalized Gaussian Distribution
    Kubo, Yuki
    Takamune, Norihiro
    Kitamura, Daichi
    Saruwatari, Hiroshi
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, 28 : 1948 - 1963
  • [3] DEFICIENT BASIS ESTIMATION OF NOISE SPATIAL COVARIANCE MATRIX FOR RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION METHOD IN BLIND SPEECH EXTRACTION
    Kondo, Yuto
    Kubo, Yuki
    Takamune, Norihiro
    Kitamura, Daichi
    Saruwatari, Hiroshi
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 806 - 810
  • [4] Acceleration of rank-constrained spatial covariance matrix estimation for blind speech extraction
    Kubo, Yuki
    Takamune, Norihiro
    Kitamura, Daichi
    Saruwatari, Hiroshi
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 332 - 338
  • [5] REAL-TIME SPEECH EXTRACTION USING SPATIALLY REGULARIZED INDEPENDENT LOW-RANK MATRIX ANALYSIS AND RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION
    Ishikawa, Yuto
    Konaka, Kohei
    Nakamura, Tomohiko
    Takamune, Norihiro
    Saruwatari, Hiroshi
    2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING WORKSHOPS, ICASSPW 2024, 2024, : 730 - 734
  • [6] MIMO filters based on robust rank-constrained Kronecker covariance matrix estimation
    Breloy, Arnaud
    Ginolhac, Guillaume
    Gao, Yongchan
    Pascal, Frederic
    SIGNAL PROCESSING, 2021, 187
  • [7] Evaluation of Multichannel Hearing Aid System by Rank-Constrained Spatial Covariance Matrix Estimation
    Une, Masakazu
    Kubo, Yuki
    Takamune, Norihiro
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Makino, Shoji
    2019 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2019, : 1874 - 1879
  • [8] Speech Enhancement by Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation via Independent Deeply Learned Matrix Analysis
    Misawa, Sota
    Takamune, Norihiro
    Nakamura, Tomohiko
    Kitamura, Daichi
    Saruwatari, Hiroshi
    Une, Masakazu
    Makino, Shoji
    2021 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2021, : 578 - 584