Independent Vector Analysis for Blind Speech Separation Using Complex Generalized Gaussian Mixture Model with Weighted Variance

被引:0
|
作者
Tang, Xinyu [1 ,2 ]
Chen, Rilin [1 ]
Wang, Xiyuan [3 ]
Zhou, Yi [2 ]
Su, Dan [1 ]
机构
[1] Tencent AI Lab, Beijing 100193, Peoples R China
[2] Chongqing Univ Posts & Telecommun, Sch Commun & Informat Engn, Chongqing 400065, Peoples R China
[3] Beijing Informat Sci & Technol Univ, Sch Informat & Commun Engn, Beijing 100101, Peoples R China
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose using complex generalized Gaussian mixture distribution with weighted variance for speech modelling and devise an improved independent vector analysis (IVA) algorithm for blind speech separation (BSS). Capable of capturing both non-Gaussianity and non-stationarity, the proposed complex generalized Gaussian mixture model (CGGMM) allows for a much flexible characterization of practical speech signals. The majorization minimization (MM) framework is adopted for the IVA algorithm design. Each iteration of the algorithm is comprised of the updates of demixing matrices and mixture model parameters. For demixing matrices, the update operates in a manner similar to that of the auxiliary function based IVA (AuxIVA) method, and for mixture parameters, the expectation maximization (EM) update is performed. As both updates are in closed form and pre-whitening is not a prerequisite, the IVA algorithm under CGGMM is of low complexity and can be carried out efficiently. Experimental results show that the proposed algorithm outperforms existing ones in terms of separation accuracy and also enjoys a fast convergence rate in both simulated and real environments.
引用
收藏
页码:720 / 726
页数:7
相关论文
共 50 条
  • [31] Classification of stressed speech using Gaussian mixture model
    Patro, H
    Raja, GS
    Dandapat, S
    INDICON 2005 Proceedings, 2005, : 342 - 346
  • [32] Speech emotion recognition using Gaussian mixture vector autoregressive models
    El Ayadi, Moataz M. H.
    Kamel, Mohamed S.
    Karray, Fakhri
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL IV, PTS 1-3, 2007, : 957 - +
  • [33] An Expectation-Maximization Algorithm for Blind Separation of Noisy Mixtures Using Gaussian Mixture Model
    Fanglin Gu
    Hang Zhang
    Wenwu Wang
    Shan Wang
    Circuits, Systems, and Signal Processing, 2017, 36 : 2697 - 2726
  • [34] An Expectation-Maximization Algorithm for Blind Separation of Noisy Mixtures Using Gaussian Mixture Model
    Gu, Fanglin
    Zhang, Hang
    Wang, Wenwu
    Wang, Shan
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2017, 36 (07) : 2697 - 2726
  • [35] Mixing Matrix Estimation in Blind Source Separation Based on Generalized Gaussian Mixture Modal
    Chen, Yongqiang
    Liu, Jun
    SMART TECHNOLOGIES FOR COMMUNICATION, 2012, 4 : 217 - 221
  • [36] A Gaussian mixture model for underdetermined independent component analysis
    Zhang, Yingyu
    Shi, Xizhi
    Chen, Chi Hau
    SIGNAL PROCESSING, 2006, 86 (07) : 1538 - 1549
  • [37] Blind speech separation using a joint model of speech production
    Smith, D
    Lukasiak, J
    Burnett, I
    IEEE SIGNAL PROCESSING LETTERS, 2005, 12 (11) : 784 - 787
  • [38] Blind speech separation employing Laplacian normal mixture distribution model
    Cai, Hua
    Sun, Junxi
    Ou, Shifeng
    2007 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, VOLS I-V, CONFERENCE PROCEEDINGS, 2007, : 3185 - +
  • [39] Independent vector analysis for convolutive blind noncircular source separation
    Zhang, Hefa
    Li, Liping
    Li, Wanchun
    SIGNAL PROCESSING, 2012, 92 (09) : 2275 - 2283
  • [40] Mixture texture model with weighted generalized inverse Gaussian distribution for target detection
    Chen, Xiaolin
    Liu, Kai
    Zhang, Zhibo
    Deng, Hui
    DIGITAL SIGNAL PROCESSING, 2024, 154