Independent Vector Analysis for Source Separation Using a Mixture of Gaussians Prior

Cited by: 24
Authors
Hao, Jiucang [1]
Lee, Intae [2]
Lee, Te-Won [3]
Sejnowski, Terrence J. [4, 5]
Affiliations
[1] Salk Inst Biol Studies, Computat Neurobiol Lab, La Jolla, CA 92037 USA
[2] Univ Calif San Diego, Inst Neural Computat, La Jolla, CA 92093 USA
[3] Qualcomm, San Diego, CA 92121 USA
[4] Salk Inst Biol Studies, Howard Hughes Med Inst, La Jolla, CA 92037 USA
[5] Univ Calif San Diego, Div Biol Sci, La Jolla, CA 92093 USA
Keywords
COMPONENT ANALYSIS; BLIND SEPARATION; ALGORITHMS
DOI
10.1162/neco.2010.11-08-906
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Convolutive mixtures of signals, which are common in acoustic environments, can be difficult to separate into their component sources. Here we present a uniform probabilistic framework to separate convolutive mixtures of acoustic signals using independent vector analysis (IVA), which is based on a joint distribution for the frequency components originating from the same source and is capable of preventing permutation disorder. Different Gaussian mixture models (GMMs) served as source priors, in contrast to the original IVA model, where all sources were modeled by identical multivariate Laplacian distributions. This flexible source prior enabled the IVA model to separate different types of signals. Three classes of models were derived and tested: noiseless IVA, online IVA, and noisy IVA. In the IVA model without sensor noise, the unmixing matrices were efficiently estimated by the expectation-maximization (EM) algorithm. An online EM algorithm was derived for the online IVA model to track the movement of the sources and separate them under nonstationary conditions. The noisy IVA model included the sensor noise and combined denoising with separation. An EM algorithm was developed that found the model parameters and separated the sources simultaneously. These algorithms were applied to separate mixtures of speech and music. Performance as measured by the signal-to-interference ratio (SIR) was substantial for all three models.
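As a rough illustration of the GMM source prior described in the abstract, the sketch below computes the posterior over mixture states for one source's frequency vector, which is the kind of quantity an EM E-step would need. It assumes zero-mean complex Gaussians with one precision per frequency bin and per mixture state, and a single state shared across all bins of a source; the function name `gmm_state_posterior` and the arguments `precisions` and `weights` are hypothetical and are not taken from the paper.

```python
import numpy as np

def gmm_state_posterior(y, precisions, weights):
    """Posterior over mixture states for one source's frequency vector.

    y          : (K,) complex array, one coefficient per frequency bin
    precisions : (S, K) positive array, assumed precision per state and bin
    weights    : (S,) positive array of prior state probabilities, sums to 1
    Returns (posterior of shape (S,), log evidence as a scalar).
    """
    power = np.abs(y) ** 2  # |y_k|^2 for each frequency bin
    # Zero-mean complex Gaussian: log N(y_k) = log(nu) - log(pi) - nu * |y_k|^2.
    # Summing over bins ties all frequencies of the source to the same state.
    log_lik = np.sum(np.log(precisions) - np.log(np.pi) - precisions * power, axis=1)
    log_joint = np.log(weights) + log_lik
    # Log-sum-exp over states, computed stably.
    log_evidence = np.logaddexp.reduce(log_joint)
    return np.exp(log_joint - log_evidence), log_evidence
```

Because the state is shared across all frequency bins, a frame that is loud in one bin makes the high-variance state more probable for every bin of that source; this coupling is one way to picture how a joint prior over frequencies can discourage permutation disorder.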
Pages: 1646-1673
Page count: 28