Initialization method for speech separation algorithms that work in the time-frequency domain

Cited by: 5
Authors
Sarmiento, Auxiliadora [1]
Duran-Diaz, Ivan [1]
Cruces, Sergio [1]
Affiliations
[1] Univ Seville, Dept Teoria Senal & Comunicac, Seville 41092, Spain
Source
Keywords
acoustic signal processing; blind source separation; independent component analysis; speech processing; time-frequency analysis
DOI
10.1121/1.3310248
Chinese Library Classification number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
This article addresses the problem of the unsupervised separation of speech signals in realistic scenarios. An initialization procedure is proposed for independent component analysis (ICA) algorithms that work in the time-frequency domain and require the prewhitening of the observations. It is shown that the proposed method drastically reduces the permuted solutions in that domain and helps to reduce the execution time of the algorithms. Simulations confirm these advantages for several ICA instantaneous algorithms and the effectiveness of the proposed technique in emulated reverberant environments.
Pages: EL121 - EL126
Page count: 6
Related papers
50 in total
  • [41] A recursive algorithm for blind sources separation in time-frequency domain
    Hu, XB
    Kobatake, H
    PROCEEDINGS OF THE 5TH ASIA-PACIFIC CONFERENCE ON CONTROL & MEASUREMENT, 2002, : 56 - 61
  • [42] A NEW TIME-FREQUENCY APPROACH FOR UNDERDETERMINED CONVOLUTIVE BLIND SPEECH SEPARATION
    Bouafif, Mariem
    Lachiri, Zied
    2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, : 3226 - 3230
  • [43] Blind speech source separation via nonlinear time-frequency masking
    Xu, Shun
    Chen, Shaorong
    Liu, Yulin
    Chinese Journal of Acoustics, 2008, (03): 203 - 214
  • [44] On the integration of time-frequency masking speech separation and recognition in underdetermined environments
    Jafari, Ingrid
    Haque, Serajul
    Togneri, Roberto
    Nordholm, Sven
    2012 CONFERENCE RECORD OF THE FORTY SIXTH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS (ASILOMAR), 2012, : 1613 - 1617
  • [45] Blind speech source separation via nonlinear time-frequency masking
    Xu, Shun
    Chen, Shaorong
    Liu, Yulin
    Shengxue Xuebao/Acta Acustica, 2007, 32 (04): : 375 - 381
  • [46] Auxiliary Function Based Independent Vector Analysis with Spatial Initialization for Frequency Domain Speech Separation
    Chen, Songbo
    Zhao, Yuxin
    Liang, Yanfeng
    2014 SEVENTH INTERNATIONAL JOINT CONFERENCE ON COMPUTATIONAL SCIENCES AND OPTIMIZATION (CSO), 2014, : 185 - 189
  • [47] Residual Unet with Attention Mechanism for Time-Frequency Domain Speech Enhancement
    Chen, Hanyu
    Peng, Xiwei
    Jiang, Qiqi
    Guo, Yujie
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 7007 - 7011
  • [48] Time-Frequency Domain Impulsive Noise Detection System in Speech Signal
    Choi, Min-Seok
    Shin, Ho Seon
    Hwang, Young-Soo
    Kang, Hong-Goo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2011, 30 (02): : 73 - 79
  • [49] Speech presence detection in the time-frequency domain using minimum statistics
    Sorensen, KV
    Andersen, SV
    NORSIG 2004: PROCEEDINGS OF THE 6TH NORDIC SIGNAL PROCESSING SYMPOSIUM, 2004, 46 : 340 - 343
  • [50] Frequency Gating: Improved Convolutional Neural Networks for Speech Enhancement in the Time-Frequency Domain
    Oostermeijer, Koen
    Wang, Qing
    Du, Jun
    2020 ASIA-PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE (APSIPA ASC), 2020, : 465 - 470