Initialization method for speech separation algorithms that work in the time-frequency domain

被引:5
|
作者
Sarmiento, Auxiliadora [1 ]
Duran-Diaz, Ivan [1 ]
Cruces, Sergio [1 ]
机构
[1] Univ Seville, Dept Teoria Senal & Comunicac, Seville 41092, Spain
来源
关键词
acoustic signal processing; blind source separation; independent component analysis; speech processing; time-frequency analysis;
D O I
10.1121/1.3310248
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article addresses the problem of the unsupervised separation of speech signals in realistic scenarios. An initialization procedure is proposed for independent component analysis (ICA) algorithms that work in the time-frequency domain and require the prewhitening of the observations. It is shown that the proposed method drastically reduces the permuted solutions in that domain and helps to reduce the execution time of the algorithms. Simulations confirm these advantages for several ICA instantaneous algorithms and the effectiveness of the proposed technique in emulated reverberant environments.
引用
收藏
页码:EL121 / EL126
页数:6
相关论文
共 50 条
  • [31] Time-Frequency Masking in the Complex Domain for Speech Dereverberation and Denoising
    Williamson, Donald S.
    Wang, DeLiang
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2017, 25 (07) : 1492 - 1501
  • [32] Modeling speech signals in the time-frequency domain using GARCH
    Cohen, I
    SIGNAL PROCESSING, 2004, 84 (12) : 2453 - 2459
  • [33] An approach to digital watermarking of speech signals in the time-frequency domain
    Stankovic, Srdjan
    Orovic, Irena
    Zaric, Nikola
    Ioana, Cornel
    PROCEEDINGS ELMAR-2006, 2006, : 127 - 130
  • [34] HYBRID TIME-FREQUENCY DOMAIN ARTICULATORY SPEECH SYNTHESIZER.
    Sondhi, Man Mohan
    Schroeter, Juergen
    IEEE Transactions on Acoustics, Speech, and Signal Processing, 1987, ASSP-35 (07): : 955 - 967
  • [35] End-to-End Speech Separation Using Orthogonal Representation in Complex and Real Time-Frequency Domain
    Wang, Kai
    Huang, Hao
    Hu, Ying
    Huang, Zhihua
    Li, Sheng
    INTERSPEECH 2021, 2021, : 3046 - 3050
  • [36] A Time-Frequency Domain Formant Frequency Estimation Scheme for Noisy Speech Signals
    Fattah, S. A.
    Zhu, W-P.
    Ahmad, M. O.
    ISCAS: 2009 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-5, 2009, : 1201 - 1204
  • [37] Instantaneous polarization attributes in the time-frequency domain and wavefield separation
    Diallo, MS
    Kulesh, M
    Holschneider, M
    Scherbaum, F
    GEOPHYSICAL PROSPECTING, 2005, 53 (05) : 723 - 731
  • [38] A Frequency Domain Method for Speech Separation in a Reverberant Room
    Mischie, Septimiu
    Simion, Georgiana
    2010 9TH INTERNATIONAL SYMPOSIUM ON ELECTRONICS AND TELECOMMUNICATIONS (ISETC), 2010, : 303 - 306
  • [39] Underdetermined blind separation of nondisjoint sources in the time-frequency domain
    Aissa-El-Bey, Abdeldjalil
    Linh-Trung, Nguyen
    Abed-Meraim, Karim
    Belouchrani, Adel
    Grenier, Yves
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2007, 55 (03) : 897 - 907
  • [40] Underdetermined source separation of EEG signals in the time-frequency domain
    Shan, Zeyong
    Swary, Jacob
    Aviyente, Selin
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 3637 - 3640