Initialization method for speech separation algorithms that work in the time-frequency domain

被引:5
|
作者
Sarmiento, Auxiliadora [1 ]
Duran-Diaz, Ivan [1 ]
Cruces, Sergio [1 ]
机构
[1] Univ Seville, Dept Teoria Senal & Comunicac, Seville 41092, Spain
来源
关键词
acoustic signal processing; blind source separation; independent component analysis; speech processing; time-frequency analysis;
D O I
10.1121/1.3310248
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article addresses the problem of the unsupervised separation of speech signals in realistic scenarios. An initialization procedure is proposed for independent component analysis (ICA) algorithms that work in the time-frequency domain and require the prewhitening of the observations. It is shown that the proposed method drastically reduces the permuted solutions in that domain and helps to reduce the execution time of the algorithms. Simulations confirm these advantages for several ICA instantaneous algorithms and the effectiveness of the proposed technique in emulated reverberant environments.
引用
收藏
页码:EL121 / EL126
页数:6
相关论文
共 50 条
  • [1] Improvement of the Initialization of ICA Time-Frequency Algorithms for Speech Separation
    Sarmiento, Auxiliadora
    Cruces, Sergio
    Duran, Ivan
    INDEPENDENT COMPONENT ANALYSIS AND SIGNAL SEPARATION, PROCEEDINGS, 2009, 5441 : 629 - 636
  • [2] Coarse-to-fine speech separation method in the time-frequency domain
    Yang, Xue
    Bao, Changchun
    Chen, Xianhong
    SPEECH COMMUNICATION, 2023, 155
  • [3] A New Initialization Method for Frequency-Domain Blind Source Separation Algorithms
    Clark, Felipe S. P.
    Petraglia, Mariane R.
    Haddad, Diego B.
    IEEE SIGNAL PROCESSING LETTERS, 2011, 18 (06) : 343 - 346
  • [4] TFPSNET: TIME-FREQUENCY DOMAIN PATH SCANNING NETWORK FOR SPEECH SEPARATION
    Yang, Lei
    Liu, Wei
    Wang, Weiqin
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 6842 - 6846
  • [5] A Method of Sound Segmentation in Time-Frequency Domain Using Peaks and Valleys in Spectrogram for Speech Separation
    Lim, Sung-Kil
    Lee, Hyon-Soo
    JOURNAL OF THE ACOUSTICAL SOCIETY OF KOREA, 2008, 27 (08): : 418 - 426
  • [6] A TIME-FREQUENCY BLIND SEPARATION METHOD FOR UNDERDETERMINED SPEECH MIXTURES
    Lv Yao Li Shuangtian(Institute of Acoustics
    JournalofElectronics(China), 2008, (05) : 702 - 708
  • [7] Underdetermined Blind Source Separation of Anechoic Speech Mixtures in the Time-frequency Domain
    Lv Yao
    Li Shuangtian
    ICSP: 2008 9TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, VOLS 1-5, PROCEEDINGS, 2008, : 22 - 25
  • [8] Segmentation on time-frequency domain for speech segregation
    Lim, Sung-Kil
    Lee, Hyon-Soo
    2006 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS, VOLS 1 AND 2, 2006, : 433 - +
  • [9] Watermarking of speech signals in the time-frequency domain
    Al-Khassaweneh, Mahmood
    Al-Zoubi, Hussein
    Aviyente, Selin
    2009 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY, 2009, : 317 - +
  • [10] Neural speech enhancement in the time-frequency domain
    Volkmer, M
    2003 IEEE XIII WORKSHOP ON NEURAL NETWORKS FOR SIGNAL PROCESSING - NNSP'03, 2003, : 617 - 626