Initialization method for speech separation algorithms that work in the time-frequency domain

被引:5
|
作者
Sarmiento, Auxiliadora [1 ]
Duran-Diaz, Ivan [1 ]
Cruces, Sergio [1 ]
机构
[1] Univ Seville, Dept Teoria Senal & Comunicac, Seville 41092, Spain
来源
关键词
acoustic signal processing; blind source separation; independent component analysis; speech processing; time-frequency analysis;
D O I
10.1121/1.3310248
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This article addresses the problem of the unsupervised separation of speech signals in realistic scenarios. An initialization procedure is proposed for independent component analysis (ICA) algorithms that work in the time-frequency domain and require the prewhitening of the observations. It is shown that the proposed method drastically reduces the permuted solutions in that domain and helps to reduce the execution time of the algorithms. Simulations confirm these advantages for several ICA instantaneous algorithms and the effectiveness of the proposed technique in emulated reverberant environments.
引用
收藏
页码:EL121 / EL126
页数:6
相关论文
共 50 条
  • [21] A Time-Frequency Domain Blind Source Separation Method for Underdetermined Instantaneous Mixtures
    Peng, Tianliang
    Chen, Yang
    Liu, Zengli
    CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (12) : 3883 - 3895
  • [22] Time-frequency Domain Filter-and-sum Network for Multi-channel Speech Separation
    Deng, Zhewen
    Zhou, Yi
    Liu, Hongqing
    INTERSPEECH 2023, 2023, : 3689 - 3693
  • [23] Underdetermined blind separation of overlapped speech mixtures in time-frequency domain with estimated number of sources
    Zhang, Haijian
    Hua, Guang
    Yu, Lei
    Cai, Yunlong
    Bi, Guoan
    SPEECH COMMUNICATION, 2017, 89 : 1 - 16
  • [24] A Time-Frequency Preprocessing Method for Blind Source Separation of Speech Signal with Temporal Structure
    Wang, Zhe
    Bi, Guoan
    2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,
  • [25] PHASE RECONSTRUCTION METHOD BASED ON TIME-FREQUENCY DOMAIN HARMONIC STRUCTURE FOR SPEECH ENHANCEMENT
    Wakabayashi, Yukoh
    Fukumori, Takahiro
    Nakayama, Masato
    Nishiura, Takanobu
    Yamashita, Yoichi
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5560 - 5564
  • [26] Speech preprocessing and enhancement based on joint time domain and time-frequency domain analysis
    Zhang, Wenbo
    Xie, Xuefeng
    Du, Yanling
    Huang, Dongmei
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 155 (06): : 3580 - 3588
  • [27] Blind separation of speech mixtures via time-frequency masking
    Yilmaz, Ö
    Rickard, S
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) : 1830 - 1847
  • [28] Binaural Speech Separation Based on the Time-Frequency Binary Mask
    Mahmoodzadeh, A.
    Abutalebi, H. R.
    Soltanian-Zadeh, H.
    Sheikhzadeh, H.
    2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 848 - 853
  • [29] Towards a hardware realization of time-frequency source separation of speech
    Harte, N
    Hurley, N
    Fearon, C
    Rickard, S
    PROCEEDINGS OF THE 2005 EUROPEAN CONFERENCE ON CIRCUIT THEORY AND DESIGN, VOL 1, 2005, : 71 - 74
  • [30] Exploring the time-frequency microstructure of speech for blind source separation
    Wu, HC
    Principe, JC
    Xu, DX
    PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1145 - 1148