Initialization method for speech separation algorithms that work in the time-frequency domain

被引：5

作者：

Sarmiento, Auxiliadora ^{[1
]}

Duran-Diaz, Ivan ^{[1
]}

Cruces, Sergio ^{[1
]}

机构：

[1] Univ Seville, Dept Teoria Senal & Comunicac, Seville 41092, Spain

来源：

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2010年 / 127卷 / 04期

关键词：

acoustic signal processing; blind source separation; independent component analysis; speech processing; time-frequency analysis;

D O I：

10.1121/1.3310248

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

This article addresses the problem of the unsupervised separation of speech signals in realistic scenarios. An initialization procedure is proposed for independent component analysis (ICA) algorithms that work in the time-frequency domain and require the prewhitening of the observations. It is shown that the proposed method drastically reduces the permuted solutions in that domain and helps to reduce the execution time of the algorithms. Simulations confirm these advantages for several ICA instantaneous algorithms and the effectiveness of the proposed technique in emulated reverberant environments.

引用

页码：EL121 / EL126

页数：6

共 50 条

[21] A Time-Frequency Domain Blind Source Separation Method for Underdetermined Instantaneous Mixtures
Peng, Tianliang
Chen, Yang
Liu, Zengli
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (12) : 3883 - 3895
[22] Time-frequency Domain Filter-and-sum Network for Multi-channel Speech Separation
Deng, Zhewen
Zhou, Yi
Liu, Hongqing
INTERSPEECH 2023, 2023, : 3689 - 3693
[23] Underdetermined blind separation of overlapped speech mixtures in time-frequency domain with estimated number of sources
Zhang, Haijian
Hua, Guang
Yu, Lei
Cai, Yunlong
Bi, Guoan
SPEECH COMMUNICATION, 2017, 89 : 1 - 16
[24] A Time-Frequency Preprocessing Method for Blind Source Separation of Speech Signal with Temporal Structure
Wang, Zhe
Bi, Guoan
2015 10TH INTERNATIONAL CONFERENCE ON INFORMATION, COMMUNICATIONS AND SIGNAL PROCESSING (ICICS), 2015,
[25] PHASE RECONSTRUCTION METHOD BASED ON TIME-FREQUENCY DOMAIN HARMONIC STRUCTURE FOR SPEECH ENHANCEMENT
Wakabayashi, Yukoh
Fukumori, Takahiro
Nakayama, Masato
Nishiura, Takanobu
Yamashita, Yoichi
2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5560 - 5564
[26] Speech preprocessing and enhancement based on joint time domain and time-frequency domain analysis
Zhang, Wenbo
Xie, Xuefeng
Du, Yanling
Huang, Dongmei
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2024, 155 (06): : 3580 - 3588
[27] Blind separation of speech mixtures via time-frequency masking
Yilmaz, Ö
Rickard, S
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (07) : 1830 - 1847
[28] Binaural Speech Separation Based on the Time-Frequency Binary Mask
Mahmoodzadeh, A.
Abutalebi, H. R.
Soltanian-Zadeh, H.
Sheikhzadeh, H.
2012 SIXTH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST), 2012, : 848 - 853
[29] Towards a hardware realization of time-frequency source separation of speech
Harte, N
Hurley, N
Fearon, C
Rickard, S
PROCEEDINGS OF THE 2005 EUROPEAN CONFERENCE ON CIRCUIT THEORY AND DESIGN, VOL 1, 2005, : 71 - 74
[30] Exploring the time-frequency microstructure of speech for blind source separation
Wu, HC
Principe, JC
Xu, DX
PROCEEDINGS OF THE 1998 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-6, 1998, : 1145 - 1148

← 1 2 3 4 5 →