Initialization method for speech separation algorithms that work in the time-frequency domain

Cited by: 5

Authors:

Sarmiento, Auxiliadora [1]

Duran-Diaz, Ivan [1]

Cruces, Sergio [1]

Affiliations:

[1] Univ Seville, Dept Teoria Senal & Comunicac, Seville 41092, Spain

Source:

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA | 2010 / Vol. 127 / No. 04

Keywords:

acoustic signal processing; blind source separation; independent component analysis; speech processing; time-frequency analysis;

DOI:

10.1121/1.3310248

Chinese Library Classification:

O42 [Acoustics];

Subject classification codes:

070206 ; 082403 ;

Abstract:

This article addresses the problem of the unsupervised separation of speech signals in realistic scenarios. An initialization procedure is proposed for independent component analysis (ICA) algorithms that work in the time-frequency domain and require the prewhitening of the observations. It is shown that the proposed method drastically reduces the permuted solutions in that domain and helps to reduce the execution time of the algorithms. Simulations confirm these advantages for several ICA instantaneous algorithms and the effectiveness of the proposed technique in emulated reverberant environments.
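The abstract states that the targeted ICA algorithms require prewhitening of the observations. As an illustration of that standard preprocessing step only (not of the proposed initialization, which the abstract does not detail), the following is a minimal sketch that whitens the complex observations of a single frequency bin. The function name `prewhiten`, the mixing matrix `A`, and the synthetic Laplacian sources are assumptions made for the example.

```python
import numpy as np

def prewhiten(X):
    """Whiten complex observations X (channels x frames) of one frequency bin.

    Hypothetical helper for illustration; returns the whitened data Z and
    the whitening matrix W such that Z = W @ (X - mean).
    """
    # Remove the mean of each channel.
    X = X - X.mean(axis=1, keepdims=True)
    # Sample covariance matrix (Hermitian, positive definite here).
    R = (X @ X.conj().T) / X.shape[1]
    # Eigendecomposition R = E diag(d) E^H.
    eigvals, eigvecs = np.linalg.eigh(R)
    # Whitening matrix W = diag(d)^(-1/2) E^H, so that W R W^H = I.
    W = np.diag(1.0 / np.sqrt(eigvals)) @ eigvecs.conj().T
    return W @ X, W

# Synthetic example: two complex sources mixed by an assumed matrix A.
rng = np.random.default_rng(0)
n_frames = 2000
S = rng.laplace(size=(2, n_frames)) + 1j * rng.laplace(size=(2, n_frames))
A = np.array([[1.0 + 0.5j, 0.3 - 0.2j],
              [0.4 + 0.1j, 1.0 - 0.3j]])
X = A @ S

Z, W = prewhiten(X)
# Covariance of the whitened observations; it should be the identity.
R_z = (Z @ Z.conj().T) / n_frames
```

After whitening, the remaining separation matrix in each bin can be restricted to a unitary matrix, which is why many instantaneous ICA algorithms apply this step before their main iteration.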


Pages: EL121-EL126

Page count: 6

