An analysis of the limitations of blind signal separation application with speech

Cited by: 13
Authors
Smith, D [1]
Lukasiak, J [1]
Burnett, IS [1]
Affiliations
[1] Univ Wollongong, Sch Elect Comp & Telecommun Engn, Whisper Labs, Wollongong, NSW, Australia
Keywords
mutual information; blind signal separation; independent component analysis; statistical independence
DOI
10.1016/j.sigpro.2005.05.020
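Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract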
Blind Signal Separation (BSS) techniques are commonly employed in the separation of speech signals, using Independent Component Analysis (ICA) as the criterion for separation. This paper investigates the viability of employing ICA for real-time speech separation (where short frame sizes are the norm). The relationship between the statistics of speech and the assumption of statistical independence (at the core of ICA) is examined over a range of frame sizes. The investigation confirms that statistical independence is not a valid assumption for speech when divided into the short frames appropriate to real-time separation. This is primarily due to the quasi-stationary nature of speech over the temporal short term. We conclude that employing ICA for real-time speech separation will always result in limited performance due to a fundamental failure to meet the strict assumptions of ICA. (C) 2005 Elsevier B.V. All rights reserved.
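The kind of measurement the abstract describes (estimating statistical dependence between two sources over a range of frame sizes) can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' procedure: the fixed-bin histogram estimator, the function names `histogram_mi` and `mean_framewise_mi`, the bin count, the frame sizes, and the synthetic Laplacian sources standing in for speech are all choices made for this example. The plug-in estimator is also biased upward when a frame holds few samples, so the numbers reflect estimator behaviour as well as any true dependence.

```python
import numpy as np


def histogram_mi(x, y, bins=8):
    """Plug-in mutual information estimate (in nats) from a 2-D histogram."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()                 # empirical joint distribution
    px = pxy.sum(axis=1, keepdims=True)       # marginal of x, shape (bins, 1)
    py = pxy.sum(axis=0, keepdims=True)       # marginal of y, shape (1, bins)
    nz = pxy > 0                              # skip empty cells to avoid log(0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px @ py)[nz])))


def mean_framewise_mi(x, y, frame_size, bins=8):
    """Average the per-frame estimate over non-overlapping frames."""
    n_frames = len(x) // frame_size
    values = []
    for i in range(n_frames):
        frame = slice(i * frame_size, (i + 1) * frame_size)
        values.append(histogram_mi(x[frame], y[frame], bins))
    return float(np.mean(values))


# Assumption: independent Laplacian noise as a crude stand-in for two
# speech sources; real speech would have to be loaded from recordings.
rng = np.random.default_rng(0)
n_samples = 1 << 16
s1 = rng.laplace(size=n_samples)
s2 = rng.laplace(size=n_samples)

# Hypothetical frame sizes (in samples), from short to long.
results = {fs: mean_framewise_mi(s1, s2, fs) for fs in (256, 1024, 4096, 16384)}
for frame_size, mi in results.items():
    print(f"frame size {frame_size:6d}: mean estimated MI = {mi:.4f} nats")
```

Even for sources that are independent by construction, the short-frame estimates come out larger than the long-frame ones, which shows how little evidence of independence a short frame can carry. Replacing `s1` and `s2` with real speech recordings would be needed to observe the effect the abstract attributes to the quasi-stationary nature of speech.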
Pages: 353-359
Number of pages: 7