Convolutive blind separation of speech mixtures using the natural gradient

被引：41

作者：

Douglas, SC ^{[1
]}

Sun, XA ^{[1
]}

机构：

[1] So Methodist Univ, Dept Elect Engn, Dallas, TX 75275 USA

来源：

SPEECH COMMUNICATION | 2003年 / 39卷 / 1-2期

关键词：

D O I：

10.1016/S0167-6393(02)00059-6

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

Convolutive blind separation of speech, also known as the "cocktail party problem", is a challenging task for which few successful algorithms have been developed. In this paper, we explore two novel algorithms for separating mixtures of multiple speech signals as measured by multiple microphones in a room environment. Both algorithms are modifications of an existing approach for density-based multichannel blind deconvolution (MBD) using natural gradient adaptation. The first approach employs non-holonomic constraints on the multichannel separation system to effectively avoid the partial deconvolution of the extracted speech signals within the separation system's outputs. The second approach employs linear predictors within the coefficient updates and produces separated speech signals whose autocorrelation properties can be arbitrarily specified. Unlike MBD methods, the proposed techniques maintain the spectral content of the original speech signals in the extracted outputs. Performance comparisons of the proposed methods with existing techniques show their usefulness in separating real-world speech signal mixtures. (C) 2002 Published by Elsevier Science B.V.

引用

页码：65 / 78

页数：14

共 25 条

[1] Nonholonomic orthogonal learning algorithms for blind source separation [J].

Amari, S ;

Chen, TP ;

Cichocki, A .

NEURAL COMPUTATION, 2000, 12 (06) :1463-1484

[2] Natural gradient works efficiently in learning [J].

Amari, S .

NEURAL COMPUTATION, 1998, 10 (02) :251-276

[3] Multichannel blind deconvolution and equalization using the natural gradient [J].

Amari, S ;

Douglas, SC ;

Cichocki, A ;

Yang, HH .

FIRST IEEE SIGNAL PROCESSING WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, 1997, :101-104

[4]

AMARI S, 1997, P 11 IFAC S SYST ID, V3, P1057

[5]

ANEMULLER J, 2000, P 2 INT WORKSH IND C, P215

[6]

[Anonymous], UNSUPERVISED ADAPT 1

[7] SOME EXPERIMENTS ON THE RECOGNITION OF SPEECH, WITH ONE AND WITH 2 EARS [J].

CHERRY, EC .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1953, 25 (05) :975-979

[8]

DAVENPORT WB, 1950, 148 MIT RES LAB EL

[9] Microphone-array hearing aids with binaural output .1. Fixed-processing systems [J].

Desloge, JG ;

Rabinowitz, WM ;

Zurek, PM .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (06) :529-542

[10]

Douglas SC, 2001, DIGITAL SIGNAL PROC, P355

← 1 2 3 →