Sparseness-based 2ch BSS using the em algorithm in reverberant environment

被引：47

作者：

Izumi, Yosuke ^{[1
]}

Ono, Nobutaka ^{[1
]}

Sagayama, Shigeki ^{[1
]}

机构：

[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, Tokyo 1138656, Japan

来源：

2007 IEEE WORKSHOP ON APPLICATIONS OF SIGNAL PROCESSING TO AUDIO AND ACOUSTICS | 2007年

关键词：

D O I：

10.1109/ASPAA.2007.4393015

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we propose a new approach to sparseness-based BSS based on the EM algorithm, which iteratively estimates the DOA and the time-frequency mask for each source through the EM algorithm under the sparseness assumption. Our method has the following characteristics: 1) it enables the introduction of physical observation models such as the diffuse sound field, because the likelihood is defined in the original signal domain and not in the feature domain, 2) one does not necessarily have to know in advance the power of the background noise since they are also parameters which can be estimated from the observed signal, 3) it takes short computational time, 4) a common objective function is iteratively increased in localization and separation steps, which correspond to the E-step and M-step, respectively. Although our framework is applicable to general N channel BSS, we will concentrate on the formulation of the problem in the particular case where two sensory inputs are available, and we show some numerical simulation results.

引用

页码：147 / 150

页数：4

共 12 条

[1]

[Anonymous], P IEEE WORKSH APP SI

[2]

Araki S., 2005, P IEEE INT C AC SPEE, VIII, P81

[3]

CEMGIL AT, 2005, P 13 EUSIPCO

[4] MEASUREMENT OF CORRELATION COEFFICIENTS IN REVERBERANT SOUND FIELDS [J].

COOK, RK ;

WATERHOUSE, RV ;

BERENDT, RD ;

EDELMAN, S ;

THOMPSON, MC .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1955, 27 (06) :1072-1077

[5] A Bayesian approach for blind separation of sparse sources [J].

Fevotte, Cedric ;

Godsill, Simon J. .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (06) :2174-2188

[6]

MANDEL M, 2006, P NEUR INF P SYS

[7] Microphone array post-filter based on noise field coherence [J].

McCowan, IA ;

Bourlard, H .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (06) :709-716

[8]

OGRADY PD, 2004, INT C IND COMP AN GR, P428

[9]

Rickard S, 2002, INT CONF ACOUST SPEE, P529

[10]

VIELVA L, 2002, P IEEE INT C AC SPEE, V3, P3049

← 1 2 →