Inverse Truncated Mixing Matrix (ITMM) Algorithm Application to Underdetermined Convolutive Blind Speech Sources Separation

被引：0

作者：

Peng Tianliang ^{[1
]}

Chen Yang ^{[1
]}

机构：

[1] Southeast Univ, Sch Informat Sci & Engn, Nanjing 210018, Jiangsu, Peoples R China

来源：

2015 IEEE International Conference on Signal Processing, Communications and Computing (ICSPCC) | 2015年

关键词：

Inverse Truncated Mixing Matrix; underdetermined; convolutive blind source separation; time-frequency; AUDIO SOURCE SEPARATION; FREQUENCY-DOMAIN; MIXTURES;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Inverse Truncated Mixing Matrix (ITMM) is a powerful method for underdetermined instantaneous blind source separation [1]. In this paper, we generalize ITMM algorithm to underdetermined convolutive blind source separation case. The proposed algorithm can be divided into two steps. The first step is the mixing filters estimation. The convolutive mixture can become an instantaneous mixture in time-frequency (TF) domain under some narrowband assumptions. Then, we used cluster method to estimate mixing matrix in every frequency bin. The second step is the source recovery part, we used ITMM method to mixing matrix in every frequency bin to source recovery in TF domain. Experimental evaluations are gained in artificial Room Impulse Responses (RIRs) environments, compared with conventional algorithms, the ITMM algorithm can separate speech sources to a higher signal-to-interference ratio (SIR).

引用

页码：801 / 806

页数：6

共 14 条

[1] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS [J].

ALLEN, JB ;

BERKLEY, DA .

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) :943-950

[2] Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors [J].

Araki, Shoko ;

Sawada, Hiroshi ;

Mukai, Ryo ;

Makino, Shoji .

SIGNAL PROCESSING, 2007, 87 (08) :1833-1847

[3] Underdetermined blind source separation using sparse representations [J].

Bofill, P ;

Zibulevsky, M .

SIGNAL PROCESSING, 2001, 81 (11) :2353-2362

[4] Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model [J].

Duong, Ngoc Q. K. ;

Vincent, Emmanuel ;

Gribonval, Remi .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (07) :1830-1840

[5] Underdetermined blind source separation based on sparse representation [J].

Li, YQ ;

Amari, SI ;

Cichocki, A ;

Ho, DWC ;

Xie, SL .

IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (02) :423-437

[6] A Time-Frequency Domain Blind Source Separation Method for Underdetermined Instantaneous Mixtures [J].

Peng, Tianliang ;

Chen, Yang ;

Liu, Zengli .

CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2015, 34 (12) :3883-3895

[7]

Peng Tianliang, INVERSE TRUNCA UNPUB

[8] A robust and precise method for solving the permutation problem of frequency-domain blind source separation [J].

Sawada, H ;

Mukai, R ;

Araki, S ;

Makino, S .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2004, 12 (05) :530-538

[9] Underdetermined Convolutive Blind Source Separation via Frequency Bin-Wise Clustering and Permutation Alignment [J].

Sawada, Hiroshi ;

Araki, Shoko ;

Makino, Shoji .

IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (03) :516-527

[10] Blind separation of convolved mixtures in the frequency domain [J].

Smaragdis, P .

NEUROCOMPUTING, 1998, 22 (1-3) :21-34

← 1 2 →