Audio video based fast fixed-point independent vector analysis for multisource separation in a room environment

Cited by: 13
Authors
Liang, Yanfeng [1]
Naqvi, Syed Mohsen [1]
Chambers, Jonathon A. [1]
Affiliations
[1] Univ Loughborough, Sch Elect Elect & Syst Engn, Loughborough LE11 3TU, Leics, England
Source
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2012
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
BLIND SOURCE SEPARATION; SPEECH;
DOI
10.1186/1687-6180-2012-183
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract
Fast fixed-point independent vector analysis (FastIVA) is an improved independent vector analysis (IVA) method that achieves faster and better separation performance than the original IVA. Like other IVA methods, it is designed to solve the permutation problem in frequency-domain independent component analysis by retaining the higher-order statistical dependency between frequencies during learning. However, the performance of all IVA methods is limited by the dimensionality of the parameter space commonly encountered in practical frequency-domain source separation problems and by the spherical symmetry assumed in the source model. In this article, a particular permutation problem encountered when using the FastIVA algorithm is highlighted, namely the block permutation problem. A new audio-video-based fast fixed-point independent vector analysis algorithm is therefore proposed, which uses video information to provide a smart initialization for the optimization problem. The method can not only avoid the ill convergence resulting from the block permutation problem but also improve the separation performance, even in noisy and highly reverberant environments. Different multisource datasets, including the real audio-video corpus AV16.3, are used to verify the proposed method. For the evaluation of the separation performance on real room recordings, a new pitch-based evaluation criterion is also proposed.
Pages: 16