Audio video based fast fixed-point independent vector analysis for multisource separation in a room environment

被引:13
|
作者
Liang, Yanfeng [1 ]
Naqvi, Syed Mohsen [1 ]
Chambers, Jonathon A. [1 ]
机构
[1] Univ Loughborough, Sch Elect Elect & Syst Engn, Loughborough LE11 3TU, Leics, England
来源
EURASIP JOURNAL ON ADVANCES IN SIGNAL PROCESSING | 2012年
基金
英国工程与自然科学研究理事会;
关键词
BLIND SOURCE SEPARATION; SPEECH;
D O I
10.1186/1687-6180-2012-183
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Fast fixed-point independent vector analysis (FastIVA) is an improved independent vector analysis (IVA) method, which can achieve faster and better separation performance than original IVA. As an example IVA method, it is designed to solve the permutation problem in frequency domain independent component analysis by retaining the higher order statistical dependency between frequencies during learning. However, the performance of all IVA methods is limited due to the dimensionality of the parameter space commonly encountered in practical frequency-domain source separation problems and the spherical symmetry assumed with the source model. In this article, a particular permutation problem encountered in using the FastIVA algorithm is highlighted, namely the block permutation problem. Therefore a new audio video based fast fixed-point independent vector analysis algorithm is proposed, which uses video information to provide a smart initialization for the optimization problem. The method cannot only avoid the ill convergence resulting from the block permutation problem but also improve the separation performance even in noisy and high reverberant environments. Different multisource datasets including the real audio video corpus AV16.3 are used to verify the proposed method. For the evaluation of the separation performance on real room recordings, a new pitch based evaluation criterion is also proposed.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Implementation of the Fast H∞ Filter Based on Fixed-Point Arithmetics
    Katsumata, Tomonori
    Nishiyama, Kiyoshi
    Matsuzuka, Haruo
    Satoh, Katsuaki
    2010 IEEE 10TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING PROCEEDINGS (ICSP2010), VOLS I-III, 2010, : 123 - +
  • [22] The Fixed-Point Algorithm and Maximum Likelihood Estimation for Independent Component Analysis
    Aapo Hyvärinen
    Neural Processing Letters, 1999, 10 : 1 - 5
  • [23] Fixed-point neural independent component analysis algorithms on the orthogonal group
    Fiori, S
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2006, 22 (04): : 430 - 440
  • [24] The fixed-point algorithm and maximum likelihood estimation for independent component analysis
    Hyvärinen, A
    NEURAL PROCESSING LETTERS, 1999, 10 (01) : 1 - 5
  • [25] Multisource Fault Signal Separation of Rotating Machinery Based on Wavelet Packet and Fast Independent Component Analysis
    Miao, Feng
    Zhao, Rongzhen
    Jia, Leilei
    Wang, Xianli
    INTERNATIONAL JOURNAL OF ROTATING MACHINERY, 2021, 2021
  • [26] A Survey of Optimization Methods for Independent Vector Analysis in Audio Source Separation
    Guo, Ruiming
    Luo, Zhongqiang
    Li, Mingchun
    SENSORS, 2023, 23 (01)
  • [27] An improved audio encoding architecture based on 16-bit fixed-point DSP
    Wang, X
    Dou, WB
    Hou, ZR
    2002 INTERNATIONAL CONFERENCE ON COMMUNICATIONS, CIRCUITS AND SYSTEMS AND WEST SINO EXPOSITION PROCEEDINGS, VOLS 1-4, 2002, : 918 - 921
  • [28] Audio source separation based on independent component analysis
    Makino, S
    Araki, S
    Mukai, R
    Sawada, H
    2004 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL 5, PROCEEDINGS, 2004, : 668 - 671
  • [29] A fixed-point algorithm for independent component analysis using AR source model
    Yang, Yumin
    Xu, Difei
    INFORMATION TECHNOLOGY, 2015, : 33 - 38
  • [30] A fixed-point algorithm for independent component analysis which uses a priori information
    Barros, AK
    Cichocki, A
    VTH BRAZILIAN SYMPOSIUM ON NEURAL NETWORKS, PROCEEDINGS, 1998, : 39 - 42