Underdetermined blind sparse source separation for arbitrarily arranged multiple sensors

Cited by: 204
Authors
Araki, Shoko
Sawada, Hiroshi
Mukai, Ryo
Makino, Shoji
Affiliations
[1] NTT Corp, NTT Commun Sci Labs, Kyoto 6190237, Japan
[2] Hokkaido Univ, Grad Sch Informat Sci & Technol, Kita Ku, Sapporo, Hokkaido 0600814, Japan
Keywords
blind source separation; sparseness; clustering; normalization; binary mask; speech separation; reverberation;
DOI
10.1016/j.sigpro.2007.02.003
Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology];
Subject classification codes
0808 ; 0809 ;
Abstract
This paper presents a new method for blind sparse source separation. Some sparse source separation methods, which rely on source sparseness and an anechoic mixing model, have already been proposed. These methods utilize level ratios and phase differences between sensor observations as their features, and they separate signals by classifying them. However, some of the features cannot form clusters with a well-known clustering algorithm, e.g., the k-means. Moreover, most previous methods utilize a linear sensor array (or only two sensors), and therefore they cannot separate symmetrically positioned sources. To overcome such problems, we propose a new feature that can be clustered by the k-means algorithm and that can be easily applied to more than three sensors arranged non-linearly. We have obtained promising results for two- and three-dimensionally distributed speech separation with non-linear/non-uniform sensor arrays in a real room even in underdetermined situations. We also investigate the way in which the performance of such methods is affected by room reverberation, which may cause the sparseness and anechoic assumptions to collapse. (C) 2007 Elsevier B.V. All rights reserved.
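The general pipeline the abstract describes (features built from level and phase relations between sensors, k-means clustering, binary masking) can be illustrated with a minimal sketch. This is not the paper's exact feature: the frequency-dependent normalization is omitted, the data are a synthetic single-frequency anechoic mixture, and all names (`phase_normalized_features`, `kmeans`, `binary_masks`, the mixing vectors in `H`) are hypothetical choices made for illustration.

```python
import numpy as np

def phase_normalized_features(X, ref=0):
    """Map complex observations (n_points, n_sensors) to real feature vectors.

    Assumption: a simplified normalization. The phase of a reference sensor
    is removed and each vector is scaled to unit norm, so that points
    dominated by the same source collapse onto one cluster.
    """
    Z = X * np.exp(-1j * np.angle(X[:, [ref]]))  # remove reference-sensor phase
    Z = Z / np.maximum(np.linalg.norm(Z, axis=1, keepdims=True), 1e-12)
    return np.hstack([Z.real, Z.imag])

def kmeans(F, k, n_iter=50):
    """Plain k-means with deterministic farthest-point initialization."""
    centroids = [F[0]]
    for _ in range(1, k):
        d = np.min([np.sum((F - c) ** 2, axis=1) for c in centroids], axis=0)
        centroids.append(F[np.argmax(d)])
    C = np.array(centroids)
    for _ in range(n_iter):
        dist = ((F[:, None, :] - C[None, :, :]) ** 2).sum(axis=2)
        labels = dist.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                C[j] = F[labels == j].mean(axis=0)
    return labels, C

def binary_masks(labels, k):
    """One boolean mask per cluster; each point belongs to exactly one mask."""
    return np.stack([labels == j for j in range(k)], axis=0)

# Synthetic underdetermined case: 3 sources, 2 sensors, one active source
# per time-frequency point (the sparseness assumption).
rng = np.random.default_rng(0)
n, k = 600, 3
H = np.array([[1.0, 0.5 * np.exp(-1j * 0.8)],   # hypothetical mixing vectors
              [1.0, 1.0 * np.exp(1j * 0.1)],
              [1.0, 1.8 * np.exp(1j * 1.2)]])
true = rng.integers(0, k, size=n)
s = rng.uniform(0.5, 1.5, n) * np.exp(1j * rng.uniform(0, 2 * np.pi, n))
noise = 0.01 * (rng.standard_normal((n, 2)) + 1j * rng.standard_normal((n, 2)))
X = s[:, None] * H[true] + noise

labels, _ = kmeans(phase_normalized_features(X), k)
masks = binary_masks(labels, k)
```

In a real recording, `X` would hold short-time Fourier transform coefficients for every frequency bin, and the feature would also need to be normalized across frequency before a single clustering step could be applied to all bins.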
Pages: 1833-1847
Number of pages: 15
Related papers
50 in total
  • [1] Underdetermined Blind Source Separation with Fuzzy Clustering for Arbitrarily Arranged Sensors
    Jafari, Ingrid
    Haque, Serajul
    Togneri, Roberto
    Nordholm, Sven
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 1764 - +
  • [2] A ROBUST APPROACH TO REVERBERANT BLIND SOURCE SEPARATION IN THE PRESENCE OF NOISE FOR ARBITRARILY ARRANGED SENSORS
    Jafari, Ingrid
    Togneri, Roberto
    Nordholm, Sven
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 2413 - 2416
  • [3] DOA Estimation for Multiple Sparse Sources with Arbitrarily Arranged Multiple Sensors
    Shoko Araki
    Hiroshi Sawada
    Ryo Mukai
    Shoji Makino
    Journal of Signal Processing Systems, 2011, 63 : 265 - 275
  • [4] DOA Estimation for Multiple Sparse Sources with Arbitrarily Arranged Multiple Sensors
    Araki, Shoko
    Sawada, Hiroshi
    Mukai, Ryo
    Makino, Shoji
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2011, 63 (03): : 265 - 275
  • [5] Underdetermined blind source separation using sparse representations
    Bofill, P
    Zibulevsky, M
    SIGNAL PROCESSING, 2001, 81 (11) : 2353 - 2362
  • [6] Underdetermined Sparse Blind Source Separation by Clustering on Hyperplanes
    Tan Beihai
    Zhao Min
    PROCEEDINGS OF THE SECOND INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY, VOL I, 2009, : 270 - 274
  • [7] Underdetermined Blind Source Separation Based on Sparse Component
    Ren, Ming-rong
    Wang, Pu
    ICECT: 2009 INTERNATIONAL CONFERENCE ON ELECTRONIC COMPUTER TECHNOLOGY, PROCEEDINGS, 2009, : 174 - 177
  • [8] Underdetermined blind source separation based on sparse representation
    Li, YQ
    Amari, SI
    Cichocki, A
    Ho, DWC
    Xie, SL
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2006, 54 (02) : 423 - 437
  • [9] Underdetermined Blind Source Separation Using Sparse Coding
    Zhen, Liangli
    Peng, Dezhong
    Yi, Zhang
    Xiang, Yong
    Chen, Peng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (12) : 3102 - 3108
  • [10] A statistically sparse decomposition principle for underdetermined blind source separation
    Xiao, M
    Xie, SL
    Fu, YL
    ISPACS 2005: PROCEEDINGS OF THE 2005 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATION SYSTEMS, 2005, : 165 - 168