DISTRIBUTED MICROPHONE ARRAY PROCESSING FOR SPEECH SOURCE SEPARATION WITH CLASSIFIER FUSION

被引:0
作者
Souden, Mehrez [1 ]
Kinoshita, Keisuke [1 ]
Delcroix, Marc [1 ]
Nakatani, Tomohiro [1 ]
机构
[1] NTT Commun Sci Labs, Kyoto, Japan
来源
2012 IEEE INTERNATIONAL WORKSHOP ON MACHINE LEARNING FOR SIGNAL PROCESSING (MLSP) | 2012年
关键词
Distributed microphone array processing; blind source separation; speech clustering; classifier combination; BLIND; MIXTURES;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We propose a new approach for clustering and separating competing speech signals using a distributed microphone array (DMA). This approach can be viewed as an extension of expectation-maximization (EM)-based source separation to DMAs. To achieve distributed processing, we assume the conditional independence (with respect to sources' activities) of the normalized recordings of different nodes. By doing so, only the posterior probabilities of sources' activities need to be shared between nodes. Consequently, the EM algorithm is formulated such that at the expectation step, local posterior probabilities are estimated locally and shared between nodes. In the maximization step, every node fuses the received probabilities via either product or sum rules and estimates its local parameters. We show that, even if we make binary decisions (presence/absence of speech) during EM iterations instead of transmitting continuous posterior probability values, we can achieve separation without causing significant speech distortion. Our preliminary investigations demonstrate that the proposed processing technique approaches the centralized solution and can outperform Oracle best node-wise clustering in terms of objective source separation metrics.
引用
收藏
页数:6
相关论文
共 20 条
  • [1] IMAGE METHOD FOR EFFICIENTLY SIMULATING SMALL-ROOM ACOUSTICS
    ALLEN, JB
    BERKLEY, DA
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1979, 65 (04) : 943 - 950
  • [2] Aoki M., 2001, Acoustical Science and Technology, V22, P149, DOI 10.1250/ast.22.149
  • [3] Sparse source separation based on simultaneous clustering of source locational and spectral features
    Araki, Shoko
    Nakatani, Tomohiro
    Sawada, Hiroshi
    [J]. ACOUSTICAL SCIENCE AND TECHNOLOGY, 2011, 32 (04) : 161 - 164
  • [4] Bertrand A., 2011, P IEEE S COMM VEH TE, P1, DOI [10.1109/SCVT.2011.6101302, DOI 10.1109/SCVT.2011.6101302]
  • [5] Distributed Adaptive Estimation of Node-Specific Signals in Wireless Sensor Networks With a Tree Topology
    Bertrand, Alexander
    Moonen, Marc
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (05) : 2196 - 2210
  • [6] Noise estimation by minima controlled recursive averaging for robust speech enhancement
    Cohen, I
    Berdugo, B
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2002, 9 (01) : 12 - 15
  • [7] Reduced-Bandwidth and Distributed MWF-Based Noise Reduction Algorithms for Binaural Hearing Aids
    Doclo, Simon
    Moonen, Marc
    Van den Bogaert, Tim
    Wouters, Jan
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (01): : 38 - 51
  • [8] Fisher W., 1986, PROC DARPA WORKSHOP, P93
  • [9] Distributed EM algorithm for Gaussian mixtures in sensor networks
    Gu, Dongbing
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (07): : 1154 - 1166
  • [10] Clustered Blind Beamforming From Ad-Hoc Microphone Arrays
    Himawan, Ivan
    McCowan, Iain
    Sridharan, Sridha
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04): : 661 - 676