JOINT SINGLE-CHANNEL SPEECH SEPARATION AND SPEAKER IDENTIFICATION

被引:13
|
作者
Mowlaee, P. [1 ]
Saeidi, R. [2 ]
Tan, Z. -H. [1 ]
Christensen, M. G. [3 ]
Franti, P. [2 ]
Jensen, S. H. [1 ]
机构
[1] Aalborg Univ, Dept Elect Syst, Aalborg, Denmark
[2] Univ Joensuu, Dept Comp Sci & Stat, Joensuu, Finland
[3] Aalborg Univ, Dept Media, Aalborg, Denmark
来源
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年
关键词
Single-channel speech separation; speaker identification; sinusoidal mixture estimator; vector quantization; Gaussian mixture model;
D O I
10.1109/ICASSP.2010.5495619
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we propose a closed loop system to improve the performance of single-channel speech separation in a speaker independent scenario. The system is composed of two interconnected blocks: a separation block and a speaker identification block. The improvement is accomplished by incorporating the speaker identities found by the speaker identification block as additional information for the separation block, which converts the speaker-independent separation problem to a speaker-dependent one where the speaker codebooks are known. Simulation results show that the closed loop system enhances the quality of the separated output signals. To assess the improvements, the results are reported in terms of PESQ for both target and masked signals.
引用
收藏
页码:4430 / 4433
页数:4
相关论文
共 50 条
  • [41] Speech Enhancement for Speaker Identification
    Mahesh, R.
    2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
  • [42] On the choice of window size in model-based single channel speech separation
    Radfar, M. H.
    Dansereau, R. M.
    Sayadiyan, A.
    2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 1084 - +
  • [43] Closed-set Speaker Identification in Speech Gateways
    Neiva, J.
    Guimaraes, A.
    Macedo, H.
    IEEE LATIN AMERICA TRANSACTIONS, 2014, 12 (06) : 1127 - 1133
  • [44] Single Channel Speech Separation Using Maximum a Posteriori Estimation
    Radfar, M. H.
    Dansereau, R. M.
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 841 - 844
  • [45] Single Channel Speech Separation Using Deep Neural Network
    Chen, Linlin
    Ma, Xiaohong
    Ding, Shuxue
    ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 : 285 - 292
  • [46] SPEAKER IDENTIFICATION WITH DISTANT MICROPHONE SPEECH
    Jin, Qin
    Li, Runxin
    Yang, Qian
    Laskowski, Kornel
    Schultz, Tanja
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4518 - 4521
  • [47] Speaker identification utilizing noncontemporary speech
    Hollien, H
    Schwartz, R
    JOURNAL OF FORENSIC SCIENCES, 2001, 46 (01) : 63 - 67
  • [48] Speaker Identification using Whispered Speech
    Jawarkar, Naresh P.
    Holambe, Raghunath S.
    Basu, Tapan Kumar
    2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 778 - 781
  • [49] Probabilistic Linear Discriminant Analysis for Robust Speaker Identification in Co-channel Speech
    Shokouhi, Navid
    Hansen, John H. L.
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3016 - 3020
  • [50] Speaker Re-identification with Speaker Dependent Speech Enhancement
    Shi, Yanpei
    Huang, Qiang
    Hain, Thomas
    INTERSPEECH 2020, 2020, : 1530 - 1534