JOINT SINGLE-CHANNEL SPEECH SEPARATION AND SPEAKER IDENTIFICATION

被引：13

作者：

Mowlaee, P. ^{[1
]}

Saeidi, R. ^{[2
]}

Tan, Z. -H. ^{[1
]}

Christensen, M. G. ^{[3
]}

Franti, P. ^{[2
]}

Jensen, S. H. ^{[1
]}

机构：

[1] Aalborg Univ, Dept Elect Syst, Aalborg, Denmark

[2] Univ Joensuu, Dept Comp Sci & Stat, Joensuu, Finland

[3] Aalborg Univ, Dept Media, Aalborg, Denmark

来源：

2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2010年

关键词：

Single-channel speech separation; speaker identification; sinusoidal mixture estimator; vector quantization; Gaussian mixture model;

D O I：

10.1109/ICASSP.2010.5495619

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we propose a closed loop system to improve the performance of single-channel speech separation in a speaker independent scenario. The system is composed of two interconnected blocks: a separation block and a speaker identification block. The improvement is accomplished by incorporating the speaker identities found by the speaker identification block as additional information for the separation block, which converts the speaker-independent separation problem to a speaker-dependent one where the speaker codebooks are known. Simulation results show that the closed loop system enhances the quality of the separated output signals. To assess the improvements, the results are reported in terms of PESQ for both target and masked signals.

引用

页码：4430 / 4433

页数：4

共 50 条

[41] Speech Enhancement for Speaker Identification
Mahesh, R.
2018 9TH INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND NETWORKING TECHNOLOGIES (ICCCNT), 2018,
[42] On the choice of window size in model-based single channel speech separation
Radfar, M. H.
Dansereau, R. M.
Sayadiyan, A.
2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 1084 - +
[43] Closed-set Speaker Identification in Speech Gateways
Neiva, J.
Guimaraes, A.
Macedo, H.
IEEE LATIN AMERICA TRANSACTIONS, 2014, 12 (06) : 1127 - 1133
[44] Single Channel Speech Separation Using Maximum a Posteriori Estimation
Radfar, M. H.
Dansereau, R. M.
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 841 - 844
[45] Single Channel Speech Separation Using Deep Neural Network
Chen, Linlin
Ma, Xiaohong
Ding, Shuxue
ADVANCES IN NEURAL NETWORKS, PT I, 2017, 10261 : 285 - 292
[46] SPEAKER IDENTIFICATION WITH DISTANT MICROPHONE SPEECH
Jin, Qin
Li, Runxin
Yang, Qian
Laskowski, Kornel
Schultz, Tanja
2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4518 - 4521
[47] Speaker identification utilizing noncontemporary speech
Hollien, H
Schwartz, R
JOURNAL OF FORENSIC SCIENCES, 2001, 46 (01) : 63 - 67
[48] Speaker Identification using Whispered Speech
Jawarkar, Naresh P.
Holambe, Raghunath S.
Basu, Tapan Kumar
2013 INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT 2013), 2013, : 778 - 781
[49] Probabilistic Linear Discriminant Analysis for Robust Speaker Identification in Co-channel Speech
Shokouhi, Navid
Hansen, John H. L.
16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 3016 - 3020
[50] Speaker Re-identification with Speaker Dependent Speech Enhancement
Shi, Yanpei
Huang, Qiang
Hain, Thomas
INTERSPEECH 2020, 2020, : 1530 - 1534

← 1 2 3 4 5 →