Collaborative and adversarial network for text-independent speaker verification in domain adaptation

Cited by: 0

Authors:

Qiang, Junhao [1]

Yang, Qun [1]

Gao, Jie [1]

Liu, Shaohan [1]

Affiliations:

[1] Nanjing University of Aeronautics and Astronautics, College of Computer Science and Technology, Nanjing, People's Republic of China

Source:

ELECTRONICS LETTERS | 2023 / Vol. 59 / Issue 02

Keywords:

audio signal processing; speaker recognition

DOI:

10.1049/ell2.12709

Chinese Library Classification (CLC) number:

TM [Electrical engineering]; TN [Electronic technology, communication technology]

Discipline classification code:

0808; 0809

Abstract:

Speaker verification models achieve good results on single-genre data, but performance degrades when the training and testing data come from different domains. Adversarial training has been proposed to address this problem by minimizing the difference between domain distributions. However, in pursuing domain-invariant speaker representations, adversarial training ignores domain-specific information. This paper proposes an improved collaborative adversarial network for domain adaptation in speaker verification. Compared with standard adversarial training, it adds a collaborative discriminator that learns domain-specific information at the lower layers. A projection block is further added to the collaborative discriminator to reduce the noise that the discriminator introduces. Experiments are conducted in different mismatch scenarios and with different speaker encoders. In all experiments, the method outperforms both the baseline and previous work based on adversarial training.


Pages: 3

