Collaborative and adversarial network for text-independent speaker verification in domain adaptation

被引:0
|
作者
Qiang, Junhao [1 ]
Yang, Qun [1 ]
Gao, Jie [1 ]
Liu, Shaohan [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing, Peoples R China
关键词
audio signal processing; speaker recognition;
D O I
10.1049/ell2.12709
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Speaker verification models have achieved good results on the single genre data. But the performance degrades when model training and testing are not in the same domain. The adversarial training method is proposed to solve this problem by minimizing domain distribution differences. However, the adversarial training ignores domain-specific information for the domain-invariant speaker representations. In this paper, an improved collaborative adversarial network for domain adaptation in speaker verification is performed. Compared to the adversarial training, a collaborative discriminator is newly incorporated that learns domain-specific information at the lower layers. Further, the projection block is added to the collaborative discriminator. It reduces the noise introduced by the collaborative discriminator. Experiments are conducted in different mismatch scenarios and using different speaker encoders. All the experimental results show that the performance of this method is better than the baseline and previous work using adversarial training.
引用
收藏
页数:3
相关论文
共 50 条
  • [31] Score normalization for text-independent speaker verification systems
    Auckenthaler, R
    Carey, M
    Lloyd-Thomas, H
    DIGITAL SIGNAL PROCESSING, 2000, 10 (1-3) : 42 - 54
  • [32] Text-Independent Speaker Verification Based on Triplet Convolutional Neural Network Embeddings
    Zhang, Chunlei
    Koishida, Kazuhito
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1633 - 1644
  • [33] Exploration of Local Variability in Text-Independent Speaker Verification
    Chen, Liping
    Lee, Kong Aik
    Ma, Bin
    Guo, Wu
    Li, Haizhou
    Dai, Li-Rong
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 82 (02): : 217 - 228
  • [34] Local Variability Vector for Text-Independent Speaker Verification
    Chen, Liping
    Lee, Kong Aik
    Ma, Bin
    Guo, Wu
    Li, Haizhou
    Dai, Li Rong
    2014 9TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2014, : 54 - +
  • [35] A robust sequential test for text-independent speaker verification
    Lund, MA
    Lee, CC
    JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (01): : 609 - 621
  • [36] Robust sequential test for text-independent speaker verification
    Lund, Michael A.
    Lee, C.C.
    Journal of the Acoustical Society of America, 1996, 99 (01):
  • [37] Masked Proxy Loss For Text-Independent Speaker Verification
    Dan, Jiachen
    Kumar, Aiswarya Vinod
    Dhamyal, Hira
    Raj, Bhiksha
    Singh, Rita
    INTERSPEECH 2021, 2021, : 4638 - 4642
  • [38] A New Score Normalization for Text-Independent Speaker Verification
    Ning, Hongke
    Zou, Y. X.
    Hu, Xuyan
    2014 19TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2014, : 636 - 639
  • [39] Text-independent speaker verification:: State of the art and challenges
    Petrovska-Delacretaz, Dijana
    El Hannani, Asmaa
    Chollet, Gerard
    PROGRESS IN NONLINEAR SPEECH PROCESSING, 2007, 4391 : 135 - +
  • [40] Exploration of Local Variability in Text-Independent Speaker Verification
    Liping Chen
    Kong Aik Lee
    Bin Ma
    Wu Guo
    Haizhou Li
    Li-Rong Dai
    Journal of Signal Processing Systems, 2016, 82 : 217 - 228