CentriForce: Multiple-Domain Adaptation for Domain-Invariant Speaker Representation Learning

被引:3
|
作者
Wei, Yuheng [1 ]
Du, Junzhao [1 ]
Liu, Hui [1 ]
Zhang, Zhipeng [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Training; Speaker recognition; Mathematical models; Adaptation models; Speech recognition; Representation learning; Task analysis; Multiple speech sources; multiple-domain adaptation; speaker embedding; speaker recognition; RECOGNITION;
D O I
10.1109/LSP.2022.3154237
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the real world, speaker recognition systems usually suffer from serious performance degradation due to the domain mismatch between training and test conditions. To alleviate the harmful effect of domain shift, unsupervised domain adaptation methods are introduced to learn domain-invariant speaker representations, which focus on addressing the single-source-to-single-target domain adaptation issue. However, labeled speaker data are usually collected from multiple sources, such as different languages, genres and devices. The single-domain adaptation methods can not deal with the complex multiple-domain mismatch problem. To address this issue, we propose a multiple-domain adaptation framework named CentriForce to extract domain-invariant speaker representations for speaker recognition. Different from previous methods, CentriForce learns multiple domain-related speaker representation spaces. To mitigate the multiple-domain mismatch, CentriForce reduces the Wasserstein distance between each pair of source and target domains in their domain-related representation space and meanwhile uses the target domain as an anchor point to draw all source domains closer to each other. In our experiments, CentriForce achieves the best performance on most of the 16 challenging adaptation tasks, compared with other competing adaptation methods. Ablation study and representation visualization further demonstrate its effectiveness for learning the domain-invariant speaker embedding.
引用
收藏
页码:807 / 811
页数:5
相关论文
共 50 条
  • [21] UNSUPERVISED DOMAIN ADAPTATION VIA DOMAIN ADVERSARIAL TRAINING FOR SPEAKER RECOGNITION
    Wang, Qing
    Rao, Wei
    Sun, Sining
    Xie, Lei
    Chng, Eng Siong
    Li, Haizhou
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4889 - 4893
  • [22] Learning Smooth Representation for Unsupervised Domain Adaptation
    Cai, Guanyu
    He, Lianghua
    Zhou, MengChu
    Alhumade, Hesham
    Hu, Die
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (08) : 4181 - 4195
  • [23] On robustness of unsupervised domain adaptation for speaker recognition
    Bousquet, Pierre-Michel
    Rouvier, Mickael
    INTERSPEECH 2019, 2019, : 2958 - 2962
  • [24] Multi-Source Domain Adaptation and Fusion for Speaker Verification
    Zhu, Donghui
    Chen, Ning
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 2103 - 2116
  • [25] Single-Source Cross-Domain Bearing Fault Diagnosis via Multipseudo-Domain-Augmented Adversarial Domain-Invariant Learning
    Bi, Yuanguo
    Fu, Rao
    Jiang, Cunyu
    Han, Guangjie
    Yin, Zhenyu
    Zhao, Liang
    Li, Qihao
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (19): : 31521 - 31533
  • [26] Domain Adaptation via Prompt Learning
    Ge, Chunjiang
    Huang, Rui
    Xie, Mixue
    Lai, Zihang
    Song, Shiji
    Li, Shuang
    Huang, Gao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2025, 36 (01) : 1160 - 1170
  • [27] DSIL-DDI: A Domain-Invariant Substructure Interaction Learning for Generalizable Drug-Drug Interaction Prediction
    Tang, Zhenchao
    Chen, Guanxing
    Yang, Hualin
    Zhong, Weihe
    Chen, Calvin Yu-Chian
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (08) : 10552 - 10560
  • [28] Learning Target-Domain-Specific Classifier for Partial Domain Adaptation
    Ren, Chuan-Xian
    Ge, Pengfei
    Yang, Peiyi
    Yan, Shuicheng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (05) : 1989 - 2001
  • [29] Representation learning via serial autoencoders for domain adaptation
    Yang, Shuai
    Zhang, Yuhong
    Zhu, Yi
    Li, Peipei
    Hu, Xuegang
    NEUROCOMPUTING, 2019, 351 : 1 - 9
  • [30] Representation Learning and Knowledge Distillation for Lightweight Domain Adaptation
    Bin Shah, Sayed Rafay
    Putty, Shreyas Subhash
    Schwung, Andreas
    2024 IEEE CONFERENCE ON ARTIFICIAL INTELLIGENCE, CAI 2024, 2024, : 1202 - 1207