CentriForce: Multiple-Domain Adaptation for Domain-Invariant Speaker Representation Learning

被引:3
|
作者
Wei, Yuheng [1 ]
Du, Junzhao [1 ]
Liu, Hui [1 ]
Zhang, Zhipeng [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Training; Speaker recognition; Mathematical models; Adaptation models; Speech recognition; Representation learning; Task analysis; Multiple speech sources; multiple-domain adaptation; speaker embedding; speaker recognition; RECOGNITION;
D O I
10.1109/LSP.2022.3154237
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the real world, speaker recognition systems usually suffer from serious performance degradation due to the domain mismatch between training and test conditions. To alleviate the harmful effect of domain shift, unsupervised domain adaptation methods are introduced to learn domain-invariant speaker representations, which focus on addressing the single-source-to-single-target domain adaptation issue. However, labeled speaker data are usually collected from multiple sources, such as different languages, genres and devices. The single-domain adaptation methods can not deal with the complex multiple-domain mismatch problem. To address this issue, we propose a multiple-domain adaptation framework named CentriForce to extract domain-invariant speaker representations for speaker recognition. Different from previous methods, CentriForce learns multiple domain-related speaker representation spaces. To mitigate the multiple-domain mismatch, CentriForce reduces the Wasserstein distance between each pair of source and target domains in their domain-related representation space and meanwhile uses the target domain as an anchor point to draw all source domains closer to each other. In our experiments, CentriForce achieves the best performance on most of the 16 challenging adaptation tasks, compared with other competing adaptation methods. Ablation study and representation visualization further demonstrate its effectiveness for learning the domain-invariant speaker embedding.
引用
收藏
页码:807 / 811
页数:5
相关论文
共 50 条
  • [31] THE CORAL plus plus ALGORITHM FOR UNSUPERVISED DOMAIN ADAPTATION OF SPEAKER RECOGNITION
    Li, Rongjin
    Zhang, Weibin
    Chen, Dongpeng
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7172 - 7176
  • [32] Zero-Shot Deep Domain Adaptation With Common Representation Learning
    Kutbi, Mohammed
    Peng, Kuan-Chuan
    Wu, Ziyan
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (07) : 3909 - 3924
  • [33] Compressed Domain Invariant Adversarial Representation Learning for Robust Audio Deepfake Detection
    Yuan, Chengsheng
    Chen, Yifei
    Zhou, Zhili
    Xia, Zhihua
    Huang, Yongfeng
    IEEE SIGNAL PROCESSING LETTERS, 2025, 32 : 1111 - 1115
  • [34] Discriminative Invariant Alignment for Unsupervised Domain Adaptation
    Lu, Yuwu
    Li, Desheng
    Wang, Wenjing
    Lai, Zhihui
    Zhou, Jie
    Li, Xuelong
    IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 : 1871 - 1882
  • [35] DOMAIN ADAPTATION FOR SPEAKER RECOGNITION IN SINGING AND SPOKEN VOICE
    Chowdhury, Anurag
    Cozzo, Austin
    Ross, Arun
    2022 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2022, : 7192 - 7196
  • [36] Unsupervised domain adaptation via representation learning and adaptive classifier learning
    Gheisari, Marzieh
    Baghshah, Mandieh Soleymani
    NEUROCOMPUTING, 2015, 165 : 300 - 311
  • [37] Unsupervised Domain Adaptation in the Wild via Disentangling Representation Learning
    Haoliang Li
    Renjie Wan
    Shiqi Wang
    Alex C. Kot
    International Journal of Computer Vision, 2021, 129 : 267 - 283
  • [38] Domain Adaptation with Representation Learning and Nonlinear Relation for Time Series
    Hussein, Amir
    Hajj, Hazem
    ACM TRANSACTIONS ON INTERNET OF THINGS, 2022, 3 (02):
  • [39] Representation learning via an integrated autoencoder for unsupervised domain adaptation
    Yi ZHU
    Xindong WU
    Jipeng QIANG
    Yunhao YUAN
    Yun LI
    Frontiers of Computer Science, 2023, 17 (05) : 77 - 89
  • [40] Joint predictive model and representation learning for visual domain adaptation
    Gheisari, Marzieh
    Baghshah, Mandieh Soleymani
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 58 : 157 - 170