CentriForce: Multiple-Domain Adaptation for Domain-Invariant Speaker Representation Learning

被引:3
|
作者
Wei, Yuheng [1 ]
Du, Junzhao [1 ]
Liu, Hui [1 ]
Zhang, Zhipeng [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
Training; Speaker recognition; Mathematical models; Adaptation models; Speech recognition; Representation learning; Task analysis; Multiple speech sources; multiple-domain adaptation; speaker embedding; speaker recognition; RECOGNITION;
D O I
10.1109/LSP.2022.3154237
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the real world, speaker recognition systems usually suffer from serious performance degradation due to the domain mismatch between training and test conditions. To alleviate the harmful effect of domain shift, unsupervised domain adaptation methods are introduced to learn domain-invariant speaker representations, which focus on addressing the single-source-to-single-target domain adaptation issue. However, labeled speaker data are usually collected from multiple sources, such as different languages, genres and devices. The single-domain adaptation methods can not deal with the complex multiple-domain mismatch problem. To address this issue, we propose a multiple-domain adaptation framework named CentriForce to extract domain-invariant speaker representations for speaker recognition. Different from previous methods, CentriForce learns multiple domain-related speaker representation spaces. To mitigate the multiple-domain mismatch, CentriForce reduces the Wasserstein distance between each pair of source and target domains in their domain-related representation space and meanwhile uses the target domain as an anchor point to draw all source domains closer to each other. In our experiments, CentriForce achieves the best performance on most of the 16 challenging adaptation tasks, compared with other competing adaptation methods. Ablation study and representation visualization further demonstrate its effectiveness for learning the domain-invariant speaker embedding.
引用
收藏
页码:807 / 811
页数:5
相关论文
共 50 条
  • [41] Representation learning via serial robust autoencoder for domain adaptation
    Yang, Shuai
    Zhang, Yuhong
    Wang, Hao
    Li, Peipei
    Hu, Xuegang
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 160
  • [42] Riemannian representation learning for multi-source domain adaptation
    Chen, Sentao
    Zheng, Lin
    Wu, Hanrui
    PATTERN RECOGNITION, 2023, 137
  • [43] Representation learning via an integrated autoencoder for unsupervised domain adaptation
    Zhu, Yi
    Wu, Xindong
    Qiang, Jipeng
    Yuan, Yunhao
    Li, Yun
    FRONTIERS OF COMPUTER SCIENCE, 2023, 17 (05)
  • [44] Generalized Fixation Invariant Nuclei Detection Through Domain Adaptation Based Deep Learning
    Valkonen, Mira
    Hognas, Gunilla
    Bova, G. Steven
    Ruusuvuori, Pekka
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (05) : 1747 - 1757
  • [45] TOWARD DOMAIN-INVARIANT SPEECH RECOGNITION VIA LARGE SCALE TRAINING
    Narayanan, Arun
    Misra, Ananya
    Sim, Khe Chai
    Pundak, Golan
    Tripathi, Anshuman
    Elfeky, Mohamed
    Haghani, Parisa
    Strohman, Trevor
    Bacchiani, Michiel
    2018 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2018), 2018, : 441 - 447
  • [46] Unsupervised Domain Adaptation in the Wild via Disentangling Representation Learning
    Li, Haoliang
    Wan, Renjie
    Wang, Shiqi
    Kot, Alex C.
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 129 (02) : 267 - 283
  • [47] Adversarial Multitask Learning for Domain Adaptation Through Domain Adapter
    Hidayaturrahman
    Trisetyarso, Agung
    Kartowisastro, Iman Herwidiana
    Budiharto, Widodo
    IEEE ACCESS, 2024, 12 : 184989 - 184999
  • [48] WDSRL: Multi-Domain Neural Machine Translation With Word-Level Domain-Sensitive Representation Learning
    Man, Zhibo
    Huang, Zengcheng
    Zhang, Yujie
    Li, Yu
    Chen, Yuanmeng
    Chen, Yufeng
    Xu, Jinan
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 32 : 577 - 590
  • [49] Multi-Source Contribution Learning for Domain Adaptation
    Li, Keqiuyin
    Lu, Jie
    Zuo, Hua
    Zhang, Guangquan
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 33 (10) : 5293 - 5307
  • [50] Learning Domain Invariant Prompt for Vision-Language Models
    Zhao, Cairong
    Wang, Yubin
    Jiang, Xinyang
    Shen, Yifei
    Song, Kaitao
    Li, Dongsheng
    Miao, Duoqian
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 1348 - 1360