CentriForce: Multiple-Domain Adaptation for Domain-Invariant Speaker Representation Learning

被引:3
|
作者
Wei, Yuheng [1 ]
Du, Junzhao [1 ]
Liu, Hui [1 ]
Zhang, Zhipeng [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Training; Speaker recognition; Mathematical models; Adaptation models; Speech recognition; Representation learning; Task analysis; Multiple speech sources; multiple-domain adaptation; speaker embedding; speaker recognition; RECOGNITION;
D O I
10.1109/LSP.2022.3154237
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the real world, speaker recognition systems usually suffer from serious performance degradation due to the domain mismatch between training and test conditions. To alleviate the harmful effect of domain shift, unsupervised domain adaptation methods are introduced to learn domain-invariant speaker representations, which focus on addressing the single-source-to-single-target domain adaptation issue. However, labeled speaker data are usually collected from multiple sources, such as different languages, genres and devices. The single-domain adaptation methods can not deal with the complex multiple-domain mismatch problem. To address this issue, we propose a multiple-domain adaptation framework named CentriForce to extract domain-invariant speaker representations for speaker recognition. Different from previous methods, CentriForce learns multiple domain-related speaker representation spaces. To mitigate the multiple-domain mismatch, CentriForce reduces the Wasserstein distance between each pair of source and target domains in their domain-related representation space and meanwhile uses the target domain as an anchor point to draw all source domains closer to each other. In our experiments, CentriForce achieves the best performance on most of the 16 challenging adaptation tasks, compared with other competing adaptation methods. Ablation study and representation visualization further demonstrate its effectiveness for learning the domain-invariant speaker embedding.
引用
收藏
页码:807 / 811
页数:5
相关论文
共 50 条
  • [21] ATTENTIVE ADVERSARIAL LEARNING FOR DOMAIN-INVARIANT TRAINING
    Meng, Zhong
    Li, Jinyu
    Gong, Yifan
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6740 - 6744
  • [22] Learning a Domain-Invariant Embedding for Unsupervised Domain Adaptation Using Class-Conditioned Distribution Alignment
    Gabourie, Alexander J.
    Rostami, Mohammad
    Pope, Philip E.
    Kolouri, Soheil
    Kim, Kyungnam
    2019 57TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2019, : 352 - 359
  • [23] Gradient-aware domain-invariant learning for domain generalizationGradient-Aware Domain-Invariant Learning for Domain GeneralizationF. Hou et al.
    Feng Hou
    Yao Zhang
    Yang Liu
    Jin Yuan
    Cheng Zhong
    Yang Zhang
    Zhongchao Shi
    Jianping Fan
    Zhiqiang He
    Multimedia Systems, 2025, 31 (1)
  • [24] Domain-Invariant Speaker Vector Projection by Model-Agnostic Meta-Learning
    Kang, Jiawen
    Liu, Ruiqi
    Li, Lantian
    Cai, Yunqi
    Wang, Dong
    Zheng, Thomas Fang
    INTERSPEECH 2020, 2020, : 3825 - 3829
  • [25] Domain-Invariant Feature Alignment Using Variational Inference For Partial Domain Adaptation
    Choudhuri, Sandipan
    Adeniye, Suli
    Sen, Arunabha
    Venkateswara, Hemanth
    2022 56TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2022, : 349 - 355
  • [26] Learning Domain-Invariant Representations of Histological Images
    Lafarge, Maxime W.
    Pluim, Josien P. W.
    Eppenhof, Koen A. J.
    Veta, Mitko
    FRONTIERS IN MEDICINE, 2019, 6
  • [27] Graph-Diffusion-Based Domain-Invariant Representation Learning for Cross-Domain Facial Expression Recognition
    Wang, Run
    Song, Peng
    Zheng, Wenming
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (03) : 4163 - 4174
  • [28] Learning Domain-Invariant Subspace Using Domain Features and Independence Maximization
    Yan, Ke
    Kou, Lu
    Zhang, David
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (01) : 288 - 299
  • [29] Graph embedding-based heterogeneous domain adaptation with domain-invariant feature learning and distributional order preserving
    Wang, Wenxu
    Li, Zhenbo
    Li, Weiran
    NEURAL NETWORKS, 2024, 170 : 427 - 440
  • [30] Attribute-Aligned Domain-Invariant Feature Learning for Unsupervised Domain Adaptation Person Re-Identification
    Li, Huafeng
    Chen, Yiwen
    Tao, Dapeng
    Yu, Zhengtao
    Qi, Guanqiu
    IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2021, 16 : 1480 - 1494