CentriForce: Multiple-Domain Adaptation for Domain-Invariant Speaker Representation Learning

被引:3
|
作者
Wei, Yuheng [1 ]
Du, Junzhao [1 ]
Liu, Hui [1 ]
Zhang, Zhipeng [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Training; Speaker recognition; Mathematical models; Adaptation models; Speech recognition; Representation learning; Task analysis; Multiple speech sources; multiple-domain adaptation; speaker embedding; speaker recognition; RECOGNITION;
D O I
10.1109/LSP.2022.3154237
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the real world, speaker recognition systems usually suffer from serious performance degradation due to the domain mismatch between training and test conditions. To alleviate the harmful effect of domain shift, unsupervised domain adaptation methods are introduced to learn domain-invariant speaker representations, which focus on addressing the single-source-to-single-target domain adaptation issue. However, labeled speaker data are usually collected from multiple sources, such as different languages, genres and devices. The single-domain adaptation methods can not deal with the complex multiple-domain mismatch problem. To address this issue, we propose a multiple-domain adaptation framework named CentriForce to extract domain-invariant speaker representations for speaker recognition. Different from previous methods, CentriForce learns multiple domain-related speaker representation spaces. To mitigate the multiple-domain mismatch, CentriForce reduces the Wasserstein distance between each pair of source and target domains in their domain-related representation space and meanwhile uses the target domain as an anchor point to draw all source domains closer to each other. In our experiments, CentriForce achieves the best performance on most of the 16 challenging adaptation tasks, compared with other competing adaptation methods. Ablation study and representation visualization further demonstrate its effectiveness for learning the domain-invariant speaker embedding.
引用
收藏
页码:807 / 811
页数:5
相关论文
共 50 条
  • [31] A Bit More Bayesian: Domain-Invariant Learning with Uncertainty
    Xiao, Zehao
    Shen, Jiayi
    Zhen, Xiantong
    Shao, Ling
    Snoek, Cees G. M.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
  • [32] Domain Adaptation with Invariant Representation Learning: What Transformations to Learn?
    Stojanov, Petar
    Li, Zijian
    Gong, Mingming
    Cai, Ruichu
    Carbonell, Jaime G.
    Zhang, Kun
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [33] COD: Learning Conditional Invariant Representation for Domain Adaptation Regression
    Yang, Hao-Ran
    Ren, Chuan-Xian
    Luo, You-Wei
    COMPUTER VISION - ECCV 2024, PT LXXVI, 2025, 15134 : 108 - 125
  • [34] Deep domain-invariant learning for facial age estimation
    Bao, Zenghao
    Luo, Yutian
    Tan, Zichang
    Wan, Jun
    Ma, Xibo
    Lei, Zhen
    NEUROCOMPUTING, 2023, 534 : 86 - 93
  • [35] A Dictionary Approach to Domain-Invariant Learning in Deep Networks
    Wang, Ze
    Cheng, Xiuyuan
    Sapiro, Guillermo
    Qiu, Qiang
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [36] DALSCLIP: Domain aggregation via learning stronger domain-invariant features for CLIP
    Zhang, Yuewen
    Wang, Jiuhang
    Tang, Hongying
    Qin, Ronghua
    IMAGE AND VISION COMPUTING, 2025, 154
  • [37] Learning A Self-Supervised Domain-Invariant Feature Representation for Generalized Audio Deepfake Detection
    Xie, Yuankun
    Cheng, Haonan
    Wang, Yutian
    Ye, Long
    INTERSPEECH 2023, 2023, : 2808 - 2812
  • [38] Learning Domain-Invariant Discriminative Features for Heterogeneous Face Recognition
    Yang, Shanmin
    Fu, Keren
    Yang, Xiao
    Lin, Ye
    Zhang, Jianwei
    Peng, Cheng
    IEEE ACCESS, 2020, 8 : 209790 - 209801
  • [39] Domain-invariant feature learning with label information integration for cross-domain classification
    Jiang L.
    Wu J.
    Zhao S.
    Li J.
    Neural Computing and Applications, 2024, 36 (21) : 13107 - 13126
  • [40] Domain Invariant Representation Learning with Domain Density Transformations
    Nguyen, A. Tuan
    Tran, Toan
    Gal, Yarin
    Baydin, Atilim Gunes
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34