CentriForce: Multiple-Domain Adaptation for Domain-Invariant Speaker Representation Learning

被引:3
|
作者
Wei, Yuheng [1 ]
Du, Junzhao [1 ]
Liu, Hui [1 ]
Zhang, Zhipeng [1 ]
机构
[1] Xidian Univ, Sch Comp Sci & Technol, Xian 710071, Shaanxi, Peoples R China
基金
国家重点研发计划; 中国国家自然科学基金;
关键词
Training; Speaker recognition; Mathematical models; Adaptation models; Speech recognition; Representation learning; Task analysis; Multiple speech sources; multiple-domain adaptation; speaker embedding; speaker recognition; RECOGNITION;
D O I
10.1109/LSP.2022.3154237
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In the real world, speaker recognition systems usually suffer from serious performance degradation due to the domain mismatch between training and test conditions. To alleviate the harmful effect of domain shift, unsupervised domain adaptation methods are introduced to learn domain-invariant speaker representations, which focus on addressing the single-source-to-single-target domain adaptation issue. However, labeled speaker data are usually collected from multiple sources, such as different languages, genres and devices. The single-domain adaptation methods can not deal with the complex multiple-domain mismatch problem. To address this issue, we propose a multiple-domain adaptation framework named CentriForce to extract domain-invariant speaker representations for speaker recognition. Different from previous methods, CentriForce learns multiple domain-related speaker representation spaces. To mitigate the multiple-domain mismatch, CentriForce reduces the Wasserstein distance between each pair of source and target domains in their domain-related representation space and meanwhile uses the target domain as an anchor point to draw all source domains closer to each other. In our experiments, CentriForce achieves the best performance on most of the 16 challenging adaptation tasks, compared with other competing adaptation methods. Ablation study and representation visualization further demonstrate its effectiveness for learning the domain-invariant speaker embedding.
引用
收藏
页码:807 / 811
页数:5
相关论文
共 50 条
  • [41] DIVIDE: Learning a Domain-Invariant Geometric Space for Depth Estimation
    Shim, Dongseok
    Kim, H. Jin
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (05) : 4663 - 4670
  • [42] Learning discriminative domain-invariant prototypes for generalized zero shot learning
    Wang, Yinduo
    Zhang, Haofeng
    Zhang, Zheng
    Long, Yang
    Shao, Ling
    KNOWLEDGE-BASED SYSTEMS, 2020, 196
  • [43] LEARNING INVARIANT REPRESENTATION AND RISK MINIMIZED FOR UNSUPERVISED ACCENT DOMAIN ADAPTATION
    Zhao, Chendong
    Wang, Jianzong
    Qu, Xiaoyang
    Wang, Haoqian
    Xiao, Jing
    2022 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP, SLT, 2022, : 509 - 516
  • [44] Domain-Invariant Feature Learning for General Face Forgery Detection
    Zhang, Jian
    Ni, Jiangqun
    2023 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, ICME, 2023, : 2321 - 2326
  • [45] Learning List-Level Domain-Invariant Representations for Ranking
    Xian, Ruicheng
    Zhuang, Honglei
    Qin, Zhen
    Zamani, Hamed
    Lu, Jing
    Ma, Ji
    Hui, Kai
    Zhao, Han
    Wang, Xuanhui
    Bendersky, Michael
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [46] Automatic Seizure Classification Based on Domain-Invariant Deep Representation of EEG
    Cao, Xincheng
    Yao, Bin
    Chen, Binqiang
    Sun, Weifang
    Tan, Guowei
    FRONTIERS IN NEUROSCIENCE, 2021, 15
  • [47] Domain-Invariant Projection Learning for Zero-Shot Recognition
    Zhao, An
    Ding, Mingyu
    Guan, Jiechao
    Lu, Zhiwu
    Xiang, Tao
    Wen, Ji-Rong
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [48] On Learning Invariant Representations for Domain Adaptation
    Zhao, Han
    des Combes, Remi Tachet
    Zhang, Kun
    Gordon, Geoffrey J.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97
  • [49] Beyond cross-domain learning: Multiple-domain nonnegative matrix factorization
    Wang, Jim Jing-Yan
    Gao, Xin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 28 : 181 - 189
  • [50] Learning intra-domain style-invariant representation for unsupervised domain adaptation of semantic segmentation
    Li, Zongyao
    Togo, Ren
    Ogawa, Takahiro
    Haseyama, Miki
    PATTERN RECOGNITION, 2022, 132