On robustness of unsupervised domain adaptation for speaker recognition

被引:18
作者
Bousquet, Pierre-Michel [1 ]
Rouvier, Mickael [1 ]
机构
[1] Univ Avignon LIA, Avignon, France
来源
INTERSPEECH 2019 | 2019年
关键词
Speaker recognition; speaker embeddings; x-vectors; unsupervised; domain adaptation;
D O I
10.21437/Interspeech.2019-1524
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Current speaker recognition systems, that are learned by using wide training datasets and include sophisticated modelings, turn out to be very specific, providing sometimes disappointing results in real-life applications. Any shift between training and test data, in terms of device, language, duration, noise or other tends to degrade accuracy of speaker detection. This study investigates unsupervised domain adaptation,when only a scarce and unlabeled "in-domain" development dataset is available. Details and relevance of different approaches are described and commented, leading to a new robust method that we call feature-Distribution Adaptor. Efficiency of the proposed technique is experimentally validated on the recent NIST 2016 and 2018 Speaker Recognition Evaluation datasets.
引用
收藏
页码:2958 / 2962
页数:5
相关论文
共 50 条
[31]   Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition [J].
Gong, Boqing ;
Grauman, Kristen ;
Sha, Fei .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2014, 109 (1-2) :3-27
[32]   Unsupervised Domain Adaptation with Generative Adversarial Networks for Facial Emotion Recognition [J].
Fan, Yingruo ;
Lam, Jacqueline C. K. ;
Li, Victor O. K. .
2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, :4460-4464
[33]   Learning Kernels for Unsupervised Domain Adaptation with Applications to Visual Object Recognition [J].
Boqing Gong ;
Kristen Grauman ;
Fei Sha .
International Journal of Computer Vision, 2014, 109 :3-27
[34]   Multi-source Unsupervised Domain Adaptation for Medical Image Recognition [J].
Liu, Yujie ;
Zhang, Qicheng .
ADVANCED INTELLIGENT COMPUTING IN BIOINFORMATICS, PT I, ICIC 2024, 2024, 14881 :428-440
[35]   Unsupervised Domain Adaptation in Activity Recognition: A GAN-Based Approach [J].
Sanabria, Andrea Rosales ;
Zambonelli, Franco ;
Ye, Juan .
IEEE ACCESS, 2021, 9 :19421-19438
[36]   Noisy-Aware Unsupervised Domain Adaptation for Scene Text Recognition [J].
Liu, Xiao-Qian ;
Zhang, Peng-Fei ;
Luo, Xin ;
Huang, Zi ;
Xu, Xin-Shun .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 :6550-6563
[37]   Autoencoder based Domain Adaptation for Speaker Recognition under Insufficient Channel Information [J].
Shon, Suwon ;
Mun, Seongkyu ;
Kim, Wooil ;
Ko, Hanseok .
18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, :1014-1018
[38]   Improving Robustness to Compressed Speech in Speaker Recognition [J].
McLaren, Mitchell ;
Abrash, Victor ;
Graciarena, Martin ;
Lei, Yun ;
Pesan, Jan .
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, :3665-3669
[39]   Cluster adaptation networks for unsupervised domain adaptation [J].
Zhou, Qiang ;
Zhou, Wen'an ;
Wang, Shirui .
IMAGE AND VISION COMPUTING, 2021, 108
[40]   Semantic adaptation network for unsupervised domain adaptation [J].
Zhou, Qiang ;
Zhou, Wen'an ;
Wang, Shirui .
NEUROCOMPUTING, 2021, 454 :313-323