On robustness of unsupervised domain adaptation for speaker recognition

Cited by: 17
Authors
Bousquet, Pierre-Michel [1]
Rouvier, Mickael [1]
Affiliation
[1] Univ Avignon LIA, Avignon, France
Source
INTERSPEECH 2019 | 2019
Keywords
Speaker recognition; speaker embeddings; x-vectors; unsupervised; domain adaptation
DOI
10.21437/Interspeech.2019-1524
Chinese Library Classification number
R36 [Pathology]; R76 [Otorhinolaryngology]
Discipline classification code
100104; 100213
Abstract
Current speaker recognition systems, which are trained on large datasets and rely on sophisticated modeling, turn out to be highly domain-specific and sometimes give disappointing results in real-life applications. Any shift between training and test data, in terms of device, language, duration, noise, or other factors, tends to degrade speaker detection accuracy. This study investigates unsupervised domain adaptation when only a scarce, unlabeled "in-domain" development dataset is available. Details and relevance of different approaches are described and discussed, leading to a new robust method that we call feature-Distribution Adaptor. The efficiency of the proposed technique is experimentally validated on the recent NIST 2016 and 2018 Speaker Recognition Evaluation datasets.
Pages: 2958-2962
Number of pages: 5