Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance

被引:0
|
作者
Shulipa, Andrey [1 ]
Novoselov, Sergey [1 ,2 ]
Melnikov, Aleksandr [2 ]
机构
[1] ITMO Univ, St Petersburg, Russia
[2] Speech Technol Ctr, St Petersburg, Russia
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speaker recognition; Domain adaptation; Mismatch conditions;
D O I
10.1007/978-3-319-43958-7_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In last years satisfactory performance of speaker recognition (SR) systems have been achieved in evaluations provided by NIST. It was possible due to using large datasets to train system parameters and accurate speaker variability modeling. In such a cases test and train conditions are similar and it ensures good performance for the evaluations. However in practical applications when training and testing conditions are different the problem of mismatching of the optimal SR system parameters occurs. It is the main problem in the deployment of the real application systems. It leads to reducing SR systems effectiveness. This paper investigates discriminative and generative approaches for the adaptation of the parameters of the speaker recognition systems and proposes effective solutions to improve their performance.
引用
收藏
页码:124 / 130
页数:7
相关论文
共 50 条
  • [31] Deep Neural Network Approaches to Speaker and Language Recognition
    Richardson, Fred
    Reynolds, Douglas
    Dehak, Najim
    IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (10) : 1671 - 1675
  • [32] ADVANCES IN DEEP NEURAL NETWORK APPROACHES TO SPEAKER RECOGNITION
    McLaren, Mitchell
    Lei, Yun
    Ferrer, Luciana
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4814 - 4818
  • [33] Towards Fully Bayesian Speaker Recognition: Integrating Out the Between-Speaker Covariance
    Villalba, Jesus
    Bruemmer, Niko
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 512 - +
  • [34] In-set/out-of-set speaker recognition: Leveraging the speaker and noise balance
    Leonard, Matthew R.
    Hansen, John H. L.
    2008 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, VOLS 1-12, 2008, : 1585 - 1588
  • [35] Comparison of Speaker Adaptation Methods as Feature Extraction for SVM-Based Speaker Recognition
    Ferras, Marc
    Leung, Cheung-Chi
    Barras, Claude
    Gauvain, Jean-Luc
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2010, 18 (06): : 1366 - 1378
  • [36] Cross-lingual Speaker Adaptation using Domain Adaptation and Speaker Consistency Loss for Text-To-Speech Synthesis
    Xin, Detai
    Saito, Yuki
    Takamichi, Shinnosuke
    Koriyama, Tomoki
    Saruwatari, Hiroshi
    INTERSPEECH 2021, 2021, : 1614 - 1618
  • [37] Adversarial Training for Multi-domain Speaker Recognition
    Wang, Qing
    Rao, Wei
    Guo, Pengcheng
    Xie, Lei
    2021 12TH INTERNATIONAL SYMPOSIUM ON CHINESE SPOKEN LANGUAGE PROCESSING (ISCSLP), 2021,
  • [38] Score domain speaking rate normalization for speaker recognition
    Aisikaer R.
    Wang D.
    Li L.
    Zheng F.
    Zhang X.
    Jin P.
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2018, 58 (04): : 337 - 341
  • [39] EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification
    Li, Jingyu
    Liu, Wei
    Lee, Tan
    INTERSPEECH 2022, 2022, : 3694 - 3698
  • [40] Unsupervised Discriminative Training of PLDA for Domain Adaptation in Speaker Verification
    Wang, Qiongqiong
    Koshinaka, Takafumi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 3727 - 3731