Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance

被引:0
|
作者
Shulipa, Andrey [1 ]
Novoselov, Sergey [1 ,2 ]
Melnikov, Aleksandr [2 ]
机构
[1] ITMO Univ, St Petersburg, Russia
[2] Speech Technol Ctr, St Petersburg, Russia
来源
SPEECH AND COMPUTER | 2016年 / 9811卷
关键词
Speaker recognition; Domain adaptation; Mismatch conditions;
D O I
10.1007/978-3-319-43958-7_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In last years satisfactory performance of speaker recognition (SR) systems have been achieved in evaluations provided by NIST. It was possible due to using large datasets to train system parameters and accurate speaker variability modeling. In such a cases test and train conditions are similar and it ensures good performance for the evaluations. However in practical applications when training and testing conditions are different the problem of mismatching of the optimal SR system parameters occurs. It is the main problem in the deployment of the real application systems. It leads to reducing SR systems effectiveness. This paper investigates discriminative and generative approaches for the adaptation of the parameters of the speaker recognition systems and proposes effective solutions to improve their performance.
引用
收藏
页码:124 / 130
页数:7
相关论文
共 50 条
  • [41] Improving face recognition with domain adaptation
    Wen, Ge
    Chen, Huaguan
    Cai, Deng
    He, Xiaofei
    NEUROCOMPUTING, 2018, 287 : 45 - 51
  • [42] Extended Variability Modeling and Unsupervised Adaptation for PLDA Speaker Recognition
    McCree, Alan
    Sell, Gregory
    Garcia-Romero, Daniel
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1552 - 1556
  • [43] DOMAIN ADAPTATION VIA WITHIN-CLASS COVARIANCE CORRECTION IN I-VECTOR BASED SPEAKER RECOGNITION SYSTEMS
    Glembek, Ondrej
    Ma, Jeff
    Matejka, Pavel
    Zhang, Bing
    Plchot, Oldrich
    Burget, Lukas
    Matsoukas, Spyros
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [44] Towards improving the performance of speaker recognition systems
    Johnson, Neethu
    George, Kuruvachan K.
    Kumar, Santhosh C.
    Raj, Reghu P. C.
    2014 FIRST INTERNATIONAL CONFERENCE ON COMPUTATIONAL SYSTEMS AND COMMUNICATIONS (ICCSC), 2014, : 38 - 41
  • [45] Boosting Speaker Recognition Performance with Compact Representations
    Yaman, Sibel
    Pelecanos, Jason
    Omar, Mohamed K.
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 388 - 391
  • [46] Discriminative in-set/out-of-set speaker recognition
    Angkititrakul, Pongtep
    Hansen, John H. L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2007, 15 (02): : 498 - 508
  • [47] CentriForce: Multiple-Domain Adaptation for Domain-Invariant Speaker Representation Learning
    Wei, Yuheng
    Du, Junzhao
    Liu, Hui
    Zhang, Zhipeng
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 807 - 811
  • [48] Speaker recognition - general and data fusion classifier approaches methods
    Ramachandran, RP
    Farrell, KR
    Ramachandran, R
    Mammone, RJ
    PATTERN RECOGNITION, 2002, 35 (12) : 2801 - 2821
  • [49] Learning domain-heterogeneous speaker recognition systems with personalized continual federated learning
    Zhiyong Chen
    Shugong Xu
    EURASIP Journal on Audio, Speech, and Music Processing, 2023
  • [50] Comparison of Generative and Discriminative Approaches for Speaker Recognition with Limited Data
    Silovsky, Jan
    Cerva, Petr
    Zdansky, Jindrich
    RADIOENGINEERING, 2009, 18 (03) : 307 - 316