Approaches for Out-of-Domain Adaptation to Improve Speaker Recognition Performance

Cited by: 0
Authors
Shulipa, Andrey [1]
Novoselov, Sergey [1, 2]
Melnikov, Aleksandr [2]
Affiliations
[1] ITMO Univ, St Petersburg, Russia
[2] Speech Technol Ctr, St Petersburg, Russia
Source
SPEECH AND COMPUTER | 2016 / Vol. 9811
Keywords
Speaker recognition; Domain adaptation; Mismatch conditions
DOI
10.1007/978-3-319-43958-7_14
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In recent years, speaker recognition (SR) systems have achieved satisfactory performance in evaluations provided by NIST. This was made possible by large datasets for training system parameters and by accurate speaker variability modeling. In such cases the test and training conditions are similar, which ensures good performance in the evaluations. However, in practical applications where training and testing conditions differ, the optimal SR system parameters become mismatched. This is the main problem in deploying real application systems, and it reduces the effectiveness of SR systems. This paper investigates discriminative and generative approaches for adapting the parameters of speaker recognition systems and proposes effective solutions to improve their performance.
Pages: 124-130
Number of pages: 7
Related papers
50 in total
  • [21] Multi-Source Domain Adaptation for Text-Independent Forensic Speaker Recognition
    Wang, Zhenyu
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 30 : 60 - 75
  • [22] Learning from noisy out-of-domain corpus using dataless classification
    Jin, Yiping
    Wanvarie, Dittaya
    Le, Phu T. V.
    NATURAL LANGUAGE ENGINEERING, 2022, 28 (01) : 39 - 69
  • [23] Domain Adaptation for Text Dependent Speaker Verification
    Aronowitz, Hagai
    Rendel, Asaf
    15TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2014), VOLS 1-4, 2014, : 1337 - 1341
  • [24] A Comparative Study of Different Approaches for the Speaker Recognition
    Returi, Kanaka Durga
    Mohan, Vaka Murali
    Lagisetty, Praveen Kumar
    INFORMATION SYSTEMS DESIGN AND INTELLIGENT APPLICATIONS, VOL 1, INDIA 2016, 2016, 433 : 599 - 608
  • [25] Comparison of Speaker Recognition Approaches for Real Applications
    Cumani, Sandro
    Batzu, Pier Domenico
    Colibro, Daniele
    Vair, Claudio
    Laface, Pietro
    Vasilakakis, Vasileios
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2376 - +
  • [26] Total variability subspace adaptation based speaker recognition
    Li, Zhi-Yi, 1836, Science Press (40): : 1836 - 1840
  • [27] Maximum-Likelihood Linear Transformation for Unsupervised Domain Adaptation in Speaker Verification
    Misra, Abhinav
    Hansen, John H. L.
    IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2018, 26 (09) : 1549 - 1558
  • [28] Performance Comparison of Speaker and Emotion Recognition
    Revathy, A.
    Shanmugapriya, P.
    Mohan, V.
    2015 3RD INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING, COMMUNICATION AND NETWORKING (ICSCN), 2015,
  • [29] Standoff Speaker Recognition: Effects of Recording Distance Mismatch on Speaker Recognition System Performance
    Fowler, Mike
    McCurry, Mark
    Bramsen, Jonathan
    Dunsin, Kehinde
    Remus, Jeremiah
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3680 - 3683
  • [30] A COMPARISON OF APPROACHES FOR MODELING PROSODIC FEATURES IN SPEAKER RECOGNITION
    Ferrer, Luciana
    Scheffer, Nicolas
    Shriberg, Elizabeth
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 4414 - 4417