Sparsity Analysis and Compensation for i-Vector Based Speaker Verification

被引:1
|
作者
Li, Wei [1 ]
Fu, Tian Fan [2 ]
Zhu, Jie [1 ]
Chen, Ning [3 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Comp Sci & Engn CSE, Shanghai 200240, Peoples R China
[3] East China Univ S&T, Sch Informat Sci & Engn, Shanghai 200237, Peoples R China
来源
关键词
Speaker verification; i-vector; Phonetic sparsity; Adapted first order Baum-Welch statistics analysis (AFSA);
D O I
10.1007/978-3-319-23132-7_47
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Over recent years, i-vector based framework has been proven to provide state-of-art performance in speaker verification. Most of the researches focus on compensating the channel variability of i-vector. In this paper we will give an analysis that in the case that the duration of enrollment or test utterance is limited, i-vector based system may suffer from biased estimation problem. In order to solve this problem, we propose an improved i-vector extraction algorithm which we term Adapted First order Baum-Welch Statistics Analysis (AFSA). This new algorithm suppresses and compensates the deviation of first order Baum-Welch statistics caused by phonetic sparsity and phonetic imbalance. Experiments were performed based on NIST 2008 SRE data sets, Experimental results show that 10%-15% relative improvement is achieved compared to the baseline of traditional i-vector based system.
引用
收藏
页码:381 / 388
页数:8
相关论文
共 50 条
  • [21] Improved i-vector Speaker Verification Based on WCCN and ZT-norm
    Xing, Yujuan
    Tan, Ping
    Zhang, Chengwen
    BIOMETRIC RECOGNITION, 2016, 9967 : 424 - 431
  • [22] Nonparametrically Trained Probabilistic Linear Discriminant Analysis for i-Vector Speaker Verification
    Khosravani, Abbas
    Homayounpour, Mohammad Mehdi
    18TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2017), VOLS 1-6: SITUATED INTERACTION, 2017, : 1019 - 1023
  • [23] Weighted I-Vector Based Text-Independent Speaker Verification System
    Mohammadi, Mohsen
    Mohammadi, Hamid Reza Sadegh
    2019 27TH IRANIAN CONFERENCE ON ELECTRICAL ENGINEERING (ICEE 2019), 2019, : 1647 - 1653
  • [24] Using Fishervoice to enhance the performance of I-vector based Speaker Verification System
    Li, Na
    Zeng, Xiangyang
    Li, Zhifeng
    Qiao, Yu
    Jiang, Weiwu
    2014 4TH IEEE INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2014, : 578 - 581
  • [25] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
    Poddar, Arnab
    Sahidullah, Md
    Saha, Goutam
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
  • [26] Cosine Metric Learning for Speaker Verification in the i-Vector Space
    Bai, Zhong
    Zhang, Xiao-Lei
    Chen, Jingdong
    19TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2018), VOLS 1-6: SPEECH RESEARCH FOR EMERGING MARKETS IN MULTILINGUAL SOCIETIES, 2018, : 1126 - 1130
  • [27] ADDITIVE NOISE COMPENSATION IN THE I-VECTOR SPACE FOR SPEAKER RECOGNITION
    Ben Kheder, Waad
    Matrouf, Driss
    Bonastre, Jean-Francois
    Ajili, Moez
    Bousquet, Pierre-Michel
    2015 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING (ICASSP), 2015, : 4190 - 4194
  • [28] Evaluation of I-vector and GMM Based Speaker Verification Systems for Forensic Application
    Gumus, Fatma
    Yankayis, Mustafa
    Karabiber, Fethullah
    2016 24TH SIGNAL PROCESSING AND COMMUNICATION APPLICATION CONFERENCE (SIU), 2016, : 617 - 620
  • [29] I-vector based speaker recognition using advanced channel compensation techniques
    Kanagasundaram, Ahilan
    Dean, David
    Sridharan, Sridha
    McLaren, Mitchell
    Vogt, Robbie
    COMPUTER SPEECH AND LANGUAGE, 2014, 28 (01): : 121 - 140
  • [30] Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning
    Rao, Wei
    Mak, Man-Wai
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2013, 21 (05): : 1012 - 1022