Discriminant Analysis Methods Comparison in I-Vector Space for Speaker Verification

被引:0
|
作者
Mohammadi, Mohsen [1 ]
Mohammadi, Hamid Reza Sadegh [1 ]
机构
[1] ACECR, Iranian Res Inst Elect Engn, Dept Commun Engn, Tehran, Iran
来源
2018 9TH INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (IST) | 2018年
关键词
Gaussian Mixture Model; Noise Contaminated Speech; Speech Feature Vectors; Speaker Verification; PLDA;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Identity vectors are the state-of-the-art feature vectors for speaker recognition applications. One of the most important advantages of i-vector is its allowance for implementation of channel and noise compensatory methods such as linear discriminant analysis (LDA). The motivation for this is to look for new orthogonal axes to achieve superior discrimination between different classes. The axes should comply with the inter-class variance maximization and intra-class variance minimization requirements. The conventional method for the LDA transform computation considers Gaussian distribution assumption and uses parametric representations for both intra-and inter-speaker scatter matrices. Of course, the actual distribution of i-vectors may not necessarily be Gaussian. In this paper, we investigate the performance of LDA, and three nonparametric techniques, i.e., NDA, GDA, and SVDA separately and in combination with LDA. Experiments were conducted on TIMIT and NIST SRE 2008 datasets with MFCC and PNCC feature vectors. The results show that using the combination of parametric and nonparametric methods can lead to better results.
引用
收藏
页码:166 / 172
页数:7
相关论文
共 50 条
  • [21] An improved i-vector extraction algorithm for speaker verification
    Li, Wei
    Fu, Tianfan
    Zhu, Jie
    EURASIP JOURNAL ON AUDIO SPEECH AND MUSIC PROCESSING, 2015, : 1 - 9
  • [22] Noise Compensation in i-vector Space Using Linear Regression for Robust Speaker Verification
    Baby, Renjith
    Kumar, C. Santhosh
    George, Kuruvachan K.
    Panda, Ashish
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2017, : 161 - 165
  • [23] An Adaptive i-Vector Extraction for Speaker Verification with Short Utterance
    Poddar, Arnab
    Sahidullah, Md
    Saha, Goutam
    PATTERN RECOGNITION AND MACHINE INTELLIGENCE, PREMI 2017, 2017, 10597 : 326 - 332
  • [24] WEIGHTED LDA TECHNIQUES FOR I-VECTOR BASED SPEAKER VERIFICATION
    Kanagasundaram, A.
    Dean, D.
    Vogt, R.
    McLaren, M.
    Sridharan, S.
    Mason, M.
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 4781 - 4784
  • [25] PERFORMANCE OF I-VECTOR SPEAKER VERIFICATION AND THE DETECTION OF SYNTHETIC SPEECH
    McClanahan, Richard D.
    Stewart, Bryan
    De Leon, Phillip L.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [26] SPEAKER VERIFICATION USING SIMPLIFIED AND SUPERVISED I-VECTOR MODELING
    Li, Ming
    Tsiartas, Andreas
    Van Segbroeck, Maarten
    Narayanan, Shrikanth S.
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 7199 - 7203
  • [27] Minimax i-vector extractor for short duration speaker verification
    Hautamaki, Ville
    Cheng, You-Chi
    Rajan, Padmanabhan
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3675 - 3679
  • [28] Bayesian Distance Metric Learning on i-vector for Speaker Verification
    Fang, Xiao
    Dehak, Najim
    Glass, James
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2513 - 2517
  • [29] Non-linear PLDA for i-Vector Speaker Verification
    Novoselov, Sergey
    Pekhovsky, Timur
    Kudashev, Oleg
    Mendelev, Valentin
    Prudnikov, Alexey
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 214 - 218
  • [30] Whisper to neutral mapping using cosine similarity maximization in i-vector space for speaker verification
    Naini, Abinay Reddy
    Rao, Achuth M., V
    Ghosh, Prasanta Kumar
    INTERSPEECH 2019, 2019, : 4340 - 4344