SPEAKER VERIFICATION USING KERNEL-BASED BINARY CLASSIFIERS WITH BINARY OPERATION DERIVED FEATURES

被引:0
|
作者
Lee, Hung-Shin [1 ,2 ]
Tso, Yu [3 ]
Chang, Yun-Fan [3 ]
Wang, Hsin-Min [2 ]
Jeng, Shyh-Kang [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan
[2] Inst Sci Informat, Taipei, Taiwan
[3] Res Ctr Informat Technol Innovat, Taipei, Taiwan
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
speaker verification; SVM; DNN; i-vector;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we study the use of two kinds of kernel-based discriminative models, namely support vector machine (SVM) and deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of two utterances, each represented by an i-vector, is assumed to belong to either the "within-speaker" group or the "between-speaker" group. To solve the problem, we employ various binary operations to retain the basic relationship between any pair of i-vectors to form a single vector for training the discriminative models. This study also investigates the correlation of achievable performances with the number of training pairs and the various combinations of basic binary operations, using the SVM and DNN binary classifiers. The experiments are conducted on the male portion of the core task in the NIST 2005 Speaker Recognition Evaluation (SRE), and the results are competitive or even better, in terms of normalized decision cost function (minDCF) and equal error rate (EER), while compared to other non-probabilistic based models, such as the conventional speaker SVMs and the LDA-based cosine distance scoring.
引用
收藏
页数:5
相关论文
共 50 条
  • [21] A GMM-based Probabilistic Sequence Kernel for Speaker Verification
    Lee, Kong-Aik
    You, Changhuai
    Li, Haizhou
    Kinnunen, Tomi
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1553 - 1556
  • [22] A novel data description kernel based on one-class SVM for speaker verification
    Shen, Yufeng
    Yang, Yingchun
    2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 489 - +
  • [23] Articulatory-feature based sequence kernel for high-level speaker verification
    Zhang, Shi-Xiong
    Mak, Man-Wei
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2799 - 2804
  • [24] On-Line Linear Combination of Classifiers Based on Incremental Information in Speaker Verification
    Huenupan, Fernando
    Becerra Yoma, Nestor
    Garreton, Claudio
    Molina, Carlos
    ETRI JOURNAL, 2010, 32 (03) : 395 - 405
  • [25] Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers
    Abdalmalak, Kerlos Atia
    Gallardo-Antolin, Ascension
    NEURAL COMPUTING & APPLICATIONS, 2018, 29 (03) : 637 - 651
  • [26] Using combined features to improve speaker verification in the face of limited reverberant data
    Al-Karawi K.A.
    Mohammed D.Y.
    International Journal of Speech Technology, 2023, 26 (03) : 789 - 799
  • [27] Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers
    Kerlos Atia Abdalmalak
    Ascensión Gallardo-Antolín
    Neural Computing and Applications, 2018, 29 : 637 - 651
  • [28] SVM with Gaussian Kernel-based Image Spam Detection on Textual Features
    Kumar, Prashant
    Biswas, Mantosh
    2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [29] Speaker verification using boosted cepstral features with Gaussian distributions
    Salman, Ahmad
    Muhammad, Ejaz
    Khurshid, Khawar
    INMIC 2007: PROCEEDINGS OF THE 11TH IEEE INTERNATIONAL MULTITOPIC CONFERENCE, 2007, : 14 - 18
  • [30] Audiovisual Speaker Identity Verification Based on Lip Motion Features
    Chetty, Girija
    Wagner, Michael
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2604 - 2607