SPEAKER VERIFICATION USING KERNEL-BASED BINARY CLASSIFIERS WITH BINARY OPERATION DERIVED FEATURES

被引:0
作者
Lee, Hung-Shin [1 ,2 ]
Tso, Yu [3 ]
Chang, Yun-Fan [3 ]
Wang, Hsin-Min [2 ]
Jeng, Shyh-Kang [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan
[2] Inst Sci Informat, Taipei, Taiwan
[3] Res Ctr Informat Technol Innovat, Taipei, Taiwan
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
speaker verification; SVM; DNN; i-vector;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we study the use of two kinds of kernel-based discriminative models, namely support vector machine (SVM) and deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of two utterances, each represented by an i-vector, is assumed to belong to either the "within-speaker" group or the "between-speaker" group. To solve the problem, we employ various binary operations to retain the basic relationship between any pair of i-vectors to form a single vector for training the discriminative models. This study also investigates the correlation of achievable performances with the number of training pairs and the various combinations of basic binary operations, using the SVM and DNN binary classifiers. The experiments are conducted on the male portion of the core task in the NIST 2005 Speaker Recognition Evaluation (SRE), and the results are competitive or even better, in terms of normalized decision cost function (minDCF) and equal error rate (EER), while compared to other non-probabilistic based models, such as the conventional speaker SVMs and the LDA-based cosine distance scoring.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Efficient Integrated Features Based on Pre-trained Models for Speaker Verification
    Li, Yishuang
    Guan, Wenhao
    Huang, Hukai
    Miao, Shiyu
    Su, Qi
    Li, Lin
    Hong, Qingyang
    INTERSPEECH 2024, 2024, : 2140 - 2144
  • [42] Multitaper MFCC and normalized multitaper phase-based features for speaker verification
    Arash Mansouri
    Eduardo Castillo-Guerra
    SN Applied Sciences, 2019, 1
  • [43] Multitaper MFCC and normalized multitaper phase-based features for speaker verification
    Mansouri, Arash
    Castillo-Guerra, Eduardo
    SN APPLIED SCIENCES, 2019, 1 (04):
  • [44] Cluster Adaptive Training Weights as Features in SVM-Based Speaker Verification
    Yang, Hao
    Dong, Yuan
    Zhao, Xianyu
    Zhao, Jian
    Lu, Liang
    Wang, Haila
    INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 573 - +
  • [45] Ensemble of binary SVM classifiers based on PCA and LDA feature extraction for intrusion detection
    Aburomman, Abdulla Amin
    Reaz, Mamun Bin Ibne
    PROCEEDINGS OF 2016 IEEE ADVANCED INFORMATION MANAGEMENT, COMMUNICATES, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IMCEC 2016), 2016, : 636 - 640
  • [46] Speaker verification based on speaker background model virtually synthesized using local acoustic information
    Isobe, T
    Takahashi, J
    Nakamura, T
    ELECTRONICS AND COMMUNICATIONS IN JAPAN PART II-ELECTRONICS, 2002, 85 (04): : 47 - 57
  • [47] Speaker verification using various dynamic kernels for prosodic features combined with spectral information
    Drgas, Szymon
    Dabrowski, Adam
    Zamorski, Dariusz
    PRZEGLAD ELEKTROTECHNICZNY, 2012, 88 (06): : 51 - 54
  • [48] DNN BASED SPEAKER EMBEDDING USING CONTENT INFORMATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION
    Dey, Subhadeep
    Koshinaka, Takafumi
    Motlicek, Petr
    Madikeri, Srikanth
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 5344 - 5348
  • [49] Recognition of Blood and Bone Marrow Cells using Kernel-based Image Retrieval
    Pan, Chen
    Yan, Xiangguo
    Zheng, Chongxun
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2006, 6 (10): : 29 - 35
  • [50] Combining Amplitude and Phase-based Features for Speaker Verification with Short Duration Utterances
    Alam, Md Jahangir
    Kenny, Patrick
    Stafylakis, Themos
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 249 - 253