SPEAKER VERIFICATION USING KERNEL-BASED BINARY CLASSIFIERS WITH BINARY OPERATION DERIVED FEATURES

被引:0
作者
Lee, Hung-Shin [1 ,2 ]
Tso, Yu [3 ]
Chang, Yun-Fan [3 ]
Wang, Hsin-Min [2 ]
Jeng, Shyh-Kang [1 ]
机构
[1] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan
[2] Inst Sci Informat, Taipei, Taiwan
[3] Res Ctr Informat Technol Innovat, Taipei, Taiwan
来源
2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年
关键词
speaker verification; SVM; DNN; i-vector;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In this paper, we study the use of two kinds of kernel-based discriminative models, namely support vector machine (SVM) and deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of two utterances, each represented by an i-vector, is assumed to belong to either the "within-speaker" group or the "between-speaker" group. To solve the problem, we employ various binary operations to retain the basic relationship between any pair of i-vectors to form a single vector for training the discriminative models. This study also investigates the correlation of achievable performances with the number of training pairs and the various combinations of basic binary operations, using the SVM and DNN binary classifiers. The experiments are conducted on the male portion of the core task in the NIST 2005 Speaker Recognition Evaluation (SRE), and the results are competitive or even better, in terms of normalized decision cost function (minDCF) and equal error rate (EER), while compared to other non-probabilistic based models, such as the conventional speaker SVMs and the LDA-based cosine distance scoring.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Secure Binary Embeddings of Front-end Factor Analysis for Privacy Preserving Speaker Verification
    Portelo, Jose
    Abad, Alberto
    Raj, Bhiksha
    Trancoso, Isabel
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2493 - 2497
  • [32] Using Kernel Discriminant Analysis to Improve the Characterization of the Alternative Hypothesis for Speaker Verification
    Chao, Yi-Hsiang
    Tsai, Wei-Ho
    Wang, Hsin-Min
    Chang, Ruei-Chuan
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1675 - 1684
  • [33] Speaker verification using short utterances with DNN-based estimation of subglottal acoustic features
    Guo, Jinxi
    Yeung, Gary
    Muralidharan, Deepak
    Arsikere, Harish
    Afshan, Amber
    Alwan, Abeer
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2219 - 2222
  • [34] Speaker verification robust to talking style variation using multiple kernel learning based on conditional entropy minimization
    Ogawa, Tetsuji
    Hino, Hideitsu
    Murata, Noboru
    Kobayashi, Tetsunori
    12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2752 - +
  • [35] Acoustic features selection of speaker verification based on average KL distance
    Luan Yu
    Li Hongzuo
    Wang Yafei
    MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 629 - +
  • [36] FORMANT-GAPS FEATURES FOR SPEAKER VERIFICATION USING WHISPERED SPEECH
    Naini, Abinay Reddy
    Rao, Achuth M., V
    Ghosh, Prasanta Kumar
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6231 - 6235
  • [37] DEEP BOTTLENECK FEATURES FOR I-VECTOR BASED TEXT-INDEPENDENT SPEAKER VERIFICATION
    Ghalehjegh, Sina Hamidi
    Rose, Richard C.
    2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 555 - 560
  • [38] Neural Network based Speaker Classification and Verification Systems with Enhanced Features
    Ge, Zhenhao
    Iyer, Ananth N.
    Cheluvaraja, Srinath
    Sundaram, Ram
    Ganapathiraju, Aravind
    PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 1089 - 1094
  • [39] Text-independent speaker verification using ant colony optimization-based selected features
    Nemati, Shahla
    Basiri, Mohammad Ehsan
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (01) : 620 - 630
  • [40] Multitaper MFCC and PLP features for speaker verification using i-vectors
    Alam, Md Jahangir
    Kinnunen, Tomi
    Kenny, Patrick
    Ouellet, Pierre
    O'Shaughnessy, Douglas
    SPEECH COMMUNICATION, 2013, 55 (02) : 237 - 251