SPEAKER VERIFICATION USING KERNEL-BASED BINARY CLASSIFIERS WITH BINARY OPERATION DERIVED FEATURES

被引：0

作者：

Lee, Hung-Shin ^{[1
,2
]}

Tso, Yu ^{[3
]}

Chang, Yun-Fan ^{[3
]}

Wang, Hsin-Min ^{[2
]}

Jeng, Shyh-Kang ^{[1
]}

机构：

[1] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan

[2] Inst Sci Informat, Taipei, Taiwan

[3] Res Ctr Informat Technol Innovat, Taipei, Taiwan

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

speaker verification; SVM; DNN; i-vector;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we study the use of two kinds of kernel-based discriminative models, namely support vector machine (SVM) and deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of two utterances, each represented by an i-vector, is assumed to belong to either the "within-speaker" group or the "between-speaker" group. To solve the problem, we employ various binary operations to retain the basic relationship between any pair of i-vectors to form a single vector for training the discriminative models. This study also investigates the correlation of achievable performances with the number of training pairs and the various combinations of basic binary operations, using the SVM and DNN binary classifiers. The experiments are conducted on the male portion of the core task in the NIST 2005 Speaker Recognition Evaluation (SRE), and the results are competitive or even better, in terms of normalized decision cost function (minDCF) and equal error rate (EER), while compared to other non-probabilistic based models, such as the conventional speaker SVMs and the LDA-based cosine distance scoring.

引用

页数：5

共 50 条

[21] A GMM-based Probabilistic Sequence Kernel for Speaker Verification
Lee, Kong-Aik
You, Changhuai
Li, Haizhou
Kinnunen, Tomi
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 1553 - 1556
[22] A novel data description kernel based on one-class SVM for speaker verification
Shen, Yufeng
Yang, Yingchun
2007 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PTS 1-3, 2007, : 489 - +
[23] Articulatory-feature based sequence kernel for high-level speaker verification
Zhang, Shi-Xiong
Mak, Man-Wei
PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2008, : 2799 - 2804
[24] On-Line Linear Combination of Classifiers Based on Incremental Information in Speaker Verification
Huenupan, Fernando
Becerra Yoma, Nestor
Garreton, Claudio
Molina, Carlos
ETRI JOURNAL, 2010, 32 (03) : 395 - 405
[25] Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers
Abdalmalak, Kerlos Atia
Gallardo-Antolin, Ascension
NEURAL COMPUTING & APPLICATIONS, 2018, 29 (03) : 637 - 651
[26] Using combined features to improve speaker verification in the face of limited reverberant data
Al-Karawi K.A.
Mohammed D.Y.
International Journal of Speech Technology, 2023, 26 (03) : 789 - 799
[27] Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers
Kerlos Atia Abdalmalak
Ascensión Gallardo-Antolín
Neural Computing and Applications, 2018, 29 : 637 - 651
[28] SVM with Gaussian Kernel-based Image Spam Detection on Textual Features
Kumar, Prashant
Biswas, Mantosh
2017 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE & COMMUNICATION TECHNOLOGY (CICT), 2017,
[29] Speaker verification using boosted cepstral features with Gaussian distributions
Salman, Ahmad
Muhammad, Ejaz
Khurshid, Khawar
INMIC 2007: PROCEEDINGS OF THE 11TH IEEE INTERNATIONAL MULTITOPIC CONFERENCE, 2007, : 14 - 18
[30] Audiovisual Speaker Identity Verification Based on Lip Motion Features
Chetty, Girija
Wagner, Michael
INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 2604 - 2607

← 1 2 3 4 5 →