SPEAKER VERIFICATION USING KERNEL-BASED BINARY CLASSIFIERS WITH BINARY OPERATION DERIVED FEATURES

Cited by: 0

Authors:

Lee, Hung-Shin [1,2]

Tso, Yu [3]

Chang, Yun-Fan [3]

Wang, Hsin-Min [2]

Jeng, Shyh-Kang [1]

Affiliations:

[1] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan

[2] Inst Sci Informat, Taipei, Taiwan

[3] Res Ctr Informat Technol Innovat, Taipei, Taiwan

Source:

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014

Keywords:

speaker verification; SVM; DNN; i-vector;

DOI:

Not available

Chinese Library Classification:

O42 [Acoustics];

Discipline classification codes:

070206 ; 082403 ;

Abstract:

In this paper, we study the use of two kinds of kernel-based discriminative models, namely the support vector machine (SVM) and the deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of utterances, each represented by an i-vector, is assumed to belong to either the "within-speaker" group or the "between-speaker" group. To solve the problem, we employ various binary operations that retain the basic relationship between any pair of i-vectors and combine them into a single vector for training the discriminative models. This study also investigates how the achievable performance correlates with the number of training pairs and with various combinations of basic binary operations, using the SVM and DNN binary classifiers. The experiments are conducted on the male portion of the core task in the NIST 2005 Speaker Recognition Evaluation (SRE). In terms of minimum normalized decision cost function (minDCF) and equal error rate (EER), the results are competitive with, or even better than, those of other non-probabilistic models, such as the conventional speaker SVMs and LDA-based cosine distance scoring.
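The pair-to-vector step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which binary operations the authors used, so the three element-wise operations below (sum, absolute difference, product), the names `BINARY_OPS`, `pair_features`, and `label_pair`, and the 1/0 label convention are all assumptions chosen for illustration. Each operation is symmetric, so the resulting feature vector does not depend on the order of the two i-vectors.

```python
from typing import Callable, Dict, List, Sequence

Vector = Sequence[float]

# Hypothetical element-wise binary operations (an assumption: the abstract
# does not list the operations actually used). All three are symmetric.
BINARY_OPS: Dict[str, Callable[[float, float], float]] = {
    "sum": lambda a, b: a + b,
    "abs_diff": lambda a, b: abs(a - b),
    "product": lambda a, b: a * b,
}


def pair_features(
    u: Vector,
    v: Vector,
    ops: Sequence[str] = ("sum", "abs_diff", "product"),
) -> List[float]:
    """Combine two equal-length i-vectors into one feature vector.

    The outputs of the selected operations are concatenated, so the result
    has len(ops) * len(u) entries.
    """
    if len(u) != len(v):
        raise ValueError("i-vectors must have the same dimension")
    features: List[float] = []
    for name in ops:
        op = BINARY_OPS[name]
        features.extend(op(a, b) for a, b in zip(u, v))
    return features


def label_pair(speaker_a: str, speaker_b: str) -> int:
    """Return 1 for a within-speaker pair and 0 for a between-speaker pair
    (the numeric encoding is an assumption)."""
    return 1 if speaker_a == speaker_b else 0


if __name__ == "__main__":
    x = [1.0, -2.0]  # toy 2-dimensional stand-ins for i-vectors
    y = [3.0, 0.5]
    print(pair_features(x, y))       # [4.0, -1.5, 2.0, 2.5, 3.0, -1.0]
    print(label_pair("spk1", "spk1"))  # 1
```

Feature vectors built this way, together with their within-speaker or between-speaker labels, could then be passed to any binary classifier, such as an SVM or a DNN.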


Pages: 5
