SPEAKER VERIFICATION USING KERNEL-BASED BINARY CLASSIFIERS WITH BINARY OPERATION DERIVED FEATURES

被引：0

作者：

Lee, Hung-Shin ^{[1
,2
]}

Tso, Yu ^{[3
]}

Chang, Yun-Fan ^{[3
]}

Wang, Hsin-Min ^{[2
]}

Jeng, Shyh-Kang ^{[1
]}

机构：

[1] Natl Taiwan Univ, Dept Elect Engn, Taipei, Taiwan

[2] Inst Sci Informat, Taipei, Taiwan

[3] Res Ctr Informat Technol Innovat, Taipei, Taiwan

来源：

2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) | 2014年

关键词：

speaker verification; SVM; DNN; i-vector;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In this paper, we study the use of two kinds of kernel-based discriminative models, namely support vector machine (SVM) and deep neural network (DNN), for speaker verification. We treat the verification task as a binary classification problem, in which a pair of two utterances, each represented by an i-vector, is assumed to belong to either the "within-speaker" group or the "between-speaker" group. To solve the problem, we employ various binary operations to retain the basic relationship between any pair of i-vectors to form a single vector for training the discriminative models. This study also investigates the correlation of achievable performances with the number of training pairs and the various combinations of basic binary operations, using the SVM and DNN binary classifiers. The experiments are conducted on the male portion of the core task in the NIST 2005 Speaker Recognition Evaluation (SRE), and the results are competitive or even better, in terms of normalized decision cost function (minDCF) and equal error rate (EER), while compared to other non-probabilistic based models, such as the conventional speaker SVMs and the LDA-based cosine distance scoring.

引用

页数：5

共 50 条

[31] Secure Binary Embeddings of Front-end Factor Analysis for Privacy Preserving Speaker Verification
Portelo, Jose
Abad, Alberto
Raj, Bhiksha
Trancoso, Isabel
14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2493 - 2497
[32] Using Kernel Discriminant Analysis to Improve the Characterization of the Alternative Hypothesis for Speaker Verification
Chao, Yi-Hsiang
Tsai, Wei-Ho
Wang, Hsin-Min
Chang, Ruei-Chuan
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (08): : 1675 - 1684
[33] Speaker verification using short utterances with DNN-based estimation of subglottal acoustic features
Guo, Jinxi
Yeung, Gary
Muralidharan, Deepak
Arsikere, Harish
Afshan, Amber
Alwan, Abeer
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 2219 - 2222
[34] Speaker verification robust to talking style variation using multiple kernel learning based on conditional entropy minimization
Ogawa, Tetsuji
Hino, Hideitsu
Murata, Noboru
Kobayashi, Tetsunori
12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 2752 - +
[35] Acoustic features selection of speaker verification based on average KL distance
Luan Yu
Li Hongzuo
Wang Yafei
MECHATRONICS, ROBOTICS AND AUTOMATION, PTS 1-3, 2013, 373-375 : 629 - +
[36] FORMANT-GAPS FEATURES FOR SPEAKER VERIFICATION USING WHISPERED SPEECH
Naini, Abinay Reddy
Rao, Achuth M., V
Ghosh, Prasanta Kumar
2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6231 - 6235
[37] DEEP BOTTLENECK FEATURES FOR I-VECTOR BASED TEXT-INDEPENDENT SPEAKER VERIFICATION
Ghalehjegh, Sina Hamidi
Rose, Richard C.
2015 IEEE WORKSHOP ON AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING (ASRU), 2015, : 555 - 560
[38] Neural Network based Speaker Classification and Verification Systems with Enhanced Features
Ge, Zhenhao
Iyer, Ananth N.
Cheluvaraja, Srinath
Sundaram, Ram
Ganapathiraju, Aravind
PROCEEDINGS OF THE 2017 INTELLIGENT SYSTEMS CONFERENCE (INTELLISYS), 2017, : 1089 - 1094
[39] Text-independent speaker verification using ant colony optimization-based selected features
Nemati, Shahla
Basiri, Mohammad Ehsan
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (01) : 620 - 630
[40] Multitaper MFCC and PLP features for speaker verification using i-vectors
Alam, Md Jahangir
Kinnunen, Tomi
Kenny, Patrick
Ouellet, Pierre
O'Shaughnessy, Douglas
SPEECH COMMUNICATION, 2013, 55 (02) : 237 - 251

← 1 2 3 4 5 →