Discriminative likelihood score weighting based on acoustic-phonetic classification for speaker identification

被引:2
作者
Suh, Youngjoo [1 ]
Kim, Hoirin [1 ]
机构
[1] Korea Adv Inst Sci & Technol, Dept Elect Engn, Taejon 305701, South Korea
关键词
Discriminative training; Acoustic-phonetic classification; Score weighting; Speaker identification;
D O I
10.1186/1687-6180-2014-126
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, a new discriminative likelihood score weighting technique is proposed for speaker identification. The proposed method employs a discriminative weighting of frame-level log-likelihood scores with acoustic-phonetic classification in the Gaussian mixture model (GMM)-based speaker identification. Experiments performed on the Aurora noise-corrupted TIMIT database showed that the proposed approach provides meaningful performance improvement with an overall relative error reduction of 15.8% over the maximum likelihood-based baseline GMM approach.
引用
收藏
页码:1 / 7
页数:7
相关论文
共 19 条
[1]   Support vector machines using GMM supervectors for speaker verification [J].
Campbell, WM ;
Sturim, DE ;
Reynolds, DA .
IEEE SIGNAL PROCESSING LETTERS, 2006, 13 (05) :308-311
[2]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[3]   Front-End Factor Analysis for Speaker Verification [J].
Dehak, Najim ;
Kenny, Patrick J. ;
Dehak, Reda ;
Dumouchel, Pierre ;
Ouellet, Pierre .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2011, 19 (04) :788-798
[4]  
Fine S, 2001, INT CONF ACOUST SPEE, P417, DOI 10.1109/ICASSP.2001.940856
[5]  
Fisher W., 1986, PROC DARPA WORKSHOP, P93
[6]  
Hirsch Hans gunter, 2000, 6 INT C SPOK LANG PR, V2000, P16
[7]   Minimum classification error rate methods for speech recognition [J].
Juang, BH ;
Chou, W ;
Lee, CH .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1997, 5 (03) :257-265
[8]   DISCRIMINATIVE LEARNING FOR MINIMUM ERROR CLASSIFICATION [J].
JUANG, BH ;
KATAGIRI, S .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (12) :3043-3054
[9]   Robust speaker recognition based on filtering in autocorrelation domain and sub-band feature recombination [J].
Kim, Sungtak ;
Ji, Miyoung ;
Kim, Hoirin .
PATTERN RECOGNITION LETTERS, 2010, 31 (07) :593-599
[10]   Real-time speaker identification and verification [J].
Kinnunen, T ;
Karpov, E ;
Fränti, P .
IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (01) :277-288