Mel-Frequency Cepstral Coefficients as Features for Automatic Speaker Recognition

被引：0

作者：

Jokic, Ivan D. ^{[1
]}

Jokic, Stevan D. ^{[1
]}

Delic, Vlado D. ^{[1
]}

Peric, Zoran H. ^{[2
]}

机构：

[1] Univ Novi Sad, Fac Tech Sci, Trg Dositeja Obradovica 6, Novi Sad 21000, Serbia

[2] Univ Nis, Fac Elect Engn, Nish 18000, Serbia

来源：

2015 23RD TELECOMMUNICATIONS FORUM TELFOR (TELFOR) | 2015年

关键词：

Automatic speaker recognition; auditory critical bands; covariance matrix; exponential auditory critical bands; mel-frequency cepstral coefficients; multidimensional Gaussian distribution;

D O I：

暂无

中图分类号：

TN [电子技术、通信技术];

学科分类号：

0809 ;

摘要：

Automatic speaker recognizer can be based on the use of mel-frequency cepstral coefficients as speaker features. Mel-frequency cepstral coefficients depend on energy inside considered auditory critical bands. These auditory critical bands model masking phenomena. Application of triangular auditory critical bands results in better recognition accuracy with respect to the case when rectangular auditory critical bands are applied. Recognition accuracy when exponential auditory critical bands are applied outperforms recognition accuracy of automatic speaker recognizer when triangular or rectangular auditory critical bands are applied. Application of transformation on elements of speaker model, which target decreasing of difference between testing and training models of the same speaker, can increase recognition accuracy.

引用

页码：419 / 424

页数：6

共 14 条

[1]

[Anonymous], 2012, INT C COMP GRAPH SIM

[2]

Attabi Y, 2013, INT CONF ACOUST SPEE, P7527, DOI 10.1109/ICASSP.2013.6639126

[3] A tutorial on text-independent speaker verification [J].

Bimbot, F ;

Bonastre, JF ;

Fredouille, C ;

Gravier, G ;

Magrin-Chagnolleau, I ;

Meignier, S ;

Merlin, T ;

Ortega-García, J ;

Petrovska-Delacrétaz, D ;

Reynolds, DA .

EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) :430-451

[4] Speaker recognition: A tutorial [J].

Campbell, JP .

PROCEEDINGS OF THE IEEE, 1997, 85 (09) :1437-1462

[5]

Dhingra S, 2013, Int J Adv, P4085

[6]

Dobrovic M. M., 2012, 2012 IEEE 10th Jubilee International Symposium on Intelligent Systems and Informatics (SISY 2012), P341, DOI 10.1109/SISY.2012.6339541

[7]

Jokic I., 2014, THESIS, P54

[8] Optimizing feature extraction for speech recognition [J].

Lee, CH ;

Hyun, DH ;

Choi, ES ;

Go, JW ;

Lee, CY .

IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 2003, 11 (01) :80-87

[9]

Lyon R. F., 2010, 2010 IEEE International Symposium on Circuits and Systems. ISCAS 2010, P3809, DOI 10.1109/ISCAS.2010.5537724

[10]

Neiberg D, 2006, INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, P809

← 1 2 →