Speaker Identification using Frequency Distribution in the Transform Domain

Cited by: 0

Authors:

Kekre, H. B. [1]

Kulkarni, Vaishali [2]

Affiliations:

[1] NMIMS Univ, MPSTME, Comp Dept, Bombay, Maharashtra, India

[2] NMIMS Univ, MPSTME, Elect & Telecommun, Bombay, Maharashtra, India

Source:

INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS | 2012, Vol. 3, No. 2

Keywords:

Speaker Identification; DFT; DCT; DST; Hartley; Haar; Walsh; Kekre's Transform

DOI:

Not available

Chinese Library Classification:

TP301 [Theory, Methods]

Discipline classification code:

081202

Abstract:

In this paper, we propose speaker identification using the frequency distribution of seven transforms: DFT (Discrete Fourier Transform), DCT (Discrete Cosine Transform), DST (Discrete Sine Transform), Hartley, Walsh, Haar and Kekre transforms. The speech signal spoken by a particular speaker is converted into the frequency domain by applying each transform. The distribution in the transform domain is used to extract the feature vectors in the training and matching phases. The results obtained with all seven transforms have been analyzed and compared. The DFT, DCT, DST and Hartley transforms give similar results (above 96%). The results obtained with the Haar and Kekre transforms are very poor. The best result is obtained with the DFT (97.19% for a feature vector of size 40).
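The abstract does not spell out how the transform-domain distribution is turned into a feature vector, so the following is only a minimal sketch of one plausible reading, using the DFT. The equal-width banding, the normalisation, the Euclidean distance, and the names `feature_vector` and `identify` are all assumptions made for illustration and are not taken from the paper.

```python
import numpy as np

def feature_vector(signal, n_features=40):
    """Summarise a signal's DFT magnitude spectrum as n_features band sums."""
    spectrum = np.abs(np.fft.rfft(np.asarray(signal, dtype=float)))
    # Assumption: split the one-sided spectrum into equal-width bands.
    bands = np.array_split(spectrum, n_features)
    features = np.array([band.sum() for band in bands])
    # Assumption: normalise so the vector describes a distribution.
    total = features.sum()
    return features / total if total > 0 else features

def identify(test_signal, references, n_features=40):
    """Return the label of the stored reference vector closest to the test signal."""
    test = feature_vector(test_signal, n_features)
    # Assumption: Euclidean distance as the matching criterion.
    return min(references, key=lambda name: np.linalg.norm(references[name] - test))

# Toy usage with synthetic tones standing in for two speakers.
t = np.arange(8000) / 8000.0
references = {
    "speaker_a": feature_vector(np.sin(2 * np.pi * 250 * t)),
    "speaker_b": feature_vector(np.sin(2 * np.pi * 1500 * t)),
}
match = identify(np.sin(2 * np.pi * 230 * t), references)
```

Swapping `np.fft.rfft` for another transform would give the corresponding variant of the same sketch; the size 40 mirrors the feature vector size quoted in the abstract.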


Pages: 73-78

Page count: 6
