Speaker Identification using Frequency Dsitribution in the Transform Domain

被引:0
作者
Kekre, H. B. [1 ]
Kulkarni, Vaishali [2 ]
机构
[1] NMIMS Univ, MPSTME, Comp Dept, Bombay, Maharashtra, India
[2] NMIMS Univ, MPSTME, Elect & Telecommun, Bombay, Maharashtra, India
关键词
Speaker Identification; DFT; DCT; DST; Hartley; Haar; Walsh; Kekre's Transform;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper, we propose Speaker Identification using the frequency distribution of various transforms like DFT (Discrete Fourier Transform), DCT (Discrete Cosine Transform), DST (Discrete Sine Transform), Hartley, Walsh, Haar and Kekre transforms. The speech signal spoken by a particular speaker is converted into frequency domain by applying the different transform techniques. The distribution in the transform domain is utilized to extract the feature vectors in the training and the matching phases. The results obtained by using all the seven transform techniques have been analyzed and compared. It can be seen that DFT, DCT, DST and Hartley transform give comparatively similar results (Above 96%). The results obtained by using Haar and Kekre transform are very poor. The best results are obtained by using DFT (97.19% for a feature vector of size 40).
引用
收藏
页码:73 / 78
页数:6
相关论文
共 28 条
  • [1] [Anonymous], 1992, THESIS
  • [2] Bei C.D., 1998, IEEE T COMMUNICATION
  • [3] A tutorial on text-independent speaker verification
    Bimbot, F
    Bonastre, JF
    Fredouille, C
    Gravier, G
    Magrin-Chagnolleau, I
    Meignier, S
    Merlin, T
    Ortega-García, J
    Petrovska-Delacrétaz, D
    Reynolds, DA
    [J]. EURASIP JOURNAL ON APPLIED SIGNAL PROCESSING, 2004, 2004 (04) : 430 - 451
  • [4] TEXT-DEPENDENT SPEAKER VERIFICATION USING VECTOR QUANTIZATION SOURCE-CODING
    BURTON, DK
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1987, 35 (02): : 133 - 143
  • [5] Speaker recognition: A tutorial
    Campbell, JP
    [J]. PROCEEDINGS OF THE IEEE, 1997, 85 (09) : 1437 - 1462
  • [6] Chakroborty Sandipan, 2009, INT J SIGNAL PROCESS, V5
  • [7] Furui S., 1997, AVBPA97, P237
  • [8] Furui S., 2005, ECTI T COMPUTER INFO, V1
  • [9] Speaker identification using instantaneous frequencies
    Grimaldi, Marco
    Cummins, Fred
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2008, 16 (06): : 1097 - 1111
  • [10] Hassan Mohd Rasheedur, 2004, 3 INT C EL COMP ENG