Speaker Identification Approach Based On Time Domain Extracted Features

Cited by: 0
Authors
Lupu, Eugen [1]
Emerich, Simina [1]
Affiliations
[1] Tech Univ Cluj Napoca, Dept Commun, Cluj Napoca, Romania
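Source
PROCEEDINGS ELMAR-2010 | 2010
Keywords
speaker identification; TESPAR; epoch; SVM; confusion matrix
DOI
Not available
Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification codes
0808; 0809
Abstract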
This paper presents a speaker identification approach based on features extracted by time-domain speech analysis. Most of the features (28) come from the TESPAR (Time Encoded Signal Processing and Recognition) coding method. The other four are obtained from time-domain analysis of the waveform and are computed for every utterance: the relative mean square energy, the number of maxima in the energy envelope, the average pitch frequency, and the relative number of zero crossings. The approach has low computational requirements for feature extraction and provides good recognition rates. The experiments use several classifiers (kNN, Bayes Net, Naive Bayes, RBF and SVM) provided by the WEKA (Waikato Environment for Knowledge Analysis) environment.
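To make the waveform features named in the abstract concrete, here is a minimal Python sketch of three of them (pitch is omitted). The record does not give the authors' definitions, so everything here is an assumption for illustration: the function names, the normalisation of the energy by the peak squared sample, the non-overlapping frames, and the default frame length of 80 samples.

```python
def zero_crossing_fraction(x):
    """Fraction of adjacent sample pairs whose signs differ (assumed definition)."""
    if len(x) < 2:
        return 0.0
    crossings = sum(1 for a, b in zip(x, x[1:]) if (a >= 0) != (b >= 0))
    return crossings / (len(x) - 1)


def relative_mean_square_energy(x):
    """Mean of x^2 divided by the peak x^2 (this normalisation is an assumption)."""
    peak = max((s * s for s in x), default=0.0)
    if peak == 0:
        return 0.0
    return sum(s * s for s in x) / (len(x) * peak)


def energy_envelope_maxima(x, frame=80):
    """Count strict local maxima of the short-time energy.

    The envelope is the sum of squares over non-overlapping frames of
    `frame` samples; the frame length is a hypothetical choice.
    """
    env = [
        sum(s * s for s in x[i:i + frame])
        for i in range(0, len(x) - frame + 1, frame)
    ]
    return sum(
        1 for i in range(1, len(env) - 1)
        if env[i - 1] < env[i] > env[i + 1]
    )
```

All three functions make a single pass over the samples using only additions, multiplications and comparisons, which is consistent with the low computational cost the abstract claims for time-domain feature extraction.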
Pages: 355-358
Page count: 4
Related papers
6 in total
  • [1] [Anonymous], 2004, U. S. Patent, Patent No. 6748354
  • [2] [Anonymous], 1998, ICSPAT TORONTO
  • [3] El-Manzalawy Y., 2005, WLSVM INTEGRATING LI
  • [4] Lupu E., 2009, ACTA TEHNICA NAPOCEN, V50
  • [5] Lupu E., 2003, 30 SESS SCI PRES MOD, P214
  • [6] Vapnik V., 1995, The nature of statistical learning theory