Assessment of a Speaker Recognition System Based on an Auditory Model and Neural Nets

被引:0
作者
Martinez-Rams, Ernesto A. [1 ]
Garceran-Hernandez, Vicente [2 ]
机构
[1] Univ Oriente, Ave Amer S-N, Santiago De Cuba, Cuba
[2] Univ Politecnica Cartagena, Antiguo Cuartel Antiguones, Murcia, Spain
来源
BIOINSPIRED APPLICATIONS IN ARTIFICIAL AND NATURAL COMPUTATION, PT II | 2009年 / 5602卷
关键词
IDENTIFICATION;
D O I
暂无
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
This paper deals with a new speaker recognition system based on a model of the human auditory system. Our model is based on a human nonlinear cochlear filter-bank and Neural Nets. The efficiency of this system has been tested using a number of Spanish words from the 'Ahumada' database as uttered by a native male speaker. These words were fed into the cochlea model and their corresponding outputs were processed with an envelope component extractor, yielding five parameters that convey different auditory sensations (loudness, roughness and virtual tones). Because this process generates large data sets, the use of multivariate statistical methods and Neural Nets was appropriate. A variety of normalization techniques and classifying methods were tested on this biologically motivated feature set.
引用
收藏
页码:488 / +
页数:3
相关论文
共 25 条
[1]  
Anderson T.R., 1993, P 1993 INT C AC SPEE, V2, P231
[2]  
Anderson T.R., 1991, P AC SPEECH SIGN PRO, P149
[3]  
ANDERSON TR, 1994, IEEE INT C NEUR NETW, V7, P4466
[4]  
[Anonymous], 1960, Experiments in Hearing
[5]   SPEECH ANALYSIS AND SYNTHESIS BY LINEAR PREDICTION OF SPEECH WAVE [J].
ATAL, BS ;
HANAUER, SL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1971, 50 (02) :637-+
[6]  
Colombi J.M., 1992, THESIS
[7]  
Colombi J.M., 1993, IEEE INT C NEUR NETW, P1914
[8]  
Fant G., 1971, ACOUSTIC THEORY SPEE
[9]   THERAPEUTIC ACTIVITY OF PRETAZETTINE ON EHRLICH ASCITES-CARCINOMA - ADJUVANT EFFECT ON STANDARD DRUGS IN ABC REGIMEN [J].
FURUSAWA, E ;
LUM, MKM ;
FURUSAWA, S .
CHEMOTHERAPY, 1981, 27 (04) :277-286
[10]  
Hunt M. J., 1988, ICASSP 88: 1988 International Conference on Acoustics, Speech, and Signal Processing (Cat. No.88CH2561-9), P215, DOI 10.1109/ICASSP.1988.196552