A forward masking auditory model and its application in speaker identification and speech recognition

被引：0

作者：

Liu, ZM ^{[1
]}

Wu, XH ^{[1
]}

Zhen, B ^{[1
]}

Chi, HS ^{[1
]}

机构：

[1] Peking Univ, Ctr Informat Sci, Natl Key Lab Machine Percept, Beijing 100871, Peoples R China

来源：

CHINESE JOURNAL OF ELECTRONICS | 2001年 / 10卷 / 02期

关键词：

forward masking; MFCC; firing-rate cepstrum; synchronized rate cepstrum;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

A novel computational auditory model which simulates the forward-masking mechanism of auditory nerve discharge is presented. Both features based on the model are extracted: FMFRC (forward masking firing-rate cepstrum) and FMSRC (forward masking synchronized rate cepstrum). Isolated-word speech recognition and text-dependent speaker identification experiments based on TI46 are performed. The results show that the new features based on the forward masking model is far more robust than MFCC (mel-frequency cepstrum coefficients) and the performance will be improved compared to the features without such dynamic property. Moreover, the model and the feature extraction method based on it are feasible in practice and promising in robust speech recognition and speaker identification.

引用

页码：196 / 199

页数：4

共 8 条

[1] [Anonymous], 1993, FUNDAMENTAL SPEECH R
[2] Speaker recognition: A tutorial
Campbell, JP
[J]. PROCEEDINGS OF THE IEEE, 1997, 85 (09) : 1437 - 1462
[3] ON THE ROLE OF SPECTRAL TRANSITION FOR SPEECH-PERCEPTION
FURUI, S
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 80 (04) : 1016 - 1025
[4] SIMULATION OF MECHANICAL TO NEURAL TRANSDUCTION IN THE AUDITORY RECEPTOR
MEDDIS, R
[J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 79 (03) : 702 - 711
[5] Patterson R. D., 1988, APU Report 2341
[6] PATTERSON RD, 1990, ADV SPEECH HEARING L, V3
[7] STROPE B, 1996, IEEE INT C AC SPEECH, V1, P37
[8] Wu Xihong, 1999, Chinese Journal of Electronics, V8, P413

← 1 →