A forward masking auditory model and its application in speaker identification and speech recognition

Cited by: 0
Authors
Liu, ZM [1]
Wu, XH [1]
Zhen, B [1]
Chi, HS [1]
Affiliation
[1] Peking Univ, Ctr Informat Sci, Natl Key Lab Machine Percept, Beijing 100871, Peoples R China
Source
CHINESE JOURNAL OF ELECTRONICS | 2001 / Vol. 10 / No. 02
Keywords
forward masking; MFCC; firing-rate cepstrum; synchronized rate cepstrum
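DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Subject classification code
0808; 0809
Abstract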
A novel computational auditory model that simulates the forward-masking mechanism of auditory nerve discharge is presented. Two features based on the model are extracted: FMFRC (forward masking firing-rate cepstrum) and FMSRC (forward masking synchronized rate cepstrum). Isolated-word speech recognition and text-dependent speaker identification experiments are performed on the TI46 corpus. The results show that the new features based on the forward masking model are far more robust than MFCC (mel-frequency cepstrum coefficients), and that performance improves compared with features lacking this dynamic property. Moreover, the model and the feature extraction method based on it are feasible in practice and promising for robust speech recognition and speaker identification.
Pages: 196-199
Page count: 4