ZCPA Features for Speech Recognition Implementation into MASPER training procedure for Slovak language

Cited by: 0
Authors
Kacur, Juraj [1]
Varga, Mario [1]
Rozinaj, Gregor [1]
Affiliations
[1] Slovak Univ Technol Bratislava, FEI, Inst Telecommun, Bratislava, Slovakia
Source
2012 IX INTERNATIONAL SYMPOSIUM ON TELECOMMUNICATIONS (BIHTEL) | 2012
Keywords
speech recognition; ZCPA; MFCC; PLP; speech features; HMM; MASPER
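DOI
Not available
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract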
In this article we present the implementation, modification, and optimization of the zero-crossing peak amplitude (ZCPA) speech feature extraction method in a Slovak speech recognition system. ZCPA features closely mimic the human auditory system in the time domain, and thus they should be more robust against common noises. In addition to the basic configuration, several modifications were suggested, implemented, and evaluated. Furthermore, optimal settings were found on a real system using a professional database and the MASPER training procedure, and the results were compared with the classical MFCC and PLP features in different scenarios and noise conditions.
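The abstract does not spell out the ZCPA computation, so the following is only a minimal sketch of the commonly described scheme for a single subband frame, not the configuration used in the paper. Intervals between successive upward zero crossings give a frequency estimate, and the log-compressed peak amplitude within each interval is added to the matching frequency bin. The function name `zcpa_histogram`, the bin edges, and the test tone are hypothetical, and the auditory band-pass filter bank that normally precedes this step is omitted.

```python
import numpy as np

def zcpa_histogram(band_signal, sample_rate, bin_edges_hz):
    """Accumulate a ZCPA-style frequency histogram for one subband frame.

    Assumption: band_signal is already band-pass filtered; the filter
    bank itself is not part of this sketch.
    """
    x = np.asarray(band_signal, dtype=float)
    # Sample indices where the signal crosses zero in the upward direction.
    up = np.flatnonzero((x[:-1] < 0) & (x[1:] >= 0)) + 1
    hist = np.zeros(len(bin_edges_hz) - 1)
    for start, end in zip(up[:-1], up[1:]):
        # The interval length in samples gives a frequency estimate in Hz.
        freq = sample_rate / (end - start)
        # Peak amplitude between the two crossings (the positive half-wave).
        peak = x[start:end].max()
        # Locate the histogram bin whose edges enclose the estimate.
        b = np.searchsorted(bin_edges_hz, freq, side="right") - 1
        if 0 <= b < len(hist):
            # Log compression of the peak, as in the usual ZCPA description.
            hist[b] += np.log1p(peak)
    return hist

# Hypothetical usage: a 50 ms frame of a 500 Hz tone sampled at 8 kHz.
fs = 8000
t = np.arange(400) / fs
frame = np.sin(2 * np.pi * 500 * t + 0.3)
edges = np.array([0.0, 250.0, 750.0, 2000.0, 4000.0])
hist = zcpa_histogram(frame, fs, edges)
```

In a full front end, one such histogram would typically be computed per filter-bank channel and the channel histograms summed into a single feature vector per frame; the bin layout and compression function are among the settings the abstract describes as subject to optimization.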
Pages: 4