Accuracy of MFCC-based speaker recognition in Series 60 device

被引:9
作者
Saastamoinen, J [1 ]
Karpov, E [1 ]
Hautamäki, V [1 ]
Fränti, P [1 ]
机构
[1] Univ Joensuu, Dept Comp Sci, FIN-80101 Joensuu, Finland
关键词
speaker identification; fixed point arithmetic; round-off error; MFCC; FFT; Symbian;
D O I
10.1155/ASP.2005.2816
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A fixed point implementation of speaker recognition based on MFCC signal processing is considered. We analyze the numerical error of the MFCC and its effect on the recognition accuracy. Techniques to reduce the information loss in a converted fixed point implementation are introduced. We increase the signal processing accuracy by adjusting the ratio of presentation accuracy of the operators and the signal. The signal processing error is found out to be more important to the speaker recognition accuracy than the error in the classification algorithm. The results are verified by applying the alternative technique to speech data. We also discuss the specific programming requirements set up by the Symbian and Series 60.
引用
收藏
页码:2816 / 2827
页数:12
相关论文
共 17 条
[1]   Error analysis of the Kmetz/Maenner algorithm [J].
Arnold, M ;
Bailey, T ;
Cowles, J .
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2003, 33 (1-2) :37-53
[2]  
DATTALO S, 2003, LOGARITHMS DEC
[3]  
*DIG INC, 2003, PROGR SER DIG INC, V60
[4]  
Frigo M, 1998, INT CONF ACOUST SPEE, P1381, DOI 10.1109/ICASSP.1998.681704
[5]  
GUNASEKARA O, 1998, DEV DIGITAL CELLULAR
[6]  
HARRISON R, 2003, SYMBIAN OS C MOBILE
[7]  
Kabal P., 1986, ICASSP 86 Proceedings. IEEE-IECEJ-ASJ International Conference on Acoustics, Speech and Signal Processing (Cat. No.86CH2243-4), P221
[8]  
KINNUNEN T, 2004, THESIS U JOENSUU JOE
[9]  
KINNUNEN T, IN PRESS IEEE T SPEE
[10]  
KINNUNEN T, 2003, P 8 EUR C SPEECH COM, P2641