An efficient lattice-based phonetic search method for accelerating keyword spotting in large speech databases

被引:3
作者
Tetariy, Ella [1 ]
Gishri, Michal [1 ]
Har-Lev, Baruch [1 ]
Aharonson, Vered [1 ]
Moyal, Ami [1 ]
机构
[1] Afeka Acad Coll Engn, ACLP, Tel Aviv, Israel
关键词
Keyword spotting; Phonetic search; Anchor-based search; Searching large speech databases; Efficient phonetic search;
D O I
10.1007/s10772-012-9171-3
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper describes an algorithm for the reduction of computational complexity in phonetic search Key-Word Spotting (KWS). This reduction is particularly important when searching for keywords within very large speech databases and aiming for rapid response time. The suggested algorithm consists of an anchor-based phoneme search that reduces the search space by generating hypotheses only around phonemes recognized with high reliability. Three databases have been used for the evaluation: IBM Voicemail I and Voicemail II, consisting of long spontaneous utterances and theWall Street Journal portion of the MACROPHONE database, consisting of read speech utterances. The results indicated a significant reduction of nearly 90% in the computational complexity of the search while improving the false alarm rate, with only a small decrease in the detection rate in both databases. Search space reduction, as well as, performance gain or loss can be controlled according to the user preferences via the suggested algorithm parameters and thresholds.
引用
收藏
页码:161 / 169
页数:9
相关论文
共 16 条
[1]  
Alon G, 2005, KEY WORD SPOTTING BA
[2]  
Amir A., 2001, Proceedings of the 2001 ACM CIKM. Tenth International Conference on Information and Knowledge Management, P580, DOI 10.1145/502585.502697
[3]  
Bernstein J., 1994, MACROPHONE
[4]  
Clements M., 2001, P BROADC ENG C WASH, P131
[5]  
Gishri M, 2010, P 7 C INT LANG RES E
[6]  
Gusfield D., 1997, ALGORITHMS STRINGS T
[7]  
Hermelin D., 2009, P 26 INT S THEOR ASP
[8]  
James D. A., 1994, P INT C AC SPEECH SI, V1, P337
[9]  
Padmanabhan M., 1998, VOICEMAIL CORPUS I
[10]  
Padmanabhan M., 2002, VOICEMAIL CORPUS 2