Querying out-of-vocabulary words in lexicon-based keyword spotting

Authors
Joan Puigcerver
Alejandro H. Toselli
Enrique Vidal
Affiliation
[1] Universitat Politècnica de València, PRHLT Research Center
Source
Neural Computing and Applications | 2017 / Vol. 28
Keywords
Keyword spotting; Lexicon-based; Smoothing; Out-of-vocabulary; Handwritten text recognition
Abstract
Lexicon-based handwritten text keyword spotting (KWS) has proven to be a faster and more accurate alternative to lexicon-free methods. Nevertheless, since lexicon-based KWS relies on a predefined vocabulary, fixed in the training phase, it does not support queries involving out-of-vocabulary (OOV) keywords. In this paper, we outline previous work aimed at solving this problem and present a new approach based on smoothing the (null) scores of OOV keywords by means of the information provided by “similar” in-vocabulary words. Good results achieved using this approach are compared with previously published alternatives on different data sets.
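The smoothing idea in the abstract can be illustrated with a minimal sketch. This is not the authors' formulation, which this record does not give. It assumes that "similar" means small Levenshtein distance, that similarity decays exponentially with that distance, and that an OOV keyword takes the best similarity-weighted score among in-vocabulary words. The names edit_distance, smoothed_score, and decay are hypothetical.

```python
import math


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two strings (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def smoothed_score(keyword: str, lexicon_scores: dict, decay: float = 1.0) -> float:
    """Score a query keyword against one text region.

    lexicon_scores maps each in-vocabulary word to its KWS score for the region.
    In-vocabulary queries keep their own score. An OOV query, which would
    otherwise score zero, borrows from similar in-vocabulary words.
    ASSUMPTION: exp(-decay * distance) weighting and max-combination are
    illustrative choices, not taken from the paper.
    """
    if keyword in lexicon_scores:
        return lexicon_scores[keyword]
    return max(
        (math.exp(-decay * edit_distance(keyword, word)) * score
         for word, score in lexicon_scores.items()),
        default=0.0,
    )


if __name__ == "__main__":
    region_scores = {"house": 0.9, "mouse": 0.2}
    print(smoothed_score("house", region_scores))   # in-vocabulary: unchanged
    print(smoothed_score("houses", region_scores))  # OOV: smoothed, no longer zero
```

In this sketch an OOV query such as "houses" receives a nonzero score because it is one edit away from the high-scoring in-vocabulary word "house"; a larger decay value makes the smoothing more conservative.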
Pages: 2373-2382
Page count: 9