Visual Language Model for Keyword Spotting on Historical Mongolian Document Images

被引:0
作者
Wei, Hongxi [1 ]
Gao, Guanglai [1 ]
机构
[1] Inner Mongolia Univ, Sch Comp Sci, Hohhot 010021, Peoples R China
来源
2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC) | 2017年
关键词
Visual Language Model; Query Likelihood Model; KL Divergence; Smoothing; Keyword Spotting; WORDS;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Bag-of-Visual-Words (BoVW) approach has been attracted some attention in the field of keyword spotting. However, the BoVW approach discards the spatial relations of the visual words. Therefore, a visual language model is integrated into the BoVW framework in this study so as to add the spatial information. To accomplish the process of keyword spotting, two well-known retrieval schemes, including query likelihood model and KL divergence, have been adopted. The experimental results show that the visual language model can significantly improve the performance of keyword spotting on a collection of historical Mongolian document images than the original BoVW approach. Meanwhile, the influence of different codebook sizes on the performance has been analyzed in this paper. And the best appropriate size of the codebook has been determined.
引用
收藏
页码:1737 / 1742
页数:6
相关论文
共 26 条
[1]   A study of Bag-of-Visual-Words representations for handwritten keyword spotting [J].
Aldavert, David ;
Rusinol, Marcal ;
Toledo, Ricardo ;
Llados, Josep .
INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2015, 18 (03) :223-234
[2]  
[Anonymous], 2008, P 7 ACM INT C IMAGE
[3]  
[Anonymous], 2007, P INT WORKSHOP WORKS
[4]  
[Anonymous], 1998, SIGIR 98 P 21 ANN IN, DOI DOI 10.1145/290941.291008
[5]  
Chen X, 2009, LECT NOTES ARTIF INT, V5476, P867, DOI 10.1007/978-3-642-01307-2_90
[6]   On the Influence of Key Point Encoding for Handwritten Word Spotting [J].
Fernandez-Mota, David ;
Riba, Pau ;
Fornes, Alicia ;
Llados, Josep .
2014 14TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2014, :476-481
[7]   A Study of Language Model for Image Retrieval [J].
Geng, Bo ;
Yang, Linjun ;
Xu, Chao .
2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, :158-+
[8]  
Hongxi Wei, 2016, ICIC Express Letters, Part B: Applications, V7, P1769
[9]  
Hongxi Wei, 2010, 2010 3rd International Conference on Advanced Computer Theory and Engineering (ICACTE 2010), P43, DOI 10.1109/ICACTE.2010.5579111
[10]  
Lafferty John, 2001, P SIGIR, P111, DOI DOI 10.1145/383952.383970