A Case Study Of BoVW For Keyword Spotting On Historical Mongolian Document Images

被引:0
作者
Guo, Xing [1 ]
Wei, Hongxi [1 ]
Su, Xiangdong [1 ]
机构
[1] Inner Mongolia Univ, Sch Comp Sci, Hohhot 010021, Peoples R China
来源
2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016) | 2016年
关键词
Bag of Visual Words; Keyword Spotting; Document Image Retrieval; SIFT Descriptors;
D O I
暂无
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
This paper proposes a Bag of Visual Words (BoVW) based approach for keyword spotting on the Mongolian historical document images. In this paper, the first step is dividing the scanned Mongolian historical document images into word images by some preprocessing steps, such as connected component analysis, binarization etc. Then, all of image in our training set are processed in the following steps, including extracting keypoints, obtaining local descriptors and formulating visual word. Finally, each word image can be represented as a histogram of visual words by a codebook. In the retrieval stage, a provided query keyword image is also converted into a histogram of visual words through the above-mentioned procedure. After that, similarities between a query keyword image and whole candidate of word images can be calculated. Therefore, a sorted list will be returned in descending order of the similarities. Moreover, spatial information of visual word is introduced into the original framework of BoVW by the spatial pyramid matching (SPM) technology. Experimental results show that addition of spatial information obtains a good performance on our dataset.
引用
收藏
页码:374 / 378
页数:5
相关论文
共 15 条
  • [1] [Anonymous], 2006, 2006 IEEE COMP SOC C
  • [2] Automatic panoramic image stitching using invariant features
    Brown, Matthew
    Lowe, David G.
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2007, 74 (01) : 59 - 73
  • [3] Csurka G., 2004, WORKSH STAT LEARN CO, V1, P1, DOI DOI 10.1234/12345678
  • [4] Classical Mongolian Words Recognition in Historical Document
    Gao, Guanglai
    Su, Xiangdong
    Wei, Hongxi
    Gong, Yeyun
    [J]. 11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 692 - 697
  • [5] Lazebnik S., COMPUTER VISION PATT, V2, P2169
  • [6] Lowe D G., 2005, INT J COMPUT VISION, V60, P90
  • [7] Manmatha R., 1996, Proceedings of the 1st ACM International Conference on Digital Libraries, P151, DOI 10.1145/226931.226960
  • [8] A performance evaluation of local descriptors
    Mikolajczyk, K
    Schmid, C
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (10) : 1615 - 1630
  • [9] Scale & affine invariant interest point detectors
    Mikolajczyk, K
    Schmid, C
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (01) : 63 - 86
  • [10] Word spotting for historical documents
    Rath, Tony M.
    Manmatha, R.
    [J]. INTERNATIONAL JOURNAL ON DOCUMENT ANALYSIS AND RECOGNITION, 2007, 9 (2-4) : 139 - 152