Bayesian background models for keyword spotting in handwritten documents

Cited by: 6
Authors
Kumar, Gaurav [1]
Govindaraju, Venu [1]
Affiliations
[1] Univ Buffalo, Dept Comp Sci & Engn, 113 Davis Hall, Amherst, NY 14260 USA
Keywords
Handwriting recognition; Keyword spotting; Bayesian generalized linear models; Bayesian generalized kernel models; CLASSIFICATION; REGRESSION;
DOI
10.1016/j.patcog.2016.06.030
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Background in a handwritten document can be anything other than the words we are interested in. The characteristics of the background are typically captured by a background model to achieve spotting in handwritten documents. We propose two Bayesian background models for keyword spotting in handwritten documents. First, we present a background model based on the Bayesian generalized linear model, called the variational dynamic background model (VDBM); second, we propose a Bayesian generalized kernel background model (BGKBM). Given a set of handwritten documents and a set of keyword and non-keyword scores, the models learn an efficient Bayesian rejection criterion to output the most confident keyword regions in the handwritten document. For VDBM, the parameters are inferred using variational methods; for BGKBM, inference uses a proposed Markov chain Monte Carlo (MCMC) approach. The models are built on top of the scores returned by a handwriting recognizer for keywords and non-keywords. The approach is recognition-based and works at the line level. The methods have been validated on the publicly available IAM dataset and compared with other state-of-the-art line-level keyword spotting approaches.
Pages: 84-91
Number of pages: 8