Bayesian background models for keyword spotting in handwritten documents

被引:6
|
作者
Kumar, Gaurav [1 ]
Govindaraju, Venu [1 ]
机构
[1] Univ Buffalo, Dept Comp Sci & Engn, 113 Davis Hall, Amherst, NY 14260 USA
关键词
Handwriting recognition; Keyword spotting; Bayesian generalized linear models; Bayesian generalized kernel models; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.patcog.2016.06.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background in a handwritten document can be anything other than the words we are interested in. The characteristics of the background are typically captured by a background model to achieve spotting in handwritten documents. We propose two such Bayesian background models for keyword spotting in handwritten documents. Firstly, we present a background model using the Bayesian generalized linear model called (VDBM) and secondly propose a Bayesian generalized kernel background model called BGKBM. Given a set of handwritten documents and a bunch of keyword and non-keyword scores, the models learn an efficient Bayesian rejection criteria to output the most confident keyword regions in the handwritten document. For the variational dynamic background model (VDBM) the inference of parameters is done using variational methods and for the Bayesian generalized kernel background model (BGKBM), the inference is done using a proposed Markov chain Monte Carlo (MCMC) approach. The models are built on top of the scores returned by a handwritten recognizer for keywords and non-keywords. The approach is recognition based and works at line level. The methods have been validated on publicly available IAM dataset and compared with other state of the art line level keyword spotting approaches.
引用
收藏
页码:84 / 91
页数:8
相关论文
共 50 条
  • [31] Enhancing Low Resource Keyword Spotting with Automatically Retrieved Web Documents
    Zhang, Le
    Karakos, Damianos
    Hartmann, William
    Hsiao, Roger
    Schwartz, Richard
    Tsakalidis, Stavros
    16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5, 2015, : 839 - 843
  • [32] Graph Based Keyword Spotting in Medieval Slavic Documents - A Project Outline
    Riesen, Kaspar
    Brodic, Darko
    Milivojevic, Zoran N.
    Maluckov, Cedomir A.
    DIGITAL HERITAGE: PROGRESS IN CULTURAL HERITAGE: DOCUMENTATION, PRESERVATION, AND PROTECTION, 2014, 8740 : 724 - 731
  • [33] Improving HMM-Based Keyword Spotting with Character Language Models
    Fischer, Andreas
    Frinken, Volkmar
    Bunke, Horst
    Suen, Ching Y.
    2013 12TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2013, : 506 - 510
  • [34] Segmentation Free Keyword Spotting Framework using Dynamic Background Model
    Kumar, Gaurav
    Wshah, Safwan
    Govindaraju, Venu
    Ramachandrula, Sitaram
    DOCUMENT RECOGNITION AND RETRIEVAL XX, 2013, 8658
  • [35] ICFHR2016 Handwritten Keyword Spotting Competition (H-KWS 2016)
    Pratikakis, Ioannis
    Zagoris, Konstantinos
    Gatos, Basilis
    Puigcerver, Joan
    Toselli, Alejandro H.
    Vidal, Enrique
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 613 - 618
  • [36] Word Spotting for Handwritten Documents using Chamfer Distance and Dynamic Time Warping
    Saabni, Raid M.
    El-Sana, Jihad A.
    DOCUMENT RECOGNITION AND RETRIEVAL XVIII, 2011, 7874
  • [37] Math Spotting: Retrieving Math in Technical Documents Using Handwritten Query Images
    Zanibbi, Richard
    Yu, Li
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 446 - 451
  • [38] State sequence pooling training of acoustic models for keyword spotting
    Lopatka, Kuba
    Bocklet, Tobias
    INTERSPEECH 2020, 2020, : 4338 - 4342
  • [39] Robust Keyword Spotting with Rapidly Adapting Point Process Models
    Jansen, Aren
    Niyogi, Partha
    INTERSPEECH 2009: 10TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2009, VOLS 1-5, 2009, : 2727 - 2730
  • [40] Hough Transform-Based Angular Features for Learning-Free Handwritten Keyword Spotting
    Kundu, Subhranil
    Malakar, Samir
    Geem, Zong Woo
    Moon, Yoon Young
    Singh, Pawan Kumar
    Sarkar, Ram
    SENSORS, 2021, 21 (14)