Bayesian background models for keyword spotting in handwritten documents

被引:6
|
作者
Kumar, Gaurav [1 ]
Govindaraju, Venu [1 ]
机构
[1] Univ Buffalo, Dept Comp Sci & Engn, 113 Davis Hall, Amherst, NY 14260 USA
关键词
Handwriting recognition; Keyword spotting; Bayesian generalized linear models; Bayesian generalized kernel models; CLASSIFICATION; REGRESSION;
D O I
10.1016/j.patcog.2016.06.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Background in a handwritten document can be anything other than the words we are interested in. The characteristics of the background are typically captured by a background model to achieve spotting in handwritten documents. We propose two such Bayesian background models for keyword spotting in handwritten documents. Firstly, we present a background model using the Bayesian generalized linear model called (VDBM) and secondly propose a Bayesian generalized kernel background model called BGKBM. Given a set of handwritten documents and a bunch of keyword and non-keyword scores, the models learn an efficient Bayesian rejection criteria to output the most confident keyword regions in the handwritten document. For the variational dynamic background model (VDBM) the inference of parameters is done using variational methods and for the Bayesian generalized kernel background model (BGKBM), the inference is done using a proposed Markov chain Monte Carlo (MCMC) approach. The models are built on top of the scores returned by a handwritten recognizer for keywords and non-keywords. The approach is recognition based and works at line level. The methods have been validated on publicly available IAM dataset and compared with other state of the art line level keyword spotting approaches.
引用
收藏
页码:84 / 91
页数:8
相关论文
共 50 条
  • [41] Discriminative keyword spotting
    Keshet, Joseph
    Grangier, David
    Bengio, Samy
    SPEECH COMMUNICATION, 2009, 51 (04) : 317 - 329
  • [42] LDA-Based Word Image Representation for Keyword Spotting on Historical Mongolian Documents
    Wei, Hongxi
    Gao, Guanglai
    Su, Xiangdong
    NEURAL INFORMATION PROCESSING, ICONIP 2016, PT IV, 2016, 9950 : 432 - 441
  • [43] A Model for Evaluating the Performance of a Multiple Keywords Spotting System for the Transcription of Historical Handwritten Documents
    Marcelli, Angelo
    De Gregorio, Giuseppe
    Santoro, Adolfo
    JOURNAL OF IMAGING, 2020, 6 (11)
  • [44] METRIC LEARNING FOR KEYWORD SPOTTING
    Huh, Jaesung
    Lee, Minjae
    Heo, Heesoo
    Mun, Seongkyu
    Chung, Joon Son
    2021 IEEE SPOKEN LANGUAGE TECHNOLOGY WORKSHOP (SLT), 2021, : 133 - 140
  • [45] A New Keyword Spotting Approach
    Bahi, Halima
    Benati, Nadia
    2009 INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS (ICMCS 2009), 2009, : 77 - +
  • [46] Latency Control for Keyword Spotting
    Jose, Christin
    Wang, Joseph
    Strimel, Grant P.
    Khursheed, Mohammad Omar
    Mishchenko, Yuriy
    Kulis, Brian
    INTERSPEECH 2022, 2022, : 1891 - 1895
  • [47] FEDERATED LEARNING FOR KEYWORD SPOTTING
    Leroy, David
    Coucke, Alice
    Lavril, Thibaut
    Gisselbrecht, Thibault
    Dureau, Joseph
    2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2019, : 6341 - 6345
  • [48] Keyword Spotting with Quaternionic ResNet: Application to Spotting in Greek Manuscripts
    Sfikas, Giorgos
    Retsinas, George
    Giotis, Angelos P.
    Gatos, Basilis
    Nikou, Christophoros
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 382 - 396
  • [49] Training Keyword Spotting Models on Non-IID Data with Federated Learning
    Hard, Andrew
    Partridge, Kurt
    Nguyen, Cameron
    Subrahmanya, Niranjan
    Shah, Aishanee
    Zhu, Pai
    Moreno, Ignacio Lopez
    Mathews, Rajiv
    INTERSPEECH 2020, 2020, : 4343 - 4347
  • [50] Keyword Spotting in Historical Devanagari Manuscripts by Word Matching
    Sharada, B.
    Sushma, S. N.
    Bharathlal
    DATA ANALYTICS AND LEARNING, 2019, 43 : 65 - 76