HMM word graph based keyword spotting in handwritten document images

Cited by: 41
Authors
Toselli, Alejandro Hector [1]
Vidal, Enrique [1]
Romero, Veronica [1]
Frinken, Volkmar [2, 3, 4]
Affiliations
[1] Univ Politecn Valencia, Camino Vera S-N, E-46022 Valencia, Spain
[2] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka 812, Japan
[3] Univ Calif Davis, Elect & Comp Engn, Davis, CA 95616 USA
[4] ONU Technol Inc, San Jose, CA USA
Funding
European Union Horizon 2020
Keywords
Keyword spotting; Handwritten text recognition; Word graph; Posterior probability; Confidence score; INTERACTIVE TRANSCRIPTION; HISTORICAL DOCUMENTS; CONFIDENCE MEASURES; SEGMENTATION; RECOGNITION; ALGORITHM; FILLER; MODEL;
DOI
10.1016/j.ins.2016.07.063
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Line-level keyword spotting (KWS) is presented on the basis of frame-level word posterior probabilities. These posteriors are obtained using word graphs derived from the recognition process of a full-fledged handwritten text recognizer based on hidden Markov models and N-gram language models. This approach has several advantages. First, since it uses a holistic, segmentation-free technology, it does not require any kind of word or character segmentation. Second, the use of language models allows the context of each spotted word to be taken into account, thereby considerably increasing KWS accuracy. Third, the proposed KWS scores are based on true posterior probabilities, taking into account all (or most) possible word segmentations of the input image. These scores are properly bounded and normalized. This mathematically clean formulation lends itself to smooth, threshold-based keyword queries which, in turn, permit comfortable trade-offs between search precision and recall. Experiments are carried out on several historic collections of handwritten text images, as well as a well-known data set of modern English handwritten text. According to the empirical results, the proposed approach achieves KWS results comparable to those obtained with the recently introduced "BLSTM neural networks KWS" approach and clearly outperforms the popular, state-of-the-art "Filler HMM" KWS method. Overall, the results clearly support all the above-claimed advantages of the proposed approach. © 2016 Elsevier Inc. All rights reserved.
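The abstract gives no formulas, so the following is only a minimal sketch of how a line-level score could be derived from frame-level word posteriors. It assumes that each word-graph edge carries a word label, a frame span, and an edge posterior, that the frame-level posterior of a word is the sum of the posteriors of all edges with that label covering the frame, and that the line score is the maximum over frames. The toy graph, its numbers, and the names `frame_posteriors`, `line_score`, and `spot` are hypothetical and not taken from the paper.

```python
# Hypothetical toy word graph for one text line of 10 frames (0..9).
# Each edge: (word, first_frame, last_frame, edge_posterior).
# The values are invented for illustration; "cat" appears with two
# alternative segmentations, which is where summing over edges matters.
EDGES = [
    ("the", 0, 3, 0.7),
    ("they", 0, 4, 0.3),
    ("cat", 4, 9, 0.4),
    ("cat", 5, 9, 0.2),
    ("cut", 4, 9, 0.3),
    ("at", 5, 9, 0.1),
]


def frame_posteriors(edges, word, num_frames):
    """Per-frame posterior of `word`: sum over edges with that label covering the frame."""
    post = [0.0] * num_frames
    for label, start, end, p in edges:
        if label == word:
            for i in range(start, end + 1):
                post[i] += p
    return post


def line_score(edges, word, num_frames):
    """Assumed line-level score: the largest frame-level posterior (0.0 if the word is absent)."""
    return max(frame_posteriors(edges, word, num_frames), default=0.0)


def spot(scores_by_line, threshold):
    """Return the ids of lines whose score reaches the threshold, in sorted order."""
    return sorted(line_id for line_id, s in scores_by_line.items() if s >= threshold)


if __name__ == "__main__":
    scores = {
        "line-1": line_score(EDGES, "cat", 10),
        "line-2": line_score(EDGES, "dog", 10),
    }
    # Lowering the threshold trades precision for recall.
    for t in (0.5, 0.0):
        print(t, spot(scores, t))
```

Because the scores in this sketch are sums of posteriors over mutually exclusive paths, they stay within [0, 1], which is what makes a single threshold meaningful across lines.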
Pages: 497-518
Number of pages: 22
Related papers
50 in total
  • [41] Hough Transform-Based Angular Features for Learning-Free Handwritten Keyword Spotting
    Kundu, Subhranil
    Malakar, Samir
    Geem, Zong Woo
    Moon, Yoon Young
    Singh, Pawan Kumar
    Sarkar, Ram
    SENSORS, 2021, 21 (14)
  • [42] Bayesian background models for keyword spotting in handwritten documents
    Kumar, Gaurav
    Govindaraju, Venu
    PATTERN RECOGNITION, 2017, 64 : 84 - 91
  • [43] Keyword Spotting in Offline Chinese Handwritten Documents Using a Statistical Model
    Huang, Liang
    Yin, Fei
    Chen, Qing-Hu
    Liu, Cheng-Lin
    11TH INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR 2011), 2011, : 78 - 82
  • [44] Bayesian Active Learning for Keyword Spotting in Handwritten Documents
    Kumar, Gaurav
    Govindaraju, Venu
    2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2041 - 2046
  • [45] Keyword spotting in handwritten Chinese documents using semi-Markov conditional random fields
    Zhang, Heng
    Zhou, Xiang-Dong
    Liu, Cheng-Lin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 58 : 49 - 61
  • [46] A survey of document image word spotting techniques
    Giotis, Angelos P.
    Sfikas, Giorgos
    Gatos, Basilis
    Nikou, Christophoros
    PATTERN RECOGNITION, 2017, 68 : 310 - 332
  • [47] SpottingNet: Learning the Similarity of Word Images with Convolutional Neural Network for Word Spotting in Handwritten Historical Documents
    Zhong, Zhuoyao
    Pan, Weishen
    Jin, Lianwen
    Mouchere, Harold
    Viard-Gaudin, Christian
    PROCEEDINGS OF 2016 15TH INTERNATIONAL CONFERENCE ON FRONTIERS IN HANDWRITING RECOGNITION (ICFHR), 2016, : 295 - 300
  • [48] A Resource-Dependent Approach to Word Modeling for Keyword Spotting
    Chen, I-Fan
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 2543 - 2547
  • [49] A Hybrid HMM/DNN Approach to Keyword Spotting of Short Words
    Chen, I-Fan
    Lee, Chin-Hui
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 1573 - 1577
  • [50] Document Image Retrieval Based on Keyword Spotting Using Relevance Feedback
    Keyvanpour, M.
    Tavoli, R.
    Mozaffari, S.
    INTERNATIONAL JOURNAL OF ENGINEERING, 2014, 27 (01): : 7 - 13