HMM word graph based keyword spotting in handwritten document images

Cited by: 41
Authors
Toselli, Alejandro Hector [1]
Vidal, Enrique [1]
Romero, Veronica [1]
Frinken, Volkmar [2, 3, 4]
Affiliations
[1] Univ Politecn Valencia, Camino Vera S-N, E-46022 Valencia, Spain
[2] Kyushu Univ, Fac Informat Sci & Elect Engn, Fukuoka 812, Japan
[3] Univ Calif Davis, Elect & Comp Engn, Davis, CA 95616 USA
[4] ONU Technol Inc, San Jose, CA USA
Funding
EU Horizon 2020
Keywords
Keyword spotting; Handwritten text recognition; Word graph; Posterior probability; Confidence score; Interactive transcription; Historical documents; Confidence measures; Segmentation; Recognition; Algorithm; Filler; Model
DOI
10.1016/j.ins.2016.07.063
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Line-level keyword spotting (KWS) is presented on the basis of frame-level word posterior probabilities. These posteriors are obtained using word graphs derived from the recognition process of a full-fledged handwritten text recognizer based on hidden Markov models and N-gram language models. This approach has several advantages. First, since it uses a holistic, segmentation-free technology, it does not require any kind of word or character segmentation. Second, the use of language models allows the context of each spotted word to be taken into account, thereby considerably increasing KWS accuracy. Third, the proposed KWS scores are based on true posterior probabilities, taking into account all (or most) possible word segmentations of the input image. These scores are properly bounded and normalized. This mathematically clean formulation lends itself to smooth, threshold-based keyword queries which, in turn, permit comfortable trade-offs between search precision and recall. Experiments are carried out on several historical collections of handwritten text images, as well as a well-known data set of modern English handwritten text. According to the empirical results, the proposed approach achieves KWS results comparable to those obtained with the recently introduced "BLSTM neural networks KWS" approach and clearly outperforms the popular, state-of-the-art "Filler HMM" KWS method. Overall, the results clearly support all the above-claimed advantages of the proposed approach. (C) 2016 Elsevier Inc. All rights reserved.
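The idea of frame-level word posteriors computed from a word graph can be illustrated with a small sketch. It is not the authors' implementation: the graph layout, the function names (`frame_word_posteriors`, `line_score`, `spot`) and the toy scores are all hypothetical, and taking the maximum over frames as the line-level score is an assumption that the abstract does not state. Edge posteriors are obtained with a forward-backward pass, then accumulated over the frames each edge spans.

```python
import math
from collections import defaultdict


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) for a list of log-scores."""
    m = max(values)
    if m == -math.inf:
        return -math.inf
    return m + math.log(sum(math.exp(v - m) for v in values))


def frame_word_posteriors(edges, node_frame, start, final):
    """Frame-level word posteriors from a toy word graph (hypothetical layout).

    edges:      list of (src, dst, word, log_score), with src < dst
                (nodes are assumed to be numbered in topological order).
    node_frame: dict node -> frame index at which the node sits.
    Returns post[word][frame] = posterior that `word` covers `frame`.
    """
    nodes = sorted(node_frame)
    incoming, outgoing = defaultdict(list), defaultdict(list)
    for edge in edges:
        outgoing[edge[0]].append(edge)
        incoming[edge[1]].append(edge)

    # Forward pass: total log-score of all partial paths reaching each node.
    alpha = {start: 0.0}
    for n in nodes:
        if n != start:
            terms = [alpha[s] + w for (s, _, _, w) in incoming[n]]
            alpha[n] = logsumexp(terms) if terms else -math.inf

    # Backward pass: total log-score of all partial paths leaving each node.
    beta = {final: 0.0}
    for n in reversed(nodes):
        if n != final:
            terms = [w + beta[d] for (_, d, _, w) in outgoing[n]]
            beta[n] = logsumexp(terms) if terms else -math.inf

    total = alpha[final]  # log-score of all complete paths
    post = defaultdict(lambda: defaultdict(float))
    for (s, d, word, w) in edges:
        edge_posterior = math.exp(alpha[s] + w + beta[d] - total)
        # Add the edge posterior to every frame the edge spans.
        for frame in range(node_frame[s], node_frame[d]):
            post[word][frame] += edge_posterior
    return post


def line_score(post, word):
    """Line-level score, assumed here to be the maximum frame posterior."""
    frames = post.get(word)
    return max(frames.values()) if frames else 0.0


def spot(post, threshold):
    """Words whose line-level score reaches the threshold."""
    return sorted(w for w in post if line_score(post, w) >= threshold)


# Toy graph over a 5-frame line image; scores are unnormalized on purpose.
NODE_FRAME = {0: 0, 1: 2, 2: 3, 3: 5}
EDGES = [
    (0, 1, "the", math.log(3.0)),
    (0, 2, "then", math.log(2.0)),
    (1, 3, "cat", 0.0),
    (1, 3, "cut", 0.0),
    (2, 3, "at", math.log(2.0)),
]
POST = frame_word_posteriors(EDGES, NODE_FRAME, start=0, final=3)
```

In this toy graph the posteriors of all words covering any given frame sum to one, so every score lies in [0, 1]. Raising the threshold passed to `spot` returns fewer, more confident words (higher precision); lowering it returns more candidates (higher recall).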
Pages: 497-518
Page count: 22
Related papers
50 in total
  • [21] Integrating Visual Word Embeddings into Translation Language Model for Keyword Spotting on Historical Mongolian Document Images
    Wei, Hongxi
    Zhang, Hui
    Gao, Guanglai
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2017, PT II, 2018, 10736 : 616 - 625
  • [22] Keyword Spotting in Historical Devanagari Manuscripts by Word Matching
    Sharada, B.
    Sushma, S. N.
    Bharathlal
    DATA ANALYTICS AND LEARNING, 2019, 43 : 65 - 76
  • [23] Probabilistic multi-word spotting in handwritten text images
    Alejandro H. Toselli
    Enrique Vidal
    Joan Puigcerver
    Ernesto Noya-García
    Pattern Analysis and Applications, 2019, 22 : 23 - 32
  • [24] Multi-task learning for simultaneous script identification and keyword spotting in document images
    Cheikhrouhou, Ahmed
    Kessentini, Yousri
    Kanoun, Slim
    PATTERN RECOGNITION, 2021, 113
  • [25] Visual Language Model for Keyword Spotting on Historical Mongolian Document Images
    Wei, Hongxi
    Gao, Guanglai
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 1737 - 1742
  • [26] A Case Study Of BoVW For Keyword Spotting On Historical Mongolian Document Images
    Guo, Xing
    Wei, Hongxi
    Su, Xiangdong
    2016 9TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI 2016), 2016, : 374 - 378
  • [27] Character confidence based on N-best list for keyword spotting in online Chinese handwritten documents
    Zhang, Heng
    Wang, Da-Han
    Liu, Cheng-Lin
    PATTERN RECOGNITION, 2014, 47 (05) : 1880 - 1890
  • [28] Variational Dynamic Background Model for Keyword Spotting in Handwritten Documents
    Kumar, Gaurav
    Wshah, Safwan
    Govindaraju, Venu
    DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
  • [29] Two-Stage Approach to Keyword Spotting in Handwritten Documents
    Haji, Mehdi
    Ameri, Mohammad R.
    Bui, Tien D.
    Suen, Ching Y.
    Ponson, Dominique
    DOCUMENT RECOGNITION AND RETRIEVAL XXI, 2014, 9021
  • [30] ON THE INFLUENCE OF WORD REPRESENTATIONS FOR HANDWRITTEN WORD SPOTTING IN HISTORICAL DOCUMENTS
    Llados, Josep
    Rusinol, Marcal
    Fornes, Alicia
    Fernandez, David
    Dutta, Anjan
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (05)