Keyword spotting in unconstrained handwritten Chinese documents using contextual word model

被引:11
作者
Huang, Liang [1 ]
Yin, Fei [2 ]
Chen, Qing-Hu [1 ]
Liu, Cheng-Lin [2 ]
机构
[1] Wuhan Univ, Sch Elect Informat, Wuhan 430079, Hubei, Peoples R China
[2] Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
Keyword spotting; Chinese handwritten documents; Word similarity; Contextual word model; RETRIEVAL; SHAPE; SEGMENTATION; RECOGNITION; ONLINE;
D O I
10.1016/j.imavis.2013.10.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a method for keyword spotting in off-line Chinese handwritten documents using a contextual word model, which measures the similarity between the query word and every candidate word in the document by combining a character classifier and the geometric context as well as linguistic context. The geometric context model characterizes the single-character likeliness and between-character relationship. The linguistic model utilizes the dependency of the word with the external adjacent characters. The combining weights are optimized on training documents. Experiments on a large handwriting database CASIA-HWDB demonstrate the effectiveness of the proposed method and justify the benefits of geometric and linguistic contexts. Compared to transcription-based text search, the proposed method can provide higher recall rate, and for spotting words of four characters, the proposed method provides both higher precision and recall rate. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:958 / 968
页数:11
相关论文
共 50 条
  • [11] Benchmarking discriminative approaches for word spotting in handwritten documents
    Bideault, Gautier
    Mioulet, Luc
    Chatelain, Clement
    Paquet, Thierry
    [J]. 2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), 2015, : 201 - 205
  • [12] Bayesian background models for keyword spotting in handwritten documents
    Kumar, Gaurav
    Govindaraju, Venu
    [J]. PATTERN RECOGNITION, 2017, 64 : 84 - 91
  • [13] Bayesian Active Learning for Keyword Spotting in Handwritten Documents
    Kumar, Gaurav
    Govindaraju, Venu
    [J]. 2014 22ND INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2014, : 2041 - 2046
  • [14] Word spotting for Handwritten Arabic documents using Harris detector
    Elfakiri, Youssef
    Chenouni, Driss
    Khaissidi, Ghizlane
    El Yacoubi, Mounim
    Mrabti, Mostafa
    [J]. 2016 INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY FOR ORGANIZATIONS DEVELOPMENT (IT4OD), 2016,
  • [15] Learning-based word spotting system for Arabic handwritten documents
    Khayyat, Muna
    Lam, Louisa
    Suen, Ching Y.
    [J]. PATTERN RECOGNITION, 2014, 47 (03) : 1021 - 1030
  • [16] Mental model for handwritten keyword spotting
    Brik, Youcef
    Ziou, Djemel
    [J]. JOURNAL OF ELECTRONIC IMAGING, 2018, 27 (05)
  • [17] Character confidence based on N-best list for keyword spotting in online Chinese handwritten documents
    Zhang, Heng
    Wang, Da-Han
    Liu, Cheng-Lin
    [J]. PATTERN RECOGNITION, 2014, 47 (05) : 1880 - 1890
  • [18] Keyword spotting in historical handwritten documents based on graph matching
    Stauffer, Michael
    Fischer, Andreas
    Riesen, Kaspar
    [J]. PATTERN RECOGNITION, 2018, 81 : 240 - 253
  • [19] ON THE INFLUENCE OF WORD REPRESENTATIONS FOR HANDWRITTEN WORD SPOTTING IN HISTORICAL DOCUMENTS
    Llados, Josep
    Rusinol, Marcal
    Fornes, Alicia
    Fernandez, David
    Dutta, Anjan
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2012, 26 (05)
  • [20] Query-Based Word Spotting in Handwritten Documents Using HMM
    Bharathi, V. C.
    Veningston, K.
    Rao, P. V. Venkateswara
    [J]. DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 31 - 39