Probabilistic multi-word spotting in handwritten text images

被引:0
|
作者
Alejandro H. Toselli
Enrique Vidal
Joan Puigcerver
Ernesto Noya-García
机构
[1] PRHLT Research Centre,
[2] Universitat Politècnica de València,undefined
来源
Pattern Analysis and Applications | 2019年 / 22卷
关键词
Handwritten text processing; Keyword spotting; Multi-word Boolean queries; Image processing; Pattern recognition;
D O I
暂无
中图分类号
学科分类号
摘要
Keyword spotting techniques are becoming cost-effective solutions for information retrieval in handwritten documents. We explore the extension of the single-word, line-level probabilistic indexing approach described in our previous works to allow for page-level search of queries consisting in Boolean combinations of several single-keywords. We propose heuristic rules to combine the single-word relevance probabilities into probabilistically consistent confidence scores of the multi-word boolean combinations. An empirical study, also presented in this paper, evaluates the search performance of word-pair queries involving AND and OR Boolean operations. Results of this study support the proposed approach and clearly show its effectiveness. Finally, a web-based demonstration system based on the proposed methods is presented.
引用
收藏
页码:23 / 32
页数:9
相关论文
共 24 条
  • [1] Probabilistic multi-word spotting in handwritten text images
    Toselli, Alejandro H.
    Vidal, Enrique
    Puigcerver, Joan
    Noya-Garcia, Ernesto
    PATTERN ANALYSIS AND APPLICATIONS, 2019, 22 (01) : 23 - 32
  • [2] Lexicon-based probabilistic indexing of handwritten text images
    Vidal, Enrique
    Toselli, Alejandro H.
    Puigcerver, Joan
    NEURAL COMPUTING & APPLICATIONS, 2023, 35 (24) : 17501 - 17520
  • [3] Lexicon-based probabilistic indexing of handwritten text images
    Enrique Vidal
    Alejandro H. Toselli
    Joan Puigcerver
    Neural Computing and Applications, 2023, 35 : 17501 - 17520
  • [4] A voting-based technique for word spotting in handwritten document images
    Majumder, Shamik
    Ghosh, Subhrangshu
    Malakar, Samir
    Sarkar, Ram
    Nasipuri, Mita
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12411 - 12434
  • [5] HMM word graph based keyword spotting in handwritten document images
    Toselli, Alejandro Hector
    Vidal, Enrique
    Romero, Veronica
    Frinken, Volkmar
    INFORMATION SCIENCES, 2016, 370 : 497 - 518
  • [6] A voting-based technique for word spotting in handwritten document images
    Shamik Majumder
    Subhrangshu Ghosh
    Samir Malakar
    Ram Sarkar
    Mita Nasipuri
    Multimedia Tools and Applications, 2021, 80 : 12411 - 12434
  • [7] Approximate Search for Keywords in Handwritten Text Images
    Andres, Jose
    Toselli, Alejandro H.
    Vidal, Enrique
    DOCUMENT ANALYSIS SYSTEMS, DAS 2022, 2022, 13237 : 367 - 381
  • [8] Handwritten-word spotting using biologically inspired features
    van der Zant, Tijn
    Schomaker, Lambert
    Haak, Koen
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008, 30 (11) : 1945 - 1957
  • [9] Statistical script independent word spotting in offline handwritten documents
    Wshah, Safwan
    Kumar, Gaurav
    Govindaraju, Venu
    PATTERN RECOGNITION, 2014, 47 (03) : 1039 - 1050
  • [10] Lexicon-free handwritten word spotting using character HMMs
    Fischer, Andreas
    Keller, Andreas
    Frinken, Volkmar
    Bunke, Horst
    PATTERN RECOGNITION LETTERS, 2012, 33 (07) : 934 - 942