High Performance Query-by-Example Keyword Spotting Using Query-by-String Techniques

Authors
Vidal, Enrique [1]
Toselli, Alejandro H. [1]
Puigcerver, Joan [1]
Affiliations
[1] Univ Politecn Valencia, Valencia, Spain
Source
2015 13TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR) | 2015
Keywords
WORD; DOCUMENTS; SEGMENTATION;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Keyword Spotting (KWS) has traditionally been considered under two distinct frameworks: Query-by-Example (QbE) and Query-by-String (QbS). In both cases, the user of the system wishes to find occurrences of a particular keyword in a collection of document images. The difference is that in QbE the keyword is given as an exemplar image, while in QbS it is given as a text string. In several works, the QbS scenario has been approached using QbE techniques, but the converse has not yet been studied in depth, despite the fact that QbS systems typically achieve higher accuracy. In this work, we present a very effective probabilistic approach to QbE KWS, based on highly accurate QbS KWS techniques that rely on models that must be trained from annotated data. To assess the effectiveness of this approach, we tackle the segmentation-free QbE task of the ICFHR-2014 Competition on Handwritten KWS. Our approach achieves a mean average precision (mAP) as high as 0.715, which improves by more than 70% on the best mAP achieved in this competition (0.419 under the same experimental conditions).
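The "more than 70%" figure in the abstract reads as a relative improvement of the reported mAP over the competition's best mAP. The short sketch below checks that arithmetic using only the two values quoted in the abstract. The function name `relative_improvement` and the variable names are illustrative assumptions, not part of the paper.

```python
def relative_improvement(baseline: float, new: float) -> float:
    """Return the relative gain of `new` over `baseline` as a fraction."""
    if baseline <= 0:
        raise ValueError("baseline must be positive")
    return (new - baseline) / baseline


# mAP values quoted in the abstract (names are hypothetical)
best_competition_map = 0.419
proposed_map = 0.715

gain = relative_improvement(best_competition_map, proposed_map)
print(f"Relative mAP improvement: {gain:.1%}")  # prints "Relative mAP improvement: 70.6%"
```

In absolute terms the gain is 0.296 mAP points. Dividing by the baseline of 0.419 gives roughly 70.6%, consistent with the abstract's claim.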
Pages: 741-745
Page count: 5