Nonlinear Manifold Embedding on Keyword Spotting using t-SNE

Cited by: 5
Authors
Retsinas, George [1 ,2 ]
Stamatopoulos, Nikolaos [1 ]
Louloudis, Georgios [1 ]
Sfikas, Giorgos [1 ]
Gatos, Basilis [1 ]
Affiliations
[1] Natl Ctr Sci Res Demokritos, Computat Intelligence Lab, Inst Informat & Telecommun, GR-15310 Athens, Greece
[2] Natl Tech Univ Athens, Sch Elect & Comp Engn, GR-15773 Athens, Greece
Source
2017 14TH IAPR INTERNATIONAL CONFERENCE ON DOCUMENT ANALYSIS AND RECOGNITION (ICDAR), VOL 1 | 2017
Keywords
DIMENSIONALITY REDUCTION; EIGENMAPS;
DOI
10.1109/ICDAR.2017.86
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Nonlinear manifold embedding has attracted considerable attention due to its highly desired property of efficiently encoding local structure, i.e., intrinsic space properties, into a low-dimensional space. The benefit of such an approach is twofold: it leads to compact representations while addressing the often-encountered curse of dimensionality. The latter plays an important role in retrieval applications, such as keyword spotting, where a list of retrieved objects sorted with respect to a distance metric is required. In this work, we explore the efficiency of the popular manifold embedding method t-distributed Stochastic Neighbor Embedding (t-SNE) on the Query-by-Example keyword spotting task. The main contribution of this work is the extension of t-SNE to support out-of-sample (OOS) embedding, which is essential for mapping query images to the embedding space. The experimental results demonstrate a significant increase in keyword spotting performance when the word similarity is calculated on the embedding space.
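The retrieval step described in the abstract (map a query into the embedding space, then sort the indexed words by distance) can be illustrated with a minimal sketch. It is not the authors' method: the record does not describe how their OOS extension works, so the placement rule used here (an inverse-distance-weighted mean of the embeddings of the k nearest indexed words) is a stand-in assumption. The function names, the parameter `k`, and the toy data are all hypothetical, and the precomputed `train_embed` coordinates merely stand in for the output of a t-SNE run.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def embed_out_of_sample(query_feat, train_feats, train_embed, k=2):
    """Place a query in the embedding space (hypothetical rule, NOT the paper's).

    The query is positioned at the inverse-distance-weighted mean of the
    embeddings of its k nearest neighbours in the original feature space.
    """
    dists = [euclidean(query_feat, f) for f in train_feats]
    nearest = sorted(range(len(train_feats)), key=lambda i: dists[i])[:k]
    # Small constant avoids division by zero when the query equals a sample.
    weights = [1.0 / (dists[i] + 1e-12) for i in nearest]
    total = sum(weights)
    dim = len(train_embed[0])
    return [
        sum(w * train_embed[i][d] for w, i in zip(weights, nearest)) / total
        for d in range(dim)
    ]

def rank_by_distance(query_embed, train_embed):
    """Return indices of indexed words, closest to the query first."""
    return sorted(
        range(len(train_embed)),
        key=lambda i: euclidean(query_embed, train_embed[i]),
    )

# Toy data (assumed): 3-D word descriptors and their 2-D embedding,
# as if produced beforehand by an embedding method such as t-SNE.
train_feats = [[0, 0, 0], [0, 0, 1], [5, 5, 5], [5, 5, 6]]
train_embed = [[0, 0], [0, 1], [10, 10], [10, 11]]

query_feat = [0, 0, 0.2]
query_embed = embed_out_of_sample(query_feat, train_feats, train_embed, k=2)
ranking = rank_by_distance(query_embed, train_embed)
print(query_embed, ranking)
```

The sorted index list plays the role of the ranked retrieval list in Query-by-Example keyword spotting; a real system would replace the toy descriptors with word-image features and the placement rule with the OOS scheme from the paper.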
Pages: 487-492
Number of pages: 6