LARGE-SCALE SEMI-SUPERVISED LEARNING BY APPROXIMATE LAPLACIAN EIGENMAPS, VLAD AND PYRAMIDS

被引：0

作者：

Mantziou, Eleni ^{[1
]}

Papadopoulos, Symeon ^{[1
]}

Kompatsiaris, Yiannis ^{[1
]}

机构：

[1] CERTH ITI, Thessaloniki, Greece

来源：

2013 14TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES (WIAMIS) | 2013年

关键词：

D O I：

暂无

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The paper builds upon recent advances in feature representation and dimensionality reduction to propose a semi-supervised image annotation framework that achieves state-of-theart accuracy at substantial gains in computation cost. More specifically, the framework combines the VLAD feature aggregation method with spatial pyramids and PCA for image representation, and proposes the use of Approximate Laplacian Eigenmaps (ALEs) for learning concepts in time linear to the number of images (labeled and unlabeled) available at training. A set of thorough experiments on MIR-Flickr and ImageCLEF 2012 ground truth annotations explore the impact of PCA and pyramids on the attained accuracy, and demonstrate that the proposed framework achieves virtually the same accuracy with a state-of-the-art manifold learning approach, while at the same time offering substantial speedup (in the order of x80), making possible the completion of a training/testing run for a set of 25k images in less than 3 minutes in a commodity workstation.

引用

页数：4

共 13 条

[1] [Anonymous], 2008, COMP VIS PATT REC 20
[2] Learning eigenfunctions links spectral embedding and kernel PCA
Bengio, Y
Delalleau, O
Le Roux, N
Paiement, JF
Vincent, P
Ouimet, M
[J]. NEURAL COMPUTATION, 2004, 16 (10) : 2197 - 2219
[3] Fergus Rob, 2009, Advances in Neural Information Processing Systems, V22, P522
[4] Multimodal semi-supervised learning for image classification
Guillaumin, Matthieu
Verbeek, Jakob
Schmid, Cordelia
[J]. 2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 902 - 909
[5] Huiskes M. J., P 2008 ACM MIR 08
[6] Jégou H, 2010, PROC CVPR IEEE, P3304, DOI 10.1109/CVPR.2010.5540039
[7] Lazebnik S., COMPUTER VISION PATT, V2, P2169
[8] Modeling the shape of the scene: A holistic representation of the spatial envelope
Oliva, A
Torralba, A
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2001, 42 (03) : 145 - 175
[9] Papadopoulos S., 2013, 19 INT C MMM JAN
[10] Perronnin F., 2010, ECCV

← 1 2 →