Mobile Visual Search Compression With Grassmann Manifold Embedding

被引：3

作者：

Zhang, Zhaobin ^{[1
]}

Li, Li ^{[1
]}

Li, Zhu ^{[1
]}

Li, Houqiang ^{[2
]}

机构：

[1] Univ Missouri, Dept Comp Sci & Elect Engn, Kansas City, MO 64110 USA

[2] Univ Sci & Technol China, Chinese Acad Sci, Key Lab Technol Geospatial Informat Proc & Applic, Hefei 230027, Anhui, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2019年 / 29卷 / 11期

关键词：

Transforms; Visualization; Manifolds; Training; Image coding; Feature extraction; Measurement; Mobile visual search; Grassmann manifold; subspaces embedding; CDVS; SIFT; compact descriptors; IMAGE FEATURES; RECOGNITION; EFFICIENT; TRACKING;

D O I：

10.1109/TCSVT.2018.2881177

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

With the increasing popularity of mobile phones and tablets, the explosive growth of query-by-capture applications calls for a compact representation of the query image feature. Compact descriptors for visual search (CDVS) is a recently released standard from the ISO/IEC moving pictures experts group, which achieves state-of-the-art performance in the context of image retrieval applications. However, they did not consider the matching characteristics in local space in a large-scale database, which might deteriorate the performance. In this paper, we propose a more compact representation with scale invariant feature transform (SIFT) descriptors for the visual query based on Grassmann manifold. Due to the drastic variations in image content, it is not sufficient to capture all the information using a single transform. To achieve more efficient representations, a SIFT manifold partition tree (SMPT) is initially constructed to divide the large dataset into small groups at multiple scales, which aims at capturing more discriminative information. Grassmann manifold is then applied to prune the SMPT and search for the most distinctive transforms. The experimental results demonstrate that the proposed framework achieves state-of-the-art performance on the standard benchmark CDVS dataset.

引用

页码：3356 / 3366

页数：11

共 47 条

[1]

Alahi A, 2012, PROC CVPR IEEE, P510, DOI 10.1109/CVPR.2012.6247715

[2]

[Anonymous], 2008, THESIS

[3]

[Anonymous], MPEG2012M25929 ISOIE

[4]

[Anonymous], IEEE T CIRCUITS SYST

[5] Large-Scale Video Retrieval Using Image Queries [J].

Araujo, Andre ;

Girod, Bernd .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (06) :1406-1420

[6]

Bay H, 2006, COMPUT VIS IMAGE UND, V110, P404, DOI DOI 10.1016/j.cviu.2007.09.014

[7] Shape matching and object recognition using shape contexts [J].

Belongie, S ;

Malik, J ;

Puzicha, J .

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (04) :509-522

[8] BRIEF: Binary Robust Independent Elementary Features [J].

Calonder, Michael ;

Lepetit, Vincent ;

Strecha, Christoph ;

Fua, Pascal .

COMPUTER VISION-ECCV 2010, PT IV, 2010, 6314 :778-792

[9]

Chandrasekhar V. R., 2011, P 2 ANN ACM C MULT S, P117, DOI DOI 10.1145/1943552.1943568

[10] Compressed Histogram of Gradients: A Low-Bitrate Descriptor [J].

Chandrasekhar, Vijay ;

Takacs, Gabriel ;

Chen, David M. ;

Tsai, Sam S. ;

Reznik, Yuriy ;

Grzeszczuk, Radek ;

Girod, Bernd .

INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 96 (03) :384-399

← 1 2 3 4 5 →