Accurate and efficient cross-domain visual matching leveraging multiple feature representations

被引：0

作者：

Gang Sun

Shuhui Wang

Xuehui Liu

Qingming Huang

Yanyun Chen

Enhua Wu

机构：

[1] Chinese Academy of Sciences,State Key Laboratory of Computer Science, Institute of Software

[2] University of Chinese Academy of Sciences,Key Laboratory of Intelligent Information Processing (CAS), Institute of Computing Technology

[3] Chinese Academy of Sciences,undefined

[4] University of Macau,undefined

来源：

The Visual Computer | 2013年 / 29卷

关键词：

Visual matching; Cross-domain; Multiple features; Hyperplane hashing;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Cross-domain visual matching aims at finding visually similar images across a wide range of visual domains, and has shown a practical impact on a number of applications. Unfortunately, the state-of-the-art approach, which estimates the relative importance of the single feature dimensions still suffers from low matching accuracy and high time cost. To this end, this paper proposes a novel cross-domain visual matching framework leveraging multiple feature representations. To integrate the discriminative power of multiple features, we develop a data-driven, query specific feature fusion model, which estimates the relative importance of the individual feature dimensions as well as the weight vector among multiple features simultaneously. Moreover, to alleviate the computational burden of an exhaustive subimage search, we design a speedup scheme, which employs hyperplane hashing for rapidly collecting the hard-negatives. Extensive experiments carried out on various matching tasks demonstrate that the proposed approach outperforms the state-of-the-art in both accuracy and efficiency.

引用

页码：565 / 575

页数：10

共 53 条

[1] Belongie S.(2002)Shape matching and object recognition using shape contexts IEEE Trans. Pattern Anal. Mach. Intell. 24 509-522
[2] Malik J.(2009)Sketch2Photo: Internet image montage ACM Trans. Graph. 28 124-1636
[3] Puzicha J.(2008)A perception-based color space for illumination-invariant image processing ACM Trans. Graph. 27 61-1285
[4] Chen T.(2011)Sketch-based image retrieval: benchmark and bag-of-features descriptors IEEE Trans. Vis. Comput. Graph. 17 1624-72
[5] Cheng M.M.(2007)Scene completion using millions of photographs ACM Trans. Graph. 26 4-110
[6] Tan P.(2011)CG2Real: improving the realism of computer generated images using a large collection of photographs IEEE Trans. Vis. Comput. Graph. 17 1273-987
[7] Shamir A.(2004)Learning the kernel matrix with semidefinite programming J. Mach. Learn. Res. 5 27-175
[8] Hu S.M.(2004)Distinctive image features from scale-invariant keypoints Int. J. Comput. Vis. 60 91-2521
[9] Chong H.Y.(2002)Multiresolution gray-scale and rotation invariant texture classification with local binary patterns IEEE Trans. Pattern Anal. Mach. Intell. 24 971-655
[10] Gortler S.J.(2001)Modeling the shape of the scene: a holistic representation of the spatial envelope Int. J. Comput. Vis. 42 145-1188

← 1 2 3 4 5 6 →