3D Visual Phrases for Landmark Recognition

Cited by: 0
Authors
Hao, Qiang [1]
Cai, Rui
Li, Zhiwei
Zhang, Lei
Pang, Yanwei [1]
Wu, Feng
Affiliations
[1] Tianjin Univ, Tianjin 300072, Peoples R China
Source
2012 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2012
Keywords
IMAGE; COLLECTIONS; WORLD;
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we study the problem of landmark recognition and propose to leverage 3D visual phrases to improve the performance. A 3D visual phrase is a triangular facet on the surface of a reconstructed 3D landmark model. In contrast to existing 2D visual phrases which are mainly based on co-occurrence statistics in 2D image planes, such 3D visual phrases explicitly characterize the spatial structure of a 3D object (landmark), and are highly robust to projective transformations due to viewpoint changes. We present an effective solution to discover, describe, and detect 3D visual phrases. The experiments on 10 landmarks have achieved promising results, which demonstrate that our approach provides a good balance between precision and recall of landmark recognition while reducing the dependence on post-verification to reject false positives.
Pages: 3594-3601
Page count: 8