Holistic Context Models for Visual Recognition

被引:54
作者
Rasiwasia, Nikhil [1 ]
Vasconcelos, Nuno [1 ]
机构
[1] Univ Calif San Diego, Dept Elect & Comp Engn, Stat Visual Comp Lab, La Jolla, CA 92093 USA
基金
美国国家科学基金会;
关键词
Computer vision; scene classification; context; image retrieval; topic models; IMAGE FEATURES; SCENE; OBJECTS; KERNEL; WORDS;
D O I
10.1109/TPAMI.2011.175
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A novel framework to context modeling based on the probability of co-occurrence of objects and scenes is proposed. The modeling is quite simple, and builds upon the availability of robust appearance classifiers. Images are represented by their posterior probabilities with respect to a set of contextual models, built upon the bag-of-features image representation, through two layers of probabilistic modeling. The first layer represents the image in a semantic space, where each dimension encodes an appearance-based posterior probability with respect to a concept. Due to the inherent ambiguity of classifying image patches, this representation suffers from a certain amount of contextual noise. The second layer enables robust inference in the presence of this noise by modeling the distribution of each concept in the semantic space. A thorough and systematic experimental evaluation of the proposed context modeling is presented. It is shown that it captures the contextual "gist" of natural images. Scene classification experiments show that contextual classifiers outperform their appearance-based counterparts, irrespective of the precise choice and accuracy of the latter. The effectiveness of the proposed approach to context modeling is further demonstrated through a comparison to existing approaches on scene classification and image retrieval, on benchmark data sets. In all cases, the proposed approach achieves superior results.
引用
收藏
页码:902 / 917
页数:16
相关论文
共 62 条
[51]  
Rowley H.A., 1996, PROC CVPR IEEE, P203, DOI DOI 10.1109/CVPR.1996.517075
[52]  
Sivic J., 2005, IEEE International Conference on Computer Vision, V1, P65
[53]   Content-based image retrieval at the end of the early years [J].
Smeulders, AWM ;
Worring, M ;
Santini, S ;
Gupta, A ;
Jain, R .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2000, 22 (12) :1349-1380
[54]  
Smith JR, 2003, 2003 INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL II, PROCEEDINGS, P445
[55]   Contextual priming for object detection [J].
Torralba, A .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2003, 53 (02) :169-191
[56]  
Torresani L, 2010, LECT NOTES COMPUT SC, V6311, P776, DOI 10.1007/978-3-642-15549-9_56
[57]  
van Gemert Jan C., 2008, Computer Vision. Proceedings 10th European Conference on Computer Vision, ECCV 2008, P696, DOI 10.1007/978-3-540-88690-7_52
[58]  
Vasconcelos M., 2006, P IEEE CVPR, P1001
[59]   Minimum probability of error image retrieval [J].
Vasconcelos, N .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2004, 52 (08) :2322-2336
[60]   Learning Image Similarity from Flickr Groups Using Stochastic Intersection Kernel Machines [J].
Wang, Gang ;
Hoiem, Derek ;
Forsyth, David .
2009 IEEE 12TH INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2009, :428-435