Multi-Class Object Localization by Combining Local Contextual Interactions

Cited by: 26
Authors
Galleguillos, Carolina [1 ]
McFee, Brian [1 ]
Belongie, Serge [1 ]
Lanckriet, Gert [2 ]
Affiliations
[1] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92103 USA
[2] Univ Calif San Diego, Elect & Comp Engn Dept, La Jolla, CA 92093 USA
Source
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010
Keywords
DOI
10.1109/CVPR.2010.5540223
Chinese Library Classification
TP18 [Artificial Intelligence Theory];
Discipline Classification Codes
081104; 0812; 0835; 1405
Abstract
Recent work in object localization has shown that contextual cues can greatly improve accuracy over models that use appearance features alone. Although many of these models have successfully explored different sources of context, they consider only one type of contextual interaction (e.g., pixel-, region-, or object-level interactions), leaving open questions about the true potential contribution of context. Furthermore, how these contributions vary across object classes and relative to appearance features remains unknown. In this work, we introduce a novel model for multi-class object localization that incorporates different levels of contextual interactions. We study contextual interactions at the pixel, region, and object level using three different sources of context: semantic context, boundary support, and contextual neighborhoods. Our framework learns a single similarity metric from multiple kernels, combining pixel and region interactions with appearance features, and then uses a conditional random field to incorporate object-level interactions. We perform experiments on two challenging image databases: MSRC and PASCAL VOC 2007. Experimental results show that our model outperforms current state-of-the-art contextual frameworks and reveals the individual contribution of each contextual interaction level, as well as the importance of each type of feature in object localization.
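A minimal sketch of the kernel-combination idea described in the abstract, assuming precomputed base kernels (e.g., appearance, boundary support, contextual neighborhood similarities between regions). The helper names, the fixed weights, and the k-nearest-neighbor scoring step are illustrative assumptions, not the authors' actual formulation, which learns the metric from the kernels and adds a conditional random field over object-level interactions.

# Hypothetical sketch (not the authors' code): combine several precomputed
# kernels into one similarity measure with nonnegative weights, then score
# candidate regions by votes of their most similar labeled training regions.
import numpy as np

def combine_kernels(kernels, weights):
    """Weighted sum of base kernel matrices (weights assumed nonnegative)."""
    weights = np.asarray(weights, dtype=float)
    assert np.all(weights >= 0), "kernel weights should be nonnegative"
    return sum(w * K for w, K in zip(weights, kernels))

def knn_label_scores(K_train_test, train_labels, n_classes, k=5):
    """Score each test region by its k most similar training regions."""
    scores = np.zeros((K_train_test.shape[1], n_classes))
    for j in range(K_train_test.shape[1]):
        neighbors = np.argsort(-K_train_test[:, j])[:k]
        for i in neighbors:
            scores[j, train_labels[i]] += K_train_test[i, j]
    return scores

# Toy usage with random matrices standing in for real base kernels.
rng = np.random.default_rng(0)
n_train, n_test, n_classes = 40, 10, 3

def random_kernel(n, m):  # placeholder for appearance / context kernels
    X, Y = rng.normal(size=(n, 5)), rng.normal(size=(m, 5))
    return X @ Y.T

base = [random_kernel(n_train, n_test) for _ in range(3)]
K = combine_kernels(base, weights=[0.5, 0.3, 0.2])  # weights would be learned
labels = rng.integers(0, n_classes, size=n_train)
print(knn_label_scores(K, labels, n_classes).argmax(axis=1))

In the framework described by the paper, the kernel weights would be learned jointly with the similarity metric rather than fixed by hand, and a conditional random field would then refine the per-region labels using object-level interactions.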
Pages: 113 - 120
Page count: 8