Multi-Class Object Localization by Combining Local Contextual Interactions

被引:26
|
作者
Galleguillos, Carolina [1 ]
McFee, Brian [1 ]
Belongie, Serge [1 ]
Lanckriet, Gert [2 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92103 USA
[2] Univ Calif San Diego, Elect & Comp Engn Dept, La Jolla, CA 92093 USA
来源
2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR) | 2010年
关键词
D O I
10.1109/CVPR.2010.5540223
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent work in object localization has shown that the use of contextual cues can greatly improve accuracy over models that use appearance features alone. Although many of these models have successfully explored different types of contextual sources, they only consider one type of contextual interaction (e. g., pixel, region or object level interactions), leaving open questions about the true potential contribution of context. Furthermore, contributions across object classes and over appearance features still remain unknown. In this work, we introduce a novel model for multi-class object localization that incorporates different levels of contextual interactions. We study contextual interactions at pixel, region and object level by using three different sources of context: semantic, boundary support and contextual neighborhoods. Our framework learns a single similarity metric from multiple kernels, combining pixel and region interactions with appearance features, and then uses a conditional random field to incorporate object level interactions. We perform experiments on two challenging image databases: MSRC and PASCAL VOC 2007. Experimental results show that our model outperforms current state-of-the-art contextual frameworks and reveals individual contributions for each contextual interaction level, as well as the importance of each type of feature in object localization.
引用
收藏
页码:113 / 120
页数:8
相关论文
共 50 条
  • [1] Contextual disamriguation for multi-class object detection
    Fan, XD
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 2873 - 2876
  • [2] Multi-class GAN for generating multi-class images in object recognition
    Wang, Bingxu
    Lan, Jinhui
    Gao, Jiangjiang
    JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 2022, 39 (05) : 897 - 906
  • [3] Combining Appearance and Range Based Information for Multi-class Generic Object Recognition
    Hegazy, Doaa
    Denzler, Joachim
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 741 - 748
  • [4] Scalable Multi-class Object Detection
    Razavi, Nima
    Gall, Juergen
    Van Gool, Luc
    2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 1505 - 1512
  • [5] Beyond Active Noun Tagging: Modeling Contextual Interactions for Multi-Class Active Learning
    Siddiquie, Behjat
    Gupta, Abhinav
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 2979 - 2986
  • [6] Contextual-Guided Bag-of-Visual-Words Model for Multi-class Object Categorization
    Mirza-Mohammadi, Mehdi
    Escalera, Sergio
    Radeva, Petia
    COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2009, 5702 : 748 - 756
  • [7] Binary codes for multi-class decision combining
    Windeatt, T
    Ghaderi, R
    SENSOR FUSION: ARCHITECTURES, ALGORITHMS, AND APPLICATIONS IV, 2000, 4051 : 23 - 34
  • [8] Research on Multi-class SVM Combining with LDA
    Chen Junjie
    Zhao Wei
    Li Haifang
    2008 INTERNATIONAL SYMPOSIUM ON INTELLIGENT INFORMATION TECHNOLOGY APPLICATION WORKSHOP: IITA 2008 WORKSHOPS, PROCEEDINGS, 2008, : 375 - 378
  • [9] Joint Learning for Multi-class Object Detection
    Fard, Hamidreza Odabai
    Chaouch, Mohamed
    Quoc-cuong Pham
    Vacavant, Antoine
    Chateau, Thierry
    PROCEEDINGS OF THE 2014 9TH INTERNATIONAL CONFERENCE ON COMPUTER VISION, THEORY AND APPLICATIONS (VISAPP 2014), VOL 2, 2014, : 104 - 112
  • [10] Layered Object Detection for Multi-Class Segmentation
    Yang, Yi
    Hallman, Sam
    Ramanan, Deva
    Fowlkes, Charless
    2010 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2010, : 3113 - 3120